Uzi Vishkin
Professor
5216 Iribe Center
(301) 405-6763
Education:
D.Sc., Technion - Israel Institute of Technology (Computer Science)
Special Awards/Honors:
ACM Fellow
Biography:
Uzi Vishkin is a professor of electrical and computer engineering with an appointment in the University of Maryland Institute for Advanced Computer Studies.
His research focuses on aligning the theory of parallel algorithms, parallel programming, and many-core hardware into a cohesive computing stack for future processors. Vishkin has developed a unique approach that enables single-threaded programming for parallelism to achieve competitive many-core performance, directly harnessing the theory of parallel algorithms.
Go here to view Vishkin's academic publications on Google Scholar.
Publications
2011
2011. Toolchain for Programming, Simulating and Studying the XMT Many-Core Architecture. Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on. :1282-1291.
2011. A Low-Overhead Asynchronous Interconnection Network for GALS Chip Multiprocessors. Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on. 30(4):494-507.
2011. Using simple abstraction to reinvent computing for parallelism. Commun. ACM. 54(1):75-85.
2011. Brief announcement: better speedups for parallel max-flow. Proceedings of the 23rd ACM symposium on Parallelism in algorithms and architectures. :131-134.
2010
2010. Resource-Aware Compiler Prefetching for Many-Cores. Parallel and Distributed Computing (ISPDC), 2010 Ninth International Symposium on. :133-140.
2010. Lazy binary-splitting: a run-time adaptive work-stealing scheduler. Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. :179-190.
2010. Is teaching parallel algorithmic thinking to high school students possible?: one teacher's experience Proceedings of the 41st ACM technical symposium on Computer science education. :290-294.
2010. General-purpose vs. gpu: Comparison of many-cores on irregular workloads. Proceedings of the Second Usenix Workshop on Hot Topics in Parallelism. :14-15.
2009
2009. Brief announcement: performance potential of an easy-to-program PRAM-on-chip prototype versus state-of-the-art processor. Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures. :163-165.
2009. Algorithmic approach to designing an easy-to-program system: Can it lead to a HW-enhanced programmer's workflow add-on? Computer Design, 2009. ICCD 2009. IEEE International Conference on. :60-63.
2009. Mesh-of-Trees and Alternative Interconnection Networks for Single-Chip Parallelism. Very Large Scale Integration (VLSI) Systems, IEEE Transactions on. 17(10):1419-1432.
2009. Spawn-join instruction set architecture for providing explicit multithreading. 10/236,934(7523293)
2008
2008. An Immediate Concurrent Execution (ICE) Abstraction Proposal for Many-Cores. Computer Science Research Works.
2008. Fpga-based prototype of a pram-on-chip processor. Proceedings of the 5th conference on Computing frontiers. :55-66.
2008. Toward Realizing a PRAM-on-a-Chip Vision. Lecture Notes in Computer Science. 4854:5-6.
2008. An area-efficient high-throughput hybrid interconnection network for single-chip parallel processing. Proceedings of the 45th annual Design Automation Conference. :435-440.
2008. A pilot study to compare programming effort for two parallel programming models. Journal of Systems and Software. 81(11):1920-1930.
2007
2007. Layout-Accurate Design and Implementation of a High-Throughput Interconnection Network for Single-Chip Parallel Processing. High-Performance Interconnects, 2007. HOTI 2007. 15th Annual IEEE Symposium on. :21-28.
2007. PRAM-on-chip: first commitment to silicon. Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures. :301-302.
2007. Models for advancing PRAM and other algorithms into parallel programs for a PRAM-On-Chip platform. IN HANDBOOK OF PARALLEL COMPUTING: MODELS, ALGORITHMS AND APPLICATIONS, EDITORS.
2007. Plasmonics and the parallel programming problem. Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series. 6477:19-19.
2007. Towards Realizing a PRAM-On-Chip Vision. Workshop on Highly Parallel Processing on a Chip (HPPC). 28
2007. Thinking in parallel: Some basic data-parallel algorithms and techniques. UMIACS, University of Maryland, College Park. 1993
2007. Electron beam and optical proximity effect reduction for nanolithography: New results. Journal of Vacuum Science & Technology B. 25(6):2288-2294.
2006
2006. A bootstrapping model for directional wireless networks. Communications Letters, IEEE. 10(12):840-842.
2006. Bootstrapping free-space optical networks. Selected Areas in Communications, IEEE Journal on. 24(12):13-22.
2006. Programmer's Manual for XMTC Language, XMTC Compiler and XMT Simulator. Technical Reports from UMIACS, UMIACS-TR-2005-45.
2006. Case study of gate-level logic simulation on an extremely fine-grained chip multiprocessor. Journal of Embedded Computing. 2(2):181-190.
2004
2004. OPTICAL INTERCONNECT STRUCTURE IN A COMPUTER SYSTEM. (WO/2004/083904)
2004. Circuit architecture for reduced-synchrony on-chip interconnect. 10/166,008(6768336)
2004. Arbitrate-and-move primitives for high throughput on-chip interconnection networks. Circuits and Systems, 2004. ISCAS '04. Proceedings of the 2004 International Symposium on. 2:II-441-4Vol.2-II-441-4Vol.2.
2004. Reconfigurable optical wireless sensor networks. Proceedings of SPIE. 5237(1):136-146.
2004. PRAM-On-Chip: A Quest for Not-So-Obvious Non-obviousness. Mathematical Foundations of Computer Science 2004Mathematical Foundations of Computer Science 2004. 3153:104-105.
2004. Bending light for multi-chip virtual PRAMs? Proc. 3rd Workshop on Non-Slicon Computation, held in conjunction with the 31st International Symposium on Computer Architecture (ISCA 2004). :19-23.
2003
2003. Prefix sums and an application thereof. : 09/224,104(6542918)
2003. Towards a first vertical prototyping of an extremely fine-grained parallel programming approach. Theory of Computing Systems. 36(5):521-552.
2003. Deterministic Resource Discovery in Distributed Networks. Theory of Computing Systems. 36(5):479-495.
2002
2002. Two techniques for reconciling algorithm parallelism with memory constraints. Proceedings of the fourteenth annual ACM symposium on Parallel algorithms and architectures. :95-98.
2001
2001. Evaluating the XMT parallel programming model. High-Level Parallel Programming Models and Supportive Environments. :95-108.
2000
2000. Communication complexity of document exchange. Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms. :197-206.
2000. Experiments with list ranking for explicit multi-threaded (XMT) instruction parallelism. J. Exp. Algorithmics. 5
2000. A no-busy-wait balanced tree parallel algorithmic paradigm. Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures. :147-155.
2000. PRAM-On-Chip Vision. String Processing and Information Retrieval, 2000. SPIRE 2000. Proceedings. Seventh International Symposium on. :260-260.
2000. Evaluating multi-threading in the prototype XMT environment. Proc. 4th Workshop on Multi-Threaded Execution, Architecture and Compliation (MTEAC2000).
1999
1999. Trade-offs between communication throughput and parallel time. Journal of Complexity. 15(1):148-166.
1999. Experiments with list ranking for Explicit Multi-Threaded (XMT) instruction parallelism. Algorithm Engineering. :43-59.
1999. XMT-M: A Scalable Decentralized Processor. UMIACS-TR-99-55
1998
1998. Explicit Multi-Threading (XMT) bridging models for instruction parallelism (extended abstract). Proceedings of the tenth annual ACM symposium on Parallel algorithms and architectures. :140-151.
1998. Looking to Parallel Algorithms for ILP and Decentralization. CS-TR-3921
1997
1997. From algorithm parallelism to instruction-level parallelism: An encode-decode chain using prefix-sum. Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures. :260-271.
1996
1996. Efficient approximate and dynamic matching of patterns using a labeling paradigm. Foundations of Computer Science, 1996. Proceedings., 37th Annual Symposium on. :320-328.
1996. Can parallel algorithms enhance serial implementation? Communications of the ACM. 39(9):88-91.
1996. A fast parallel algorithm for finding the convex hull of a sorted point set. International Journal of Computational Geometry and Applications. 6:231-242.
1996. Sorting strings and constructing digital search trees in parallel. Theoretical Computer Science. 154(2):225-245.
1995
1995. On a technique for parsing a string. Combinatorial Pattern Matching. :386-386.
1995. On a Technique for Parsing a String (Invited Lecture). Lecture Notes in Computer Science. 937:386-386.
1995. Data compression using locally consistent parsing. TechnicM report, University of Maryland Department of Computer Science.
1995. Parallel algorithms for database operations and a database operation for parallel algorithms. Parallel Processing Symposium, 1995. Proceedings., 9th International. :173-179.
1995. A note on reducing parallel model simulations to integer sorting. Parallel Processing Symposium, 1995. Proceedings., 9th International. :208-212.
1995. Almost fully-parallel parentheses matching. Discrete applied mathematics. 57(1):11-28.
1994
1994. Biconnectivity approximations and graph carvings. Journal of the ACM (JACM). 41(2):214-235.
1994. On a parallel-algorithms method for string matching problems. Lecture Notes in Computer Science. 778:22-32.
1994. On the parallel complexity of digraph reachability. Information Processing Letters. 52(5):239-241.
1994. Optimal randomized parallel algorithms for computing the row maxima of a totally monotone matrix. Proceedings of the fifth annual ACM-SIAM symposium on Discrete algorithms. :613-621.
1994. Trade-offs between communication throughput and parallel time. Proceedings of the twenty-sixth annual ACM symposium on Theory of computing. :372-381.