1996 | Kunle Olukotun, Basem A. Nayfeh, Lance Hammond, Ken Wilson, and Kunyung Chang
The paper "The Case for a Single-Chip Multiprocessor" by Kunle Olukotun, Basem A. Nayfeh, Lance Hammond, Ken Wilson, and Kunyung Chang discusses the advantages of implementing a single-chip multiprocessor architecture in advanced integrated circuit technologies. The authors argue that while superscalar processors offer higher clock rates and instruction issue widths, they are limited by the complexity and area requirements of their dynamic scheduling and register renaming mechanisms. In contrast, a single-chip multiprocessor, composed of simpler processors, can achieve higher clock rates and better performance for applications with large amounts of parallelism at both fine and coarse-grained levels. The paper compares the performance of a six-issue superscalar processor and a 4 × two-issue multiprocessor, finding that the multiprocessor outperforms the superscalar by 50-100% on applications with coarse-grained parallelism and multiprogramming workloads. The authors conclude that the multiprocessor architecture is more efficient in terms of silicon area and can better exploit the capabilities of advanced integrated circuit technologies.The paper "The Case for a Single-Chip Multiprocessor" by Kunle Olukotun, Basem A. Nayfeh, Lance Hammond, Ken Wilson, and Kunyung Chang discusses the advantages of implementing a single-chip multiprocessor architecture in advanced integrated circuit technologies. The authors argue that while superscalar processors offer higher clock rates and instruction issue widths, they are limited by the complexity and area requirements of their dynamic scheduling and register renaming mechanisms. In contrast, a single-chip multiprocessor, composed of simpler processors, can achieve higher clock rates and better performance for applications with large amounts of parallelism at both fine and coarse-grained levels. The paper compares the performance of a six-issue superscalar processor and a 4 × two-issue multiprocessor, finding that the multiprocessor outperforms the superscalar by 50-100% on applications with coarse-grained parallelism and multiprogramming workloads. The authors conclude that the multiprocessor architecture is more efficient in terms of silicon area and can better exploit the capabilities of advanced integrated circuit technologies.