The multiflow trace scheduling compiler
PG Lowney, SM Freudenberger, TJ Karzes… - … -Level Parallelism: A …, 1993 - Springer
The Multiflow compiler uses the trace scheduling algorithm to find and exploit instruction-
level parallelism beyond basic blocks. The compiler generates code for VLIW computers that …
level parallelism beyond basic blocks. The compiler generates code for VLIW computers that …
A VLIW architecture for a trace scheduling compiler
RP Colwell, RP Nix, JJ O'Donnell… - ACM SIGARCH …, 1987 - dl.acm.org
Very Long Instruction Word (VLIW) architectures were promised to deliver far more than the
factor of two or three that current architectures achieve from overlapped execution. Using a …
factor of two or three that current architectures achieve from overlapped execution. Using a …
A VLIW architecture for a trace scheduling compiler
RP Colwell, RP Nix, JJ O'Donnell… - IEEE Transactions …, 1988 - ieeexplore.ieee.org
A VLIW (very long instruction word) architecture machine called the TRACE has been built
along with its companion Trace Scheduling compacting compiler. This machine has three …
along with its companion Trace Scheduling compacting compiler. This machine has three …
Transport-triggering vs. operation-triggering
J Hoogerbrugge, H Corporaal - International Conference on Compiler …, 1994 - Springer
Transport-triggered architectures are a new class of architectures that provide more
scheduling freedom and have unique compiler optimizations. This paper reports …
scheduling freedom and have unique compiler optimizations. This paper reports …
Factoring: A method for scheduling parallel loops
SF Hummel, E Schonberg, LE Flynn - Communications of the ACM, 1992 - dl.acm.org
A Method for Scheduling Parallel L oops~ lw advantage of, capability of rallel machines,
application programs must contain sufficient parallelism, and this parallelism must be …
application programs must contain sufficient parallelism, and this parallelism must be …
[PDF][PDF] An efficient resource-constrained global scheduling technique for superscalar and VLIW processors
SM Moon, K Ebcioğlu - Acm Sigmicro Newsletter, 1992 - dl.acm.org
We describe a new algorithm for parallelization of sequential code that eliminates anti and
output dependence8 by renaming registers on an as-needed basis during scheduling. A …
output dependence8 by renaming registers on an as-needed basis during scheduling. A …
Run-time parallelization and scheduling of loops
JH Saltz, R Mirchandaney, D Baxter - 1988 - ntrs.nasa.gov
The class of problems that can be effectively compiled by parallelizing compilers is
discussed. This is accomplished with the doconsider construct which would allow these …
discussed. This is accomplished with the doconsider construct which would allow these …
Exploiting the parallelism available in loops
DJ Lilja - Computer, 1994 - ieeexplore.ieee.org
Because a loop's body often executes many times, loops provide a rich opportunity for
exploiting parallelism. This article explains scheduling techniques and compares results on …
exploiting parallelism. This article explains scheduling techniques and compares results on …
Trace processors
E Rotenberg, Q Jacobson, Y Sazeides… - Proceedings of 30th …, 1997 - ieeexplore.ieee.org
Traces are dynamic instruction sequences constructed and cached by hardware. A
microarchitecture organized around traces is presented as a means for efficiently executing …
microarchitecture organized around traces is presented as a means for efficiently executing …
Resource requirements of dataflow programs
Parallel execution of programs requires more resources and more complex resource
management than sequential execution. If concurrent tasks can be spawned dynamically …
management than sequential execution. If concurrent tasks can be spawned dynamically …