The multiflow trace scheduling compiler

PG Lowney, SM Freudenberger, TJ Karzes… - … -Level Parallelism: A …, 1993 - Springer
The Multiflow compiler uses the trace scheduling algorithm to find and exploit instruction-
level parallelism beyond basic blocks. The compiler generates code for VLIW computers that …

A VLIW architecture for a trace scheduling compiler

RP Colwell, RP Nix, JJ O'Donnell… - ACM SIGARCH …, 1987 - dl.acm.org
Very Long Instruction Word (VLIW) architectures were promised to deliver far more than the
factor of two or three that current architectures achieve from overlapped execution. Using a …

A VLIW architecture for a trace scheduling compiler

RP Colwell, RP Nix, JJ O'Donnell… - IEEE Transactions …, 1988 - ieeexplore.ieee.org
A VLIW (very long instruction word) architecture machine called the TRACE has been built
along with its companion Trace Scheduling compacting compiler. This machine has three …

Transport-triggering vs. operation-triggering

J Hoogerbrugge, H Corporaal - International Conference on Compiler …, 1994 - Springer
Transport-triggered architectures are a new class of architectures that provide more
scheduling freedom and have unique compiler optimizations. This paper reports …

Factoring: A method for scheduling parallel loops

SF Hummel, E Schonberg, LE Flynn - Communications of the ACM, 1992 - dl.acm.org
A Method for Scheduling Parallel L oops~ lw advantage of, capability of rallel machines,
application programs must contain sufficient parallelism, and this parallelism must be …

[PDF][PDF] An efficient resource-constrained global scheduling technique for superscalar and VLIW processors

SM Moon, K Ebcioğlu - Acm Sigmicro Newsletter, 1992 - dl.acm.org
We describe a new algorithm for parallelization of sequential code that eliminates anti and
output dependence8 by renaming registers on an as-needed basis during scheduling. A …

Run-time parallelization and scheduling of loops

JH Saltz, R Mirchandaney, D Baxter - 1988 - ntrs.nasa.gov
The class of problems that can be effectively compiled by parallelizing compilers is
discussed. This is accomplished with the doconsider construct which would allow these …

Exploiting the parallelism available in loops

DJ Lilja - Computer, 1994 - ieeexplore.ieee.org
Because a loop's body often executes many times, loops provide a rich opportunity for
exploiting parallelism. This article explains scheduling techniques and compares results on …

Trace processors

E Rotenberg, Q Jacobson, Y Sazeides… - Proceedings of 30th …, 1997 - ieeexplore.ieee.org
Traces are dynamic instruction sequences constructed and cached by hardware. A
microarchitecture organized around traces is presented as a means for efficiently executing …

Resource requirements of dataflow programs

DE Culler, Arvind - ACM SIGARCH Computer Architecture News, 1988 - dl.acm.org
Parallel execution of programs requires more resources and more complex resource
management than sequential execution. If concurrent tasks can be spawned dynamically …