Name		Name	Last commit message	Last commit date
parent directory ..
solution		solution
Makefile		Makefile
README.md		README.md
dotp.c		dotp.c
main.c		main.c
testset.cfg		testset.cfg

README.md

Dot Product C program

This application contain a simple dot product function between 2 byte-vectors of 1000 elements.

The innermost kernel is written in assembler to give you the whole control of the instructions executed by the core.

For a quick guide to use assembler in GCC have a look at https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html

Compile the application (make clean all)
Generate the assember (make dis > dis.s)
Run it in gui mode (make conf gui=1 run)

Identify the dot product parts in the assembler. After executing the code, check both the trace file in ./build/pulpissimo/trace_core_1f_0.log to see the instructions executed by the core. Can you find the dot product part?

How many cycles do you exept from such function? Why?

Loop Unrolling

Improve the application using the loop unrolling technique we have seen at the lecture. Complete the provided function and repeat the step above.

How many cycles do you exept from such function?
Analyze the trace around your function. Where is the stall?
Introduce the c.nop instruction to align the address of the first instruction of the HWloop to get best performance.

Use of the SIMD instructions

Rewrite the function to implement the dot product using the sum of dot product function that you find the PULP extensions of the RI5CY core (in its user_manual.doc)

Complete the function.
How many cycles do you exept from such function?
Fix the HWloop first instruction address if needed.
What is the speed up?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dot_product

dot_product

README.md

Dot Product C program

Loop Unrolling

Use of the SIMD instructions

Files

dot_product

Directory actions

More options

Directory actions

More options

Latest commit

History

dot_product

Folders and files

parent directory

README.md

Dot Product C program

Loop Unrolling

Use of the SIMD instructions