Member-only story

How to Reduce II in HLS: Part 4

1 min readMay 24, 2021

This week’s problem is the traditional matrix-vector multiplication kernel used in several applications such as machine learning and image processing.

This figure shows the software-oriented code that describes the kernel.

The code consists of a two-level loop nest that reads the matrix A and vector

and generates the output elements located in

If we assume n is 4096 and m is 2048, then this figure shows the synthesis report after synthesising the code with Vitis 2020.2.

As can be seen, the first loop is not pipelined, but the inner loop is pipelined with the initiation interval of 1.

After executing the code on Ultra96v2 using the Vitis-2020.2 software tool, the execution time would be 683.491 ms.

Now the question is, “How can we improve the performance?”

Please follow the solution in the video.

Originally published at http://highlevel-synthesis.com on May 24, 2021.

Create an account to read the full story.

The author made this story available to Medium members only.
If you’re new to Medium, create a new account to read this story on us.

Continue in app

Or, continue in mobile web

Already have an account? Sign in

Written by Mohammad Hosseinabady

48 Followers

40 Following

Designing digital systems and accelerating functions with HLS for FPGAs are fun.

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

Recommended from Medium

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jessica Stillman

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

Oct 30, 2024

25K

731

Solving Max-Cut Problems with D-Wave Quantum Annealing

Naoki

Solving Max-Cut Problems with D-Wave Quantum Annealing

Split the Network for Maximum Gain!

Nov 24, 2024

Lists

Staff picks

826 stories1649 saves

Stories to Help You Level-Up at Work

19 stories948 saves

Self-Improvement 101

20 stories3355 saves

Productivity 101

20 stories2818 saves

The 5 paid subscriptions I actually use in 2025 as a Staff Software Engineer

Level Up Coding

Jacob Bennett

The 5 paid subscriptions I actually use in 2025 as a Staff Software Engineer

Tools I use that are cheaper than Netflix

Jan 7

10.6K

260

Predict

Will Lockett

This Is How Tesla Will Die

The vultures are circling the tech giant.

5d ago

5.5K

134

How I Am Using a Lifetime 100% Free Server

Harendra

How I Am Using a Lifetime 100% Free Server

Get a server with 24 GB RAM + 4 CPU + 200 GB Storage + Always Free

Oct 26, 2024

9.4K

170

Ibrahim Bin Mansur

SLAM and Nav2 for Custom Robots in ROS2

Continuing from my last two articles where I made the custom robot models and added different plugins for differential drive, camera and…

Jan 30

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams