Re: Vector TG meeting minutes 2020/4/03
Thang Tran
In scalar code, there is always signed/zero extension for the data and alignment. I do not see a different with vector load/store. If alignment is needed, not much additional cost for signed/zero extension, and an extra pipeline stage is added.
Depended on how the load is pipelined, the load-to-use penalty may be none. So, widening is much preferred in our design.
Thanks, Thang
From: tech-vector-ext@... [mailto:tech-vector-ext@...]
On Behalf Of Alex Solomatnikov
Sent: Saturday, April 4, 2020 7:09 PM To: Thang Tran <thang@...> Cc: Nick Knight <nick.knight@...>; Krste Asanovic <krste@...>; tech-vector-ext@... Subject: Re: [RISC-V] [tech-vector-ext] Vector TG meeting minutes 2020/4/03
Bob Dreyer said he would share an example code.
Do you really have a 2x or 4x wider write port to the vector register file to make vlb and the like work at full memory bandwidth?
If yes, what is the impact on PPA, i.e. clock frequency, area, power?
If not, then extra widening instruction would not matter because vlb itself is the bottleneck.
Alex
On Sat, Apr 4, 2020 at 5:19 PM Thang Tran <thang@...> wrote:
|
|