|
vstart and thread migration
I appreciate you not punting this to "profiles" However, isn't this "mutually supported" effectively a profile constraint or consideration? Should we therefore include in the note that the specifics n
I appreciate you not punting this to "profiles" However, isn't this "mutually supported" effectively a profile constraint or consideration? Should we therefore include in the note that the specifics n
|
By
David Horner
· #251
·
|
|
laundry list of concerns following TG Minutes 2020/6/26
I trust this is not just noise. A laundry list of concerns Assess objectives and those met as we approach v1.0 of RVV Polymorphism has not made it into RVV V0.9 Tagging registers with a datatype is no
I trust this is not just noise. A laundry list of concerns Assess objectives and those met as we approach v1.0 of RVV Polymorphism has not made it into RVV V0.9 Tagging registers with a datatype is no
|
By
David Horner
· #249
·
|
|
mem model - RISC-V Vector Extension TG Minutes 2020/6/26
TL;DR; I make clarifications on meeting minutes. I propose we present 1) a relaxed RVV memory/process model, more relaxed than we believe current implementations require for optimal performance. Appli
TL;DR; I make clarifications on meeting minutes. I propose we present 1) a relaxed RVV memory/process model, more relaxed than we believe current implementations require for optimal performance. Appli
|
By
David Horner
· #248
·
|
|
Issue categorization - #460
No problem, happy you did. The very special nature and functionality of vsetvli justifies considering special encoding. All the following are valid considerations. None of which have been quantified.
No problem, happy you did. The very special nature and functionality of vsetvli justifies considering special encoding. All the following are valid considerations. None of which have been quantified.
|
By
David Horner
· #246
·
|
|
Issue categorization - #460
I think I understand how I confused the situation. Issue #458 introduced idea of using rd and rs1 values to encode more bits for vsetvli. I proposed that this become the only vsetvli format. Krste cou
I think I understand how I confused the situation. Issue #458 introduced idea of using rd and rs1 values to encode more bits for vsetvli. I proposed that this become the only vsetvli format. Krste cou
|
By
David Horner
· #243
·
|
|
Issue categorization - #460
It is a planned extension, that even if not implemented as currently proposed it will likely consume at least 2 bits in vtype. Do you envision a possibility that it will consume no vtype bits? The rea
It is a planned extension, that even if not implemented as currently proposed it will likely consume at least 2 bits in vtype. Do you envision a possibility that it will consume no vtype bits? The rea
|
By
David Horner
· #240
·
|
|
Issue categorization - #460
minor typos; substantial correction: also throughout the email vd should be rd vs1 should be rs1 And vs2 is completely bogus. Sorry I didn't catch this sooner.
minor typos; substantial correction: also throughout the email vd should be rd vs1 should be rs1 And vs2 is completely bogus. Sorry I didn't catch this sooner.
|
By
David Horner
· #238
·
|
|
Issue categorization - #460
I disagree that #460 should be deferred until after V1.0. Although I agree that the proposal itself can be implemented in a manner consistent with the current vsetvli definition, I disagree that the l
I disagree that #460 should be deferred until after V1.0. Although I agree that the proposal itself can be implemented in a manner consistent with the current vsetvli definition, I disagree that the l
|
By
David Horner
· #237
·
|
|
Issue categorization - object to closing #478
I don't agree that #478 should be closed for the reasons given. I do agree that another approach might be better to superseded that which was originally proposed. But I do not agree with the proposed
I don't agree that #478 should be closed for the reasons given. I do agree that another approach might be better to superseded that which was originally proposed. But I do not agree with the proposed
|
By
David Horner
· #236
·
|
|
On Vector Register Layout
The wide register being the destination, correct? Not sure I follow this, which two registers? If physical registers, then no. That design was the v0.8 method of vertical cycling through physical regi
The wide register being the destination, correct? Not sure I follow this, which two registers? If physical registers, then no. That design was the v0.8 method of vertical cycling through physical regi
|
By
David Horner
· #229
·
|
|
On Vector Register Layout
I will make a stab at even and odd layout for widening. 5) two versions of the widening ops are defined one for even and one odd. The registers are divided into even:odd pairs. Two versions of the wid
I will make a stab at even and odd layout for widening. 5) two versions of the widening ops are defined one for even and one odd. The registers are divided into even:odd pairs. Two versions of the wid
|
By
David Horner
· #227
·
|
|
On Vector Register Layout
These decisions are not made independently. E.g. Removing expanding loads led to fractional register mode. I believe there are other considerations that affect a definitive decision. 1) The even/odd a
These decisions are not made independently. E.g. Removing expanding loads led to fractional register mode. I believe there are other considerations that affect a definitive decision. 1) The even/odd a
|
By
David Horner
· #215
·
|
|
On Vector Register Layout
I agree that this should be default or base configuration. I believe the extended fractional layout is a 6th approach. Issue 465. Does it qualify as a known option? As with option 2 load and stores fo
I agree that this should be default or base configuration. I believe the extended fractional layout is a 6th approach. Issue 465. Does it qualify as a known option? As with option 2 load and stores fo
|
By
David Horner
· #199
·
|
|
Thoughts for Vector TG Meeting Friday June 12
There hasn't been any traffic on this feed about vector layout, and specifically about v0.9 SLEN size standards. So I will add my thoughts here with the hopes that I will not take excessive time durin
There hasn't been any traffic on this feed about vector layout, and specifically about v0.9 SLEN size standards. So I will add my thoughts here with the hopes that I will not take excessive time durin
|
By
David Horner
· #195
·
|
|
Vector Task Group minutes 2020/5/15
I'm not sure how to interpret this table (been puzzling over it). But I believe the LMUL>1 case for v0.9 is incorrect for the same reason as in the previous table. For v0.9 LMUL=1 and LMUL>1 have the
I'm not sure how to interpret this table (been puzzling over it). But I believe the LMUL>1 case for v0.9 is incorrect for the same reason as in the previous table. For v0.9 LMUL=1 and LMUL>1 have the
|
By
David Horner
· #188
·
|
|
Vector Task Group minutes 2020/5/15
The very significant correction is that for v09 SLEN=VLEN memory ordering IS preserved for LMUL>1 should be v08 v09 Grigoris LMUL>1 SLEN=VLEN memory ordering lost memory order retained memory ordering
The very significant correction is that for v09 SLEN=VLEN memory ordering IS preserved for LMUL>1 should be v08 v09 Grigoris LMUL>1 SLEN=VLEN memory ordering lost memory order retained memory ordering
|
By
David Horner
· #186
·
|
|
Vector Task Group minutes 2020/5/15 - CLSTR for in-register to in-memory alignment
I made this into its own thread as well. I think there is a parallel in the in-register/in-memory issue and memory consistency model/methods. The complexity of consistency models and methods is enormo
I made this into its own thread as well. I think there is a parallel in the in-register/in-memory issue and memory consistency model/methods. The complexity of consistency models and methods is enormo
|
By
David Horner
· #180
·
|
|
Vector Task Group minutes 2020/5/15 - V0.8 design with SLEN=8
I have some suggestions for the reasons for moving from v0.8 vertical striping to v0.9 horizontal SLEN (interleave) Under v0.8 A) when vl < VLEN/SEW*LMUL the top elements are not filled. This can lead
I have some suggestions for the reasons for moving from v0.8 vertical striping to v0.9 horizontal SLEN (interleave) Under v0.8 A) when vl < VLEN/SEW*LMUL the top elements are not filled. This can lead
|
By
David Horner
· #178
·
|
|
Vector Task Group minutes 2020/5/15 - precise layout not matter
I believe this can be weakened to required: select-able distribution patterns that are sufficiently compatible that they avoid fragmenting the software ecosystem.
I believe this can be weakened to required: select-able distribution patterns that are sufficiently compatible that they avoid fragmenting the software ecosystem.
|
By
David Horner
· #176
·
|
|
Vector Task Group minutes 2020/5/15
This is v0.8 with SLEN=8.
This is v0.8 with SLEN=8.
|
By
David Horner
· #171
·
|