|
Re: MLEN=1 update
I applaud the ordinal nature of the mask structure that is independent of SEW and LMUL.
The problem that I have is the mismatch between units of measure, which is bytes in element lengths and bits in
I applaud the ordinal nature of the mask structure that is independent of SEW and LMUL.
The problem that I have is the mismatch between units of measure, which is bytes in element lengths and bits in
|
By
David Horner
·
#147
·
|
|
Vector TG group meeting tomorrow
We’ll be meeting tomorrow morning. Meeting details on member calendar.
Krste (on iPhone, forgive terseness)
We’ll be meeting tomorrow morning. Meeting details on member calendar.
Krste (on iPhone, forgive terseness)
|
By
Krste Asanovic
·
#146
·
|
|
Re: MLEN=1 update
Right, now there’s just a single blanket statement that no vector instruction can overwrite the mask register
Krste (on iPhone, forgive terseness)
Right, now there’s just a single blanket statement that no vector instruction can overwrite the mask register
Krste (on iPhone, forgive terseness)
|
By
Krste Asanovic
·
#145
·
|
|
Re: MLEN=1 update
Never mind, I misread the git commit log. It appears that no instructions can overwrite the mask register. Sorry.
Never mind, I misread the git commit log. It appears that no instructions can overwrite the mask register. Sorry.
|
By
Nick Knight
·
#144
·
|
|
Re: MLEN=1 update
Hi Krste,
It seems that many (all?) masked instructions are now allowed to overwrite the mask register. This new scheme would indeed be simpler --- is this true? Can you give any high-level insight
Hi Krste,
It seems that many (all?) masked instructions are now allowed to overwrite the mask register. This new scheme would indeed be simpler --- is this true? Can you give any high-level insight
|
By
Nick Knight
·
#143
·
|
|
MLEN=1 update
I've made a major update to mask encoding, pushed to repo.
The earlier change to support fractional LMUL effectively "broke" the
earlier mask encoding. The new scheme is simpler, but is
I've made a major update to mask encoding, pushed to repo.
The earlier change to support fractional LMUL effectively "broke" the
earlier mask encoding. The new scheme is simpler, but is
|
By
Krste Asanovic
·
#142
·
|
|
Re: Sparse Matrix-Vector Multiply (again) and Bit-Vector Compression
PS: Instead of loading the RHS entries with an indexed load, we could also use a unit-stride load (possibly masked), followed by a vcompress. This didn't seem like a clear win, and required additional
PS: Instead of loading the RHS entries with an indexed load, we could also use a unit-stride load (possibly masked), followed by a vcompress. This didn't seem like a clear win, and required additional
|
By
Nick Knight
·
#141
·
|
|
Re: Sparse Matrix-Vector Multiply (again) and Bit-Vector Compression
Hi Nagendra,
The goal of the Zvediv extension is to add some support for sub-byte datatypes. So my hope is that applications that use bitvectors to encode nonzero structure will be supported via
Hi Nagendra,
The goal of the Zvediv extension is to add some support for sub-byte datatypes. So my hope is that applications that use bitvectors to encode nonzero structure will be supported via
|
By
Nick Knight
·
#140
·
|
|
Re: [RISC-V][tech-vector-ext] Intrinsics for vector programming in C.
sorry I wasn’t clear.
the data types, eg vint64m1_t, are for data elements.
all of the example API definitions also use these same element data types as arguments, but they should be using a data
sorry I wasn’t clear.
the data types, eg vint64m1_t, are for data elements.
all of the example API definitions also use these same element data types as arguments, but they should be using a data
|
By
Guy Lemieux
·
#139
·
|
|
Re: [RISC-V][tech-vector-ext] Intrinsics for vector programming in C.
On Thu, May 7, 2020 at 07:28 PM, Kai Wang wrote:
On Fri, May 8, 2020 at 1:47 AM Guy Lemieux <glemieux@...> wrote:
well done!
can you perhaps explain how the vector operands are to be
On Thu, May 7, 2020 at 07:28 PM, Kai Wang wrote:
On Fri, May 8, 2020 at 1:47 AM Guy Lemieux <glemieux@...> wrote:
well done!
can you perhaps explain how the vector operands are to be
|
By
yihsiu.hsu@...
·
#138
·
|
|
Re: [RISC-V][tech-vector-ext] Intrinsics for vector programming in C.
How many registers will be used for a type depends on the LMUL for the type when LMUL >= 1. For example, vint32m1_t will occupy one register and vint16m2_t will occupy two registers and the register
How many registers will be used for a type depends on the LMUL for the type when LMUL >= 1. For example, vint32m1_t will occupy one register and vint16m2_t will occupy two registers and the register
|
By
Kai Wang
·
#137
·
|
|
Sparse Matrix-Vector Multiply (again) and Bit-Vector Compression
I am now investigating how to efficiently implement sparse matrix X (dense) vector multiplications (spMV) using RISCV vectors using bit-vector format of compressing the sparse matrix. The inner loop
I am now investigating how to efficiently implement sparse matrix X (dense) vector multiplications (spMV) using RISCV vectors using bit-vector format of compressing the sparse matrix. The inner loop
|
By
Nagendra Gulur
·
#136
·
|
|
Re: [RISC-V][tech-vector-ext] Intrinsics for vector programming in C.
well done!
can you perhaps explain how the vector operands are to be used/allocated?
ie, it appears you wish to use the type
system to name vector registers, but there is no limit on these so there
well done!
can you perhaps explain how the vector operands are to be used/allocated?
ie, it appears you wish to use the type
system to name vector registers, but there is no limit on these so there
|
By
Guy Lemieux
·
#135
·
|
|
[RISC-V][tech-vector-ext] Intrinsics for vector programming in C.
Hi,
We, EPI, SiPearl, and SiFive, have come out with a RFC for vector intrinsics. Although there are still some issues under discussion, we think it is time to publish the document to collect more
Hi,
We, EPI, SiPearl, and SiFive, have come out with a RFC for vector intrinsics. Although there are still some issues under discussion, we think it is time to publish the document to collect more
|
By
Kai Wang
·
#134
·
|
|
Re: Vector extension TG meeting minutes 2020/5/1
I pushed an update to spec that includes the sign/zero-extension
instructions so now can hopefuly answer this question more concretely.
The point of fractional LMUL is to reduce register usage, so
I pushed an update to spec that includes the sign/zero-extension
instructions so now can hopefuly answer this question more concretely.
The point of fractional LMUL is to reduce register usage, so
|
By
Krste Asanovic
·
#133
·
|
|
Re: Vector extension TG meeting minutes 2020/5/1
Hi Krste,
On the SLEN=VLEN discussion, I agree with you that using a cast
instruction might well make for a de facto split in the code base. But
I don't think the solution is to force a de jure
Hi Krste,
On the SLEN=VLEN discussion, I agree with you that using a cast
instruction might well make for a de facto split in the code base. But
I don't think the solution is to force a de jure
|
By
Bill Huffman
·
#132
·
|
|
Re: Vector extension TG meeting minutes 2020/5/1
BTW, even if overlapping of source/destination registers is not an issue, would the use of fractional LMUL to load data, would it still be 2 extra instructions in the loop to change from fractional
BTW, even if overlapping of source/destination registers is not an issue, would the use of fractional LMUL to load data, would it still be 2 extra instructions in the loop to change from fractional
|
By
Thang Tran
·
#131
·
|
|
Re: Vector extension TG meeting minutes 2020/5/1
Hi All,
The issue that I brought up about the extra register that is needed to load data with LMUL=8 and then signed/unsigned extension that would need an extra register. I am still puzzled/confused
Hi All,
The issue that I brought up about the extra register that is needed to load data with LMUL=8 and then signed/unsigned extension that would need an extra register. I am still puzzled/confused
|
By
Thang Tran
·
#130
·
|
|
Re: Vector extension TG meeting minutes 2020/5/1
Note: To help understand the #423 as the meeting introduction only mentioned transient SEW and LMUL overrides.
The proposal is generalized and includes extending modifiers beyond what are mapped by
Note: To help understand the #423 as the meeting introduction only mentioned transient SEW and LMUL overrides.
The proposal is generalized and includes extending modifiers beyond what are mapped by
|
By
David Horner
·
#129
·
|
|
Vector extension TG meeting minutes 2020/5/1
Date: 2020/5/1
Task Group: Vector Extension
Chair: Krste Asanovic
Co-Chair: Roger Espasa
Number of Attendees: ~20
Current issues on github: https://github.com/riscv/riscv-v-spec
Issues
Date: 2020/5/1
Task Group: Vector Extension
Chair: Krste Asanovic
Co-Chair: Roger Espasa
Number of Attendees: ~20
Current issues on github: https://github.com/riscv/riscv-v-spec
Issues
|
By
Krste Asanovic
·
#128
·
|