
Re: Smaller embedded version of the Vector extension
If there were no cost, then supporting VLEN=64 in the general apps
processor profile would be a good thing to do. But not allowing
standard software to assume VLEN>=128 imposes a nontrivial impact
Krste Asanovic
Re: Smaller embedded version of the Vector extension
that's pretty handy, actually. I'm not sure it should be a property of
the V spec itself, rather it could be a requirement that software
which is translated in this method could require an
Guy Lemieux
Re: Smaller embedded version of the Vector extension
Zalman Stern
Re: Smaller embedded version of the Vector extension
If the minimum VLEN is at least 128 bits, one can translate NEON/SSE intrinsics directly without having to have every vector instruction dominated by a loop over the vector length.
Zalman Stern
Re: Smaller embedded version of the Vector extension
Krste, to be clear, the issue
The RVV spec should be inclusive, rather than exclusive. Setting VLEN
Sorry I wasn't clear. Of course, an implementation with VLEN=64 would
likely be slower than one
Guy Lemieux
Re: Smaller embedded version of the Vector extension
Software should still work with VLEN>=64 if written correctly, as it should be VLEN agnostic.
Maybe it should be a recommendation that VLEN>=128, with a minimum of 64 for app processors?
Lower
Tony Cole
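The "VLEN agnostic" point above can be illustrated with a small scalar model of a strip-mined loop. This is only a sketch: `vlmax` and `vector_add` are hypothetical names, the `min(...)` line is a simplified stand-in for what `vsetvli` grants, and element wraparound at SEW is ignored.

```python
def vlmax(vlen_bits, sew_bits, lmul=1):
    """Elements per register group: VLMAX = LMUL * VLEN / SEW."""
    return lmul * vlen_bits // sew_bits


def vector_add(a, b, vlen_bits, sew_bits=32, lmul=1):
    """Strip-mined elementwise add that never assumes a particular VLEN."""
    assert len(a) == len(b)
    out = []
    i = 0
    n = len(a)
    while i < n:
        # Simplified model of vsetvli: at most VLMAX elements per pass.
        vl = min(n - i, vlmax(vlen_bits, sew_bits, lmul))
        out.extend(x + y for x, y in zip(a[i:i + vl], b[i:i + vl]))
        i += vl
    return out


a = [1, 2, 3, 4, 5]
b = [10, 20, 30, 40, 50]
# Same answer on a VLEN=64 and a VLEN=128 machine; only the pass count differs.
same = vector_add(a, b, vlen_bits=64) == vector_add(a, b, vlen_bits=128)
```

Code written this way runs unchanged on either width, which is the sense in which a VLEN>=64 minimum would not break correctly written software.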
Re: Smaller embedded version of the Vector extension
This is the requirement for app processors, which are not generally small cores.
Most competing SIMD extensions are at least 128b per vector register.
Lower performance on codes that work well on
Krste Asanovic
Re: Smaller embedded version of the Vector extension
What is the advantage to RVV requiring VLEN >= 128?
I think this should be changed to VLEN >= 64 because:
1) VLEN = 64 is more likely for small implementations; creating a
mandatory expectation to
Guy Lemieux
Re: Smaller embedded version of the Vector extension
See GitHub issue #550
Krste
Krste Asanovic
Re: Smaller embedded version of the Vector extension
This is a good question.
So if the RVM22 profile requires VLEN=32, ELEN=64, LMUL=8, then the vector registers will have the same amount of state as ARM MVE.
Tariq
Tariq Kurd
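The state comparison above is simple arithmetic and can be checked with a short sketch. The MVE figure (eight 128-bit vector registers) is an assumption taken from Arm's published architecture description, not from the thread, and the names below are hypothetical.

```python
NUM_VREGS = 32  # RVV defines 32 architectural vector registers


def rvv_state_bits(vlen_bits):
    """Total vector register state for a given VLEN."""
    return NUM_VREGS * vlen_bits


def group_bits(vlen_bits, lmul):
    """Width of one register group at a given LMUL."""
    return vlen_bits * lmul


# Assumption (not stated in the thread): Arm MVE has eight 128-bit registers.
MVE_STATE_BITS = 8 * 128

matches_mve = rvv_state_bits(32) == MVE_STATE_BITS
# At LMUL=8 a VLEN=32 group spans 256 bits, wide enough for ELEN=64 elements.
fits_elen64 = group_bits(32, 8) >= 64
```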
Re: Reply: [RISCV] [techvectorext] Smaller embedded version of the Vector extension
Hi, Krste:
Does the RISC-V V TG have a plan to support a low-cost vector extension in the RVMxx profile?
Best Regards
Shaofei
2021.6.3
Original message
From: krste@...
Shaofei (B)
Re: Smaller embedded version of the Vector extension
Hi Tony,
All of the vector permutation instructions can be simulated using the memory system. For example, vslide can be simulated by storing the vector register and loading it at an offset; vrgather
Nick Knight
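The vslide case described above can be modeled in a few lines. This is a sketch only: a Python list stands in for the vector register and for the scratch memory, `vslidedown_via_memory` is a hypothetical name, and vl is assumed equal to VLMAX.

```python
def vslidedown_via_memory(vreg, offset):
    """Emulate vslidedown by a store followed by a load at an offset."""
    vlmax = len(vreg)
    # "Store" the register to a scratch buffer, followed by zero padding so
    # that reads past the end of the stored register return 0.
    memory = list(vreg) + [0] * offset
    # Unit-stride "load" starting `offset` elements into the buffer.
    return memory[offset:offset + vlmax]


slid = vslidedown_via_memory([1, 2, 3, 4], 1)
```

Elements whose source index falls beyond the register come back as zero, matching the behavior of vslidedown for source indices at or above VLMAX.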
Re: Smaller embedded version of the Vector extension
Hi Bruce,
Do you mean vrgather instead of vslide?
I use vrgather_vx_* and vslidedown to perform a vector element rotate (and other things), see:
Tony Cole
Re: Smaller embedded version of the Vector extension
 could an extension just change state like the number of vector registers?

Don't understand this question; please elaborate.
Krste
Krste Asanovic
Re: Smaller embedded version of the Vector extension
Thanks, I must have missed this bit:
"4.5. Mapping with LMUL > 1 and ELEN > VLEN
If vector registers are grouped to support larger SEW, with ELEN > VLEN, the vector registers in the group are
Tony Cole
Re: Smaller embedded version of the Vector extension
Section 4.5,
Krste
 On Wed, Jun 2, 2021 at 8:38 AM Andrew Waterman <andrew@...> wrote:
 It’s actually not fundamental to the ISA design that VLEN >= ELEN. An
 implementation with
Krste Asanovic
Re: Smaller embedded version of the Vector extension
could an extension just change state like the number of vector registers?
mark
Re: Smaller embedded version of the Vector extension
We do allow supported SEW to vary with LMUL, so an implementation can
support single-width operations on SEW=64. See section 4.5,
Krste
 So, (on a 32 x 32-bit vector register machine) the widening and
Krste Asanovic
Re: Smaller embedded version of the Vector extension
The VLEN>=128 constraint is only for the application processor "V"
extension for the app profile, not for embedded vectors, which can
have VLEN=32.
From spec Introduction:
'
The term base vector
Krste Asanovic
Re: Smaller embedded version of the Vector extension
It seems that restricting the minimum LMUL to 2 would halve the number of vector registers, and LMUL=4 would leave 8 vector registers.
Thang
Thang Tran
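The register counts mentioned above follow from dividing the 32 architectural registers by LMUL. A minimal sketch, with hypothetical helper names and integer LMUL values only:

```python
NUM_VREGS = 32  # RVV architectural vector registers


def register_groups(lmul):
    """Number of usable register groups at an integer LMUL."""
    if lmul not in (1, 2, 4, 8):
        raise ValueError("integer LMUL must be 1, 2, 4, or 8")
    return NUM_VREGS // lmul


def group_bases(lmul):
    """Register numbers that may name a group (multiples of LMUL)."""
    return list(range(0, NUM_VREGS, lmul))


counts = {lmul: register_groups(lmul) for lmul in (1, 2, 4, 8)}
```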
