From: Luke Kenneth Casson Leighton Date: Tue, 5 Dec 2023 14:36:42 +0000 (+0000) Subject: prepare assembler for warm-words, pospopcount X-Git-Url: https://git.libre-soc.org/?a=commitdiff_plain;h=d96e724f9878250007b5c68b70879e420841f410;p=libreriscv.git prepare assembler for warm-words, pospopcount --- diff --git a/openpower/sv/cookbook/pospopcnt.mdwn b/openpower/sv/cookbook/pospopcnt.mdwn index 2c9b6533b..eb8c722bb 100644 --- a/openpower/sv/cookbook/pospopcnt.mdwn +++ b/openpower/sv/cookbook/pospopcnt.mdwn @@ -53,5 +53,43 @@ bit-position, of an array of input values. pospopcnt +# Visual representation of the pospopcount algorithm + +# Walkthrough of the assembler + +``` +mtspr 9, 3" # move r3 to CTR +``` + +``` +# VL = MIN(CTR,MAXVL=8), Rc=1 (CR0 set if CTR ends) +setvl 3,0,8,0,1,1" # set MVL=8, VL=MIN(MVL,CTR) +``` + +``` +# load VL bytes (update r4 addr) but compressed (dw=8) +addi 6, 0, 0 # initialise all 64-bits of r6 to zero +sv.lbzu/pi/dw=8 *6, 1(4) # should be /lf here as well +``` + +``` +# gather performs the transpose (which gets us to positional..) +gbbd 8,6 +``` + +``` +# now those bits have been turned around, popcount and sum them +setvl 0,0,8,0,1,1 # set MVL=VL=8 +sv.popcntd/sw=8 *24,*8 # do the (now transposed) popcount +``` + +``` +sv.add *16,*16,*24 # and accumulate in results +``` + +``` +# branch back if CTR still non-zero. works even though VL=8 +sv.bc/all 16, *0, -0x28 # reduce CTR by VL and stop if -ve +``` [[!tag svp64_cookbook ]]