add new file for common section from ls006 and int_fp_mv

author Jacob Lifshay <programmerjake@gmail.com>

Thu, 6 Apr 2023 03:22:09 +0000 (20:22 -0700)

committer Jacob Lifshay <programmerjake@gmail.com>

Thu, 6 Apr 2023 03:22:09 +0000 (20:22 -0700)
author Jacob Lifshay <programmerjake@gmail.com>
Thu, 6 Apr 2023 03:22:09 +0000 (20:22 -0700)
committer Jacob Lifshay <programmerjake@gmail.com>
Thu, 6 Apr 2023 03:22:09 +0000 (20:22 -0700)
diff --git a/openpower/sv/int_fp_mv/moves_and_conversions.mdwn b/openpower/sv/int_fp_mv/moves_and_conversions.mdwn

new file mode 100644 (file)

index 0000000..7bf99f9
--- /dev/null
+++ b/openpower/sv/int_fp_mv/moves_and_conversions.mdwn
@@ -0,0 +1,772 @@
+<!-- main body for int_fp_mv.mdwn (without fmvis/fishmv) and ls006.mdwn -->
+# Immediate Tables
+
+Tables that are used by
+`fmvtg[s][.]`/`fmvfg[s][.]`/`fcvt[s]tg[o][.]`/`fcvtfg[s][.]`:
+
+## `IT` -- Integer Type
+
+| `IT` | Integer Type    | Assembly Alias Mnemonic |
+|------|-----------------|-------------------------|
+| 0    | Signed 32-bit   | `<op>w`                 |
+| 1    | Unsigned 32-bit | `<op>uw`                |
+| 2    | Signed 64-bit   | `<op>d`                 |
+| 3    | Unsigned 64-bit | `<op>ud`                |
+
+## `CVM` -- Float to Integer Conversion Mode
+
+| `CVM` | `rounding_mode` | Semantics                        |
+|-------|-----------------|----------------------------------|
+| 000   | from `FPSCR`    | [OpenPower semantics]            |
+| 001   | Truncate        | [OpenPower semantics]            |
+| 010   | from `FPSCR`    | [Java/Saturating semantics]      |
+| 011   | Truncate        | [Java/Saturating semantics]      |
+| 100   | from `FPSCR`    | [JavaScript semantics]           |
+| 101   | Truncate        | [JavaScript semantics]           |
+| rest  | --              | illegal instruction trap for now |
+
+[OpenPower semantics]: #fp-to-int-openpower-conversion-semantics
+[Java/Saturating semantics]: #fp-to-int-java-saturating-conversion-semantics
+[JavaScript semantics]: #fp-to-int-javascript-conversion-semantics
+
+----------
+
+# Moves
+
+These instructions perform a straight unaltered bit-level copy from one Register
+File to another.
+
+## Floating Move To GPR
+
+```
+    fmvtg RT, FRB
+    fmvtg. RT, FRB
+```
+
+| 0-5 | 6-10 | 11-15 | 16-20 | 21-30 | 31 | Form   |
+|-----|------|-------|-------|-------|----|--------|
+| PO  | RT   | 0     | FRB   | XO    | Rc | X-Form |
+
+```
+    RT <- (FRB)
+```
+
+Move a 64-bit float from a FPR to a GPR, just copying bits of the IEEE 754
+representation directly. This is equivalent to `stfd` followed by `ld`.
+As `fmvtg` is just copying bits, `FPSCR` is not affected in any way.
+
+Rc=1 tests RT and sets CR0, exactly like all other Scalar Fixed-Point
+operations.
+
+Special Registers altered:
+
+```
+    CR0     (if Rc=1)
+```
+
+----------
+
+## Floating Move To GPR Single
+
+```
+    fmvtgs RT, FRB
+    fmvtgs. RT, FRB
+```
+
+| 0-5 | 6-10 | 11-15 | 16-20 | 21-30 | 31 | Form   |
+|-----|------|-------|-------|-------|----|--------|
+| PO  | RT   | 0     | FRB   | XO    | Rc | X-Form |
+
+```
+    RT <- [0] * 32 || SINGLE((FRB))  # SINGLE since that's what stfs uses
+```
+
+Move a 32-bit float from a FPR to a GPR, just copying bits of the IEEE 754
+representation directly. This is equivalent to `stfs` followed by `lwz`.
+As `fmvtgs` is just copying bits, `FPSCR` is not affected in any way.
+
+Rc=1 tests RT and sets CR0, exactly like all other Scalar Fixed-Point
+operations.
+
+Special Registers altered:
+
+```
+    CR0     (if Rc=1)
+```
+
+----------
+
+\newpage{}
+
+## Double-Precision Floating Move From GPR
+
+```
+    fmvfg FRT, RB
+    fmvfg. FRT, RB
+```
+
+| 0-5 | 6-10 | 11-15 | 16-20 | 21-30 | 31 | Form   |
+|-----|------|-------|-------|-------|----|--------|
+| PO  | FRT  | 0     | RB    | XO    | Rc | X-Form |
+
+```
+    FRT <- (RB)
+```
+
+move a 64-bit float from a GPR to a FPR, just copying bits of the IEEE 754
+representation directly. This is equivalent to `std` followed by `lfd`.
+As `fmvfg` is just copying bits, `FPSCR` is not affected in any way.
+
+Rc=1 tests FRT and sets CR1, exactly like all other Scalar Floating-Point
+operations.
+
+Special Registers altered:
+
+```
+    CR1     (if Rc=1)
+```
+
+----------
+
+## Floating Move From GPR Single
+
+```
+    fmvfgs FRT, RB
+    fmvfgs. FRT, RB
+```
+
+| 0-5 | 6-10 | 11-15 | 16-20 | 21-30 | 31 | Form   |
+|-----|------|-------|-------|-------|----|--------|
+| PO  | FRT  | 0     | RB    | XO    | Rc | X-Form |
+
+```
+    FRT <- DOUBLE((RB)[32:63])  # DOUBLE since that's what lfs uses
+```
+
+move a 32-bit float from a GPR to a FPR, just copying bits of the IEEE 754
+representation directly. This is equivalent to `stw` followed by `lfs`.
+As `fmvfgs` is just copying bits, `FPSCR` is not affected in any way.
+
+Rc=1 tests FRT and sets CR1, exactly like all other Scalar Floating-Point
+operations.
+
+Special Registers altered:
+
+```
+    CR1     (if Rc=1)
+```
+
+----------
+
+\newpage{}
+
+# Conversions
+
+Unlike the move instructions
+these instructions perform conversions between Integer and
+Floating Point. Truncation can therefore occur, as well
+as exceptions.
+
+## Double-Precision Floating Convert From Integer In GPR
+
+```
+    fcvtfg FRT, RB, IT
+    fcvtfg. FRT, RB, IT
+```
+
+| 0-5 | 6-10 | 11-12 | 13-15 | 16-20 | 21-30 | 31 | Form   |
+|-----|------|-------|-------|-------|-------|----|--------|
+| PO  | FRT  | IT    | 0     | RB    | XO    | Rc | X-Form |
+
+```
+    if IT[0] = 0 then  # 32-bit int -> 64-bit float
+        # rounding never necessary, so don't touch FPSCR
+        # based off xvcvsxwdp
+        if IT = 0 then  # Signed 32-bit
+            src <- bfp_CONVERT_FROM_SI32((RB)[32:63])
+        else  # IT = 1 -- Unsigned 32-bit
+            src <- bfp_CONVERT_FROM_UI32((RB)[32:63])
+        FRT <- bfp64_CONVERT_FROM_BFP(src)
+    else
+        # rounding may be necessary. based off xscvuxdsp
+        reset_xflags()
+        switch(IT)
+            case(0):  # Signed 32-bit
+                src <- bfp_CONVERT_FROM_SI32((RB)[32:63])
+            case(1):  # Unsigned 32-bit
+                src <- bfp_CONVERT_FROM_UI32((RB)[32:63])
+            case(2):  # Signed 64-bit
+                src <- bfp_CONVERT_FROM_SI64((RB))
+            default:  # Unsigned 64-bit
+                src <- bfp_CONVERT_FROM_UI64((RB))
+        rnd <- bfp_ROUND_TO_BFP64(FPSCR.RN, src)
+        result <- bfp64_CONVERT_FROM_BFP(rnd)
+        cls <- fprf_CLASS_BFP64(result)
+
+        if xx_flag = 1 then SetFX(FPSCR.XX)
+
+        FRT <- result
+        FPSCR.FPRF <- cls
+        FPSCR.FR <- inc_flag
+        FPSCR.FI <- xx_flag
+```
+<!-- note the PowerISA spec. explicitly has empty lines before/after SetFX,
+don't remove them -->
+
+Convert from a unsigned/signed 32/64-bit integer in RB to a 64-bit
+float in FRT.
+
+If converting from a unsigned/signed 32-bit integer to a 64-bit float,
+rounding is never necessary, so `FPSCR` is unmodified and exceptions are
+never raised. Otherwise, `FPSCR` is modified and exceptions are raised
+as usual.
+
+Rc=1 tests FRT and sets CR1, exactly like all other Scalar Floating-Point
+operations.
+
+Special Registers altered:
+
+```
+    CR1               (if Rc=1)
+    FPRF FR FI FX XX  (if IT[0]=1)
+```
+
+### Assembly Aliases
+
+| Assembly Alias       | Full Instruction     |
+|----------------------|----------------------|
+| `fcvtfgw FRT, RB`    | `fcvtfg FRT, RB, 0`  |
+| `fcvtfgw. FRT, RB`   | `fcvtfg. FRT, RB, 0` |
+| `fcvtfguw FRT, RB`   | `fcvtfg FRT, RB, 1`  |
+| `fcvtfguw. FRT, RB`  | `fcvtfg. FRT, RB, 1` |
+| `fcvtfgd FRT, RB`    | `fcvtfg FRT, RB, 2`  |
+| `fcvtfgd. FRT, RB`   | `fcvtfg. FRT, RB, 2` |
+| `fcvtfgud FRT, RB`   | `fcvtfg FRT, RB, 3`  |
+| `fcvtfgud. FRT, RB`  | `fcvtfg. FRT, RB, 3` |
+
+----------
+
+\newpage{}
+
+## Floating Convert From Integer In GPR Single
+
+```
+    fcvtfgs FRT, RB, IT
+    fcvtfgs. FRT, RB, IT
+```
+
+| 0-5 | 6-10 | 11-12 | 13-15 | 16-20 | 21-30 | 31 | Form   |
+|-----|------|-------|-------|-------|-------|----|--------|
+| PO  | FRT  | IT    | 0     | RB    | XO    | Rc | X-Form |
+
+```
+    # rounding may be necessary. based off xscvuxdsp
+    reset_xflags()
+    switch(IT)
+        case(0):  # Signed 32-bit
+            src <- bfp_CONVERT_FROM_SI32((RB)[32:63])
+        case(1):  # Unsigned 32-bit
+            src <- bfp_CONVERT_FROM_UI32((RB)[32:63])
+        case(2):  # Signed 64-bit
+            src <- bfp_CONVERT_FROM_SI64((RB))
+        default:  # Unsigned 64-bit
+            src <- bfp_CONVERT_FROM_UI64((RB))
+    rnd <- bfp_ROUND_TO_BFP32(FPSCR.RN, src)
+    result32 <- bfp32_CONVERT_FROM_BFP(rnd)
+    cls <- fprf_CLASS_BFP32(result32)
+    result <- DOUBLE(result32)
+
+    if xx_flag = 1 then SetFX(FPSCR.XX)
+
+    FRT <- result
+    FPSCR.FPRF <- cls
+    FPSCR.FR <- inc_flag
+    FPSCR.FI <- xx_flag
+```
+<!-- note the PowerISA spec. explicitly has empty lines before/after SetFX,
+don't remove them -->
+
+Convert from a unsigned/signed 32/64-bit integer in RB to a 32-bit
+float in FRT, following the usual 32-bit float in 64-bit float format.
+`FPSCR` is modified and exceptions are raised as usual.
+
+Rc=1 tests FRT and sets CR1, exactly like all other Scalar Floating-Point
+operations.
+
+Special Registers altered:
+
+```
+    CR1     (if Rc=1)
+    FPRF FR FI FX XX
+```
+
+### Assembly Aliases
+
+| Assembly Alias       | Full Instruction     |
+|----------------------|----------------------|
+| `fcvtfgws FRT, RB`   | `fcvtfg FRT, RB, 0`  |
+| `fcvtfgws. FRT, RB`  | `fcvtfg. FRT, RB, 0` |
+| `fcvtfguws FRT, RB`  | `fcvtfg FRT, RB, 1`  |
+| `fcvtfguws. FRT, RB` | `fcvtfg. FRT, RB, 1` |
+| `fcvtfgds FRT, RB`   | `fcvtfg FRT, RB, 2`  |
+| `fcvtfgds. FRT, RB`  | `fcvtfg. FRT, RB, 2` |
+| `fcvtfguds FRT, RB`  | `fcvtfg FRT, RB, 3`  |
+| `fcvtfguds. FRT, RB` | `fcvtfg. FRT, RB, 3` |
+
+----------
+
+\newpage{}
+
+## Floating-point to Integer Conversion Overview
+
+<div id="fpr-to-gpr-conversion-mode"></div>
+
+IEEE 754 doesn't specify what results are obtained when converting a NaN
+or out-of-range floating-point value to integer, so different programming
+languages and ISAs have made different choices.  Below is an overview
+of the different variants, listing the languages and hardware that
+implements each variant.
+
+For convenience, we will give those different conversion semantics names
+based on which common ISA or programming language uses them, since there
+may not be an established name for them:
+
+**Standard OpenPower conversion**
+
+This conversion performs "saturation with NaN converted to minimum
+valid integer". This is also exactly the same as the x86 ISA conversion
+semantics.  OpenPOWER however has instructions for both:
+
+* rounding mode read from FPSCR
+* rounding mode always set to truncate
+
+**Java/Saturating conversion**
+
+For the sake of simplicity, the FP -> Integer conversion semantics
+generalized from those used by Java's semantics (and Rust's `as`
+operator) will be referred to as [Java/Saturating conversion
+semantics](#fp-to-int-java-saturating-conversion-semantics).
+
+Those same semantics are used in some way by all of the following
+languages (not necessarily for the default conversion method):
+
+* Java's
+  [FP -> Integer conversion](https://docs.oracle.com/javase/specs/jls/se16/html/jls-5.html#jls-5.1.3)
+  (only for long/int results)
+* Rust's FP -> Integer conversion using the
+  [`as` operator](https://doc.rust-lang.org/reference/expressions/operator-expr.html#semantics)
+* LLVM's
+  [`llvm.fptosi.sat`](https://llvm.org/docs/LangRef.html#llvm-fptosi-sat-intrinsic) and
+  [`llvm.fptoui.sat`](https://llvm.org/docs/LangRef.html#llvm-fptoui-sat-intrinsic) intrinsics
+* SPIR-V's OpenCL dialect's
+  [`OpConvertFToU`](https://www.khronos.org/registry/spir-v/specs/unified1/SPIRV.html#OpConvertFToU) and
+  [`OpConvertFToS`](https://www.khronos.org/registry/spir-v/specs/unified1/SPIRV.html#OpConvertFToS)
+  instructions when decorated with
+  [the `SaturatedConversion` decorator](https://www.khronos.org/registry/spir-v/specs/unified1/SPIRV.html#_a_id_decoration_a_decoration).
+* WebAssembly has also introduced
+ [trunc_sat_u](ttps://webassembly.github.io/spec/core/exec/numerics.html#op-trunc-sat-u) and
+ [trunc_sat_s](https://webassembly.github.io/spec/core/exec/numerics.html#op-trunc-sat-s)
+
+**JavaScript conversion**
+
+For the sake of simplicity, the FP -> Integer conversion
+semantics generalized from those used by JavaScripts's `ToInt32`
+abstract operation will be referred to as [JavaScript conversion
+semantics](#fp-to-int-javascript-conversion-semantics).
+
+This instruction is present in ARM assembler as FJCVTZS
+<https://developer.arm.com/documentation/dui0801/g/hko1477562192868>
+
+**Rc=1 and OE=1**
+
+All of these instructions have an Rc=1 mode which sets CR0
+in the normal way for any instructions producing a GPR result.
+Additionally, when OE=1, if the numerical value of the FP number
+is not 100% accurately preserved (due to truncation or saturation
+and including when the FP number was NaN) then this is considered
+to be an integer Overflow condition, and CR0.SO, XER.SO and XER.OV
+are all set as normal for any GPR instructions that overflow.
+
+\newpage{}
+
+### FP to Integer Conversion Simplified Pseudo-code
+
+Key for pseudo-code:
+
+| term                      | result type | definition                                                                                         |
+|---------------------------|-------------|----------------------------------------------------------------------------------------------------|
+| `fp`                      | --          | `f32` or `f64` (or other types from SimpleV)                                                       |
+| `int`                     | --          | `u32`/`u64`/`i32`/`i64` (or other types from SimpleV)                                              |
+| `uint`                    | --          | the unsigned integer of the same bit-width as `int`                                                |
+| `int::BITS`               | `int`       | the bit-width of `int`                                                                             |
+| `uint::MIN_VALUE`         | `uint`      | the minimum value `uint` can store: `0`                   |
+| `uint::MAX_VALUE`          | `uint`       | the maximum value `uint` can store: `2^int::BITS - 1`  |
+| `int::MIN_VALUE`          | `int`       | the minimum value `int` can store : `-2^(int::BITS-1)`              |
+| `int::MAX_VALUE`          | `int`       | the maximum value `int` can store :  `2^(int::BITS-1) - 1`  |
+| `int::VALUE_COUNT`        | Integer     | the number of different values `int` can store (`2^int::BITS`). too big to fit in `int`.           |
+| `rint(fp, rounding_mode)` | `fp`        | rounds the floating-point value `fp` to an integer according to rounding mode `rounding_mode`      |
+
+<div id="fp-to-int-openpower-conversion-semantics"></div>
+OpenPower conversion semantics (section A.2 page 1009 (page 1035) of
+Power ISA v3.1B):
+
+```
+    def fp_to_int_open_power<fp, int>(v: fp) -> int:
+        if v is NaN:
+            return int::MIN_VALUE
+        if v >= int::MAX_VALUE:
+            return int::MAX_VALUE
+        if v <= int::MIN_VALUE:
+            return int::MIN_VALUE
+        return (int)rint(v, rounding_mode)
+```
+
+<div id="fp-to-int-java-saturating-conversion-semantics"></div>
+[Java/Saturating conversion semantics](https://docs.oracle.com/javase/specs/jls/se16/html/jls-5.html#jls-5.1.3)
+(only for long/int results)
+(with adjustment to add non-truncate rounding modes):
+
+```
+    def fp_to_int_java_saturating<fp, int>(v: fp) -> int:
+        if v is NaN:
+            return 0
+        if v >= int::MAX_VALUE:
+            return int::MAX_VALUE
+        if v <= int::MIN_VALUE:
+            return int::MIN_VALUE
+        return (int)rint(v, rounding_mode)
+```
+
+<div id="fp-to-int-javascript-conversion-semantics"></div>
+Section 7.1 of the ECMAScript / JavaScript
+[conversion semantics](https://262.ecma-international.org/11.0/#sec-toint32)
+(with adjustment to add non-truncate rounding modes):
+
+```
+    def fp_to_int_java_script<fp, int>(v: fp) -> int:
+        if v is NaN or infinite:
+            return 0
+        v = rint(v, rounding_mode)  # assume no loss of precision in result
+        v = v mod int::VALUE_COUNT  # 2^32 for i32, 2^64 for i64, result is non-negative
+        bits = (uint)v
+        return (int)bits
+```
+
+----------
+
+\newpage{}
+
+## Double-Precision Floating Convert To Integer In GPR
+
+```
+    fcvttg RT, FRB, CVM, IT
+    fcvttg. RT, FRB, CVM, IT
+    fcvttgo RT, FRB, CVM, IT
+    fcvttgo. RT, FRB, CVM, IT
+```
+
+| 0-5 | 6-10 | 11-12 | 13-15 | 16-20 | 21 | 22-30 | 31 | Form    |
+|-----|------|-------|-------|-------|----|-------|----|---------|
+| PO  | RT   | IT    | CVM   | FRB   | OE | XO    | Rc | XO-Form |
+
+```
+    # based on xscvdpuxws
+    reset_xflags()
+    src <- bfp_CONVERT_FROM_BFP64((FRB))
+
+    switch(IT)
+        case(0):  # Signed 32-bit
+            range_min <- bfp_CONVERT_FROM_SI32(0x8000_0000)
+            range_max <- bfp_CONVERT_FROM_SI32(0x7FFF_FFFF)
+            js_mask <- 0xFFFF_FFFF
+        case(1):  # Unsigned 32-bit
+            range_min <- bfp_CONVERT_FROM_UI32(0)
+            range_max <- bfp_CONVERT_FROM_UI32(0xFFFF_FFFF)
+            js_mask <- 0xFFFF_FFFF
+        case(2):  # Signed 64-bit
+            range_min <- bfp_CONVERT_FROM_SI64(-0x8000_0000_0000_0000)
+            range_max <- bfp_CONVERT_FROM_SI64(0x7FFF_FFFF_FFFF_FFFF)
+            js_mask <- 0xFFFF_FFFF_FFFF_FFFF
+        default:  # Unsigned 64-bit
+            range_min <- bfp_CONVERT_FROM_UI64(0)
+            range_max <- bfp_CONVERT_FROM_UI64(0xFFFF_FFFF_FFFF_FFFF)
+            js_mask <- 0xFFFF_FFFF_FFFF_FFFF
+
+    if CVM[2] = 1 or FPSCR.RN = 0b01 then
+        rnd <- bfp_ROUND_TO_INTEGER_TRUNC(src)
+    else if FPSCR.RN = 0b00 then
+        rnd <- bfp_ROUND_TO_INTEGER_NEAR_EVEN(src)
+    else if FPSCR.RN = 0b10 then
+        rnd <- bfp_ROUND_TO_INTEGER_CEIL(src)
+    else if FPSCR.RN = 0b11 then
+        rnd <- bfp_ROUND_TO_INTEGER_FLOOR(src)
+
+    switch(CVM)
+        case(0, 1):  # OpenPower semantics
+            if IsNaN(rnd) then
+                result <- si64_CONVERT_FROM_BFP(range_min)
+            else if bfp_COMPARE_GT(rnd, range_max) then
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else if bfp_COMPARE_LT(rnd, range_min) then
+                result <- si64_CONVERT_FROM_BFP(range_min)
+            else if IT[1] = 1 then  # Unsigned 32/64-bit
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else  # Signed 32/64-bit
+                result <- si64_CONVERT_FROM_BFP(range_max)
+        case(2, 3):  # Java/Saturating semantics
+            if IsNaN(rnd) then
+                result <- [0] * 64
+            else if bfp_COMPARE_GT(rnd, range_max) then
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else if bfp_COMPARE_LT(rnd, range_min) then
+                result <- si64_CONVERT_FROM_BFP(range_min)
+            else if IT[1] = 1 then  # Unsigned 32/64-bit
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else  # Signed 32/64-bit
+                result <- si64_CONVERT_FROM_BFP(range_max)
+        default:  # JavaScript semantics
+            # CVM = 6, 7 are illegal instructions
+            # this works because the largest type we try to convert from has
+            # 53 significand bits, and the largest type we try to convert to
+            # has 64 bits, and the sum of those is strictly less than the 128
+            # bits of the intermediate result.
+            limit <- bfp_CONVERT_FROM_UI128([1] * 128)
+            if IsInf(rnd) or IsNaN(rnd) then
+                result <- [0] * 64
+            else if bfp_COMPARE_GT(bfp_ABSOLUTE(rnd), limit) then
+                result <- [0] * 64
+            else
+                result128 <- si128_CONVERT_FROM_BFP(rnd)
+                result <- result128[64:127] & js_mask
+
+    switch(IT)
+        case(0):  # Signed 32-bit
+            result <- EXTS64(result[32:63])
+            result_bfp <- bfp_CONVERT_FROM_SI32(result[32:63])
+        case(1):  # Unsigned 32-bit
+            result <- EXTZ64(result[32:63])
+            result_bfp <- bfp_CONVERT_FROM_UI32(result[32:63])
+        case(2):  # Signed 64-bit
+            result_bfp <- bfp_CONVERT_FROM_SI64(result)
+        default:  # Unsigned 64-bit
+            result_bfp <- bfp_CONVERT_FROM_UI64(result)
+
+    if vxsnan_flag = 1 then SetFX(FPSCR.VXSNAN)
+    if vxcvi_flag = 1 then SetFX(FPSCR.VXCVI)
+    if xx_flag = 1 then SetFX(FPSCR.XX)
+
+    vx_flag <- vxsnan_flag | vxcvi_flag
+    vex_flag <- FPSCR.VE & vx_flag
+
+    if vex_flag = 0 then
+        RT <- result
+        FPSCR.FPRF <- undefined
+        FPSCR.FR <- inc_flag
+        FPSCR.FI <- xx_flag
+        if IsNaN(src) or not bfp_COMPARE_EQ(src, result_bfp) then
+            overflow <- 1  # signals SO only when OE = 1
+    else
+        FPSCR.FR <- 0
+        FPSCR.FI <- 0
+```
+
+Convert from 64-bit float in FRB to a unsigned/signed 32/64-bit integer
+in RT, with the conversion overflow/rounding semantics following the
+chosen `CVM` value. `FPSCR` is modified and exceptions are raised as usual.
+
+These instructions have an Rc=1 mode which sets CR0 in the normal
+way for any instructions producing a GPR result.  Additionally, when OE=1,
+if the numerical value of the FP number is not 100% accurately preserved
+(due to truncation or saturation and including when the FP number was
+NaN) then this is considered to be an Integer Overflow condition, and
+CR0.SO, XER.SO and XER.OV are all set as normal for any GPR instructions
+that overflow.
+
+Special Registers altered:
+
+```
+    CR0              (if Rc=1)
+    XER SO, OV, OV32 (if OE=1)
+    FPRF=0bUUUUU FR FI FX XX VXSNAN VXCV
+```
+
+### Assembly Aliases
+
+| Assembly Alias            | Full Instruction           |
+|---------------------------|----------------------------|
+| `fcvttgw RT, FRB, CVM`    | `fcvttg RT, FRB, CVM, 0`   |
+| `fcvttgw. RT, FRB, CVM`   | `fcvttg. RT, FRB, CVM, 0`  |
+| `fcvttgwo RT, FRB, CVM`   | `fcvttgo RT, FRB, CVM, 0`  |
+| `fcvttgwo. RT, FRB, CVM`  | `fcvttgo. RT, FRB, CVM, 0` |
+| `fcvttguw RT, FRB, CVM`   | `fcvttg RT, FRB, CVM, 1`   |
+| `fcvttguw. RT, FRB, CVM`  | `fcvttg. RT, FRB, CVM, 1`  |
+| `fcvttguwo RT, FRB, CVM`  | `fcvttgo RT, FRB, CVM, 1`  |
+| `fcvttguwo. RT, FRB, CVM` | `fcvttgo. RT, FRB, CVM, 1` |
+| `fcvttgd RT, FRB, CVM`    | `fcvttg RT, FRB, CVM, 2`   |
+| `fcvttgd. RT, FRB, CVM`   | `fcvttg. RT, FRB, CVM, 2`  |
+| `fcvttgdo RT, FRB, CVM`   | `fcvttgo RT, FRB, CVM, 2`  |
+| `fcvttgdo. RT, FRB, CVM`  | `fcvttgo. RT, FRB, CVM, 2` |
+| `fcvttgud RT, FRB, CVM`   | `fcvttg RT, FRB, CVM, 3`   |
+| `fcvttgud. RT, FRB, CVM`  | `fcvttg. RT, FRB, CVM, 3`  |
+| `fcvttgudo RT, FRB, CVM`  | `fcvttgo RT, FRB, CVM, 3`  |
+| `fcvttgudo. RT, FRB, CVM` | `fcvttgo. RT, FRB, CVM, 3` |
+
+----------
+
+\newpage{}
+
+## Floating Convert Single To Integer In GPR
+
+```
+    fcvtstg RT, FRB, CVM, IT
+    fcvtstg. RT, FRB, CVM, IT
+    fcvtstgo RT, FRB, CVM, IT
+    fcvtstgo. RT, FRB, CVM, IT
+```
+
+| 0-5 | 6-10 | 11-12 | 13-15 | 16-20 | 21 | 22-30 | 31 | Form    |
+|-----|------|-------|-------|-------|----|-------|----|---------|
+| PO  | RT   | IT    | CVM   | FRB   | OE | XO    | Rc | XO-Form |
+
+```
+    # based on xscvdpuxws
+    reset_xflags()
+    src <- bfp_CONVERT_FROM_BFP32(SINGLE((FRB)))
+
+    switch(IT)
+        case(0):  # Signed 32-bit
+            range_min <- bfp_CONVERT_FROM_SI32(0x8000_0000)
+            range_max <- bfp_CONVERT_FROM_SI32(0x7FFF_FFFF)
+            js_mask <- 0xFFFF_FFFF
+        case(1):  # Unsigned 32-bit
+            range_min <- bfp_CONVERT_FROM_UI32(0)
+            range_max <- bfp_CONVERT_FROM_UI32(0xFFFF_FFFF)
+            js_mask <- 0xFFFF_FFFF
+        case(2):  # Signed 64-bit
+            range_min <- bfp_CONVERT_FROM_SI64(-0x8000_0000_0000_0000)
+            range_max <- bfp_CONVERT_FROM_SI64(0x7FFF_FFFF_FFFF_FFFF)
+            js_mask <- 0xFFFF_FFFF_FFFF_FFFF
+        default:  # Unsigned 64-bit
+            range_min <- bfp_CONVERT_FROM_UI64(0)
+            range_max <- bfp_CONVERT_FROM_UI64(0xFFFF_FFFF_FFFF_FFFF)
+            js_mask <- 0xFFFF_FFFF_FFFF_FFFF
+
+    if CVM[2] = 1 or FPSCR.RN = 0b01 then
+        rnd <- bfp_ROUND_TO_INTEGER_TRUNC(src)
+    else if FPSCR.RN = 0b00 then
+        rnd <- bfp_ROUND_TO_INTEGER_NEAR_EVEN(src)
+    else if FPSCR.RN = 0b10 then
+        rnd <- bfp_ROUND_TO_INTEGER_CEIL(src)
+    else if FPSCR.RN = 0b11 then
+        rnd <- bfp_ROUND_TO_INTEGER_FLOOR(src)
+
+    switch(CVM)
+        case(0, 1):  # OpenPower semantics
+            if IsNaN(rnd) then
+                result <- si64_CONVERT_FROM_BFP(range_min)
+            else if bfp_COMPARE_GT(rnd, range_max) then
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else if bfp_COMPARE_LT(rnd, range_min) then
+                result <- si64_CONVERT_FROM_BFP(range_min)
+            else if IT[1] = 1 then  # Unsigned 32/64-bit
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else  # Signed 32/64-bit
+                result <- si64_CONVERT_FROM_BFP(range_max)
+        case(2, 3):  # Java/Saturating semantics
+            if IsNaN(rnd) then
+                result <- [0] * 64
+            else if bfp_COMPARE_GT(rnd, range_max) then
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else if bfp_COMPARE_LT(rnd, range_min) then
+                result <- si64_CONVERT_FROM_BFP(range_min)
+            else if IT[1] = 1 then  # Unsigned 32/64-bit
+                result <- ui64_CONVERT_FROM_BFP(range_max)
+            else  # Signed 32/64-bit
+                result <- si64_CONVERT_FROM_BFP(range_max)
+        default:  # JavaScript semantics
+            # CVM = 6, 7 are illegal instructions
+            # this works because the largest type we try to convert from has
+            # 53 significand bits, and the largest type we try to convert to
+            # has 64 bits, and the sum of those is strictly less than the 128
+            # bits of the intermediate result.
+            limit <- bfp_CONVERT_FROM_UI128([1] * 128)
+            if IsInf(rnd) or IsNaN(rnd) then
+                result <- [0] * 64
+            else if bfp_COMPARE_GT(bfp_ABSOLUTE(rnd), limit) then
+                result <- [0] * 64
+            else
+                result128 <- si128_CONVERT_FROM_BFP(rnd)
+                result <- result128[64:127] & js_mask
+
+    switch(IT)
+        case(0):  # Signed 32-bit
+            result <- EXTS64(result[32:63])
+            result_bfp <- bfp_CONVERT_FROM_SI32(result[32:63])
+        case(1):  # Unsigned 32-bit
+            result <- EXTZ64(result[32:63])
+            result_bfp <- bfp_CONVERT_FROM_UI32(result[32:63])
+        case(2):  # Signed 64-bit
+            result_bfp <- bfp_CONVERT_FROM_SI64(result)
+        default:  # Unsigned 64-bit
+            result_bfp <- bfp_CONVERT_FROM_UI64(result)
+
+    if vxsnan_flag = 1 then SetFX(FPSCR.VXSNAN)
+    if vxcvi_flag = 1 then SetFX(FPSCR.VXCVI)
+    if xx_flag = 1 then SetFX(FPSCR.XX)
+
+    vx_flag <- vxsnan_flag | vxcvi_flag
+    vex_flag <- FPSCR.VE & vx_flag
+
+    if vex_flag = 0 then
+        RT <- result
+        FPSCR.FPRF <- undefined
+        FPSCR.FR <- inc_flag
+        FPSCR.FI <- xx_flag
+        if IsNaN(src) or not bfp_COMPARE_EQ(src, result_bfp) then
+            overflow <- 1  # signals SO only when OE = 1
+    else
+        FPSCR.FR <- 0
+        FPSCR.FI <- 0
+```
+
+Convert from 32-bit float in FRB to a unsigned/signed 32/64-bit integer
+in RT, with the conversion overflow/rounding semantics following the
+chosen `CVM` value, following the usual 32-bit float in 64-bit float
+format. `FPSCR` is modified and exceptions are raised as usual.
+
+These instructions have an Rc=1 mode which sets CR0 in the normal
+way for any instructions producing a GPR result.  Additionally, when OE=1,
+if the numerical value of the FP number is not 100% accurately preserved
+(due to truncation or saturation and including when the FP number was
+NaN) then this is considered to be an Integer Overflow condition, and
+CR0.SO, XER.SO and XER.OV are all set as normal for any GPR instructions
+that overflow.
+
+Special Registers altered:
+
+```
+    CR0              (if Rc=1)
+    XER SO, OV, OV32 (if OE=1)
+    FPRF=0bUUUUU FR FI FX XX VXSNAN VXCV
+```
+
+### Assembly Aliases
+
+| Assembly Alias             | Full Instruction            |
+|----------------------------|-----------------------------|
+| `fcvtstgw RT, FRB, CVM`    | `fcvtstg RT, FRB, CVM, 0`   |
+| `fcvtstgw. RT, FRB, CVM`   | `fcvtstg. RT, FRB, CVM, 0`  |
+| `fcvtstgwo RT, FRB, CVM`   | `fcvtstgo RT, FRB, CVM, 0`  |
+| `fcvtstgwo. RT, FRB, CVM`  | `fcvtstgo. RT, FRB, CVM, 0` |
+| `fcvtstguw RT, FRB, CVM`   | `fcvtstg RT, FRB, CVM, 1`   |
+| `fcvtstguw. RT, FRB, CVM`  | `fcvtstg. RT, FRB, CVM, 1`  |
+| `fcvtstguwo RT, FRB, CVM`  | `fcvtstgo RT, FRB, CVM, 1`  |
+| `fcvtstguwo. RT, FRB, CVM` | `fcvtstgo. RT, FRB, CVM, 1` |
+| `fcvtstgd RT, FRB, CVM`    | `fcvtstg RT, FRB, CVM, 2`   |
+| `fcvtstgd. RT, FRB, CVM`   | `fcvtstg. RT, FRB, CVM, 2`  |
+| `fcvtstgdo RT, FRB, CVM`   | `fcvtstgo RT, FRB, CVM, 2`  |
+| `fcvtstgdo. RT, FRB, CVM`  | `fcvtstgo. RT, FRB, CVM, 2` |
+| `fcvtstgud RT, FRB, CVM`   | `fcvtstg RT, FRB, CVM, 3`   |
+| `fcvtstgud. RT, FRB, CVM`  | `fcvtstg. RT, FRB, CVM, 3`  |
+| `fcvtstgudo RT, FRB, CVM`  | `fcvtstgo RT, FRB, CVM, 3`  |
+| `fcvtstgudo. RT, FRB, CVM` | `fcvtstgo. RT, FRB, CVM, 3` |
author	Jacob Lifshay <programmerjake@gmail.com>
	Thu, 6 Apr 2023 03:22:09 +0000 (20:22 -0700)
committer	Jacob Lifshay <programmerjake@gmail.com>
	Thu, 6 Apr 2023 03:22:09 +0000 (20:22 -0700)