isa_conflict_resolution.mdwn

   1 # Resolving ISA conflicts and providing a pain-free RISC-V Standards Upgrade Path
   2
   3 ## Executive Summary
   4
   5 A non-invasive backwards-compatible change to make mvendorid and marchid
   6 being read-only to be a formal declaration of an architecture having no
   7 Custom Extensions, and being permitted to be WARL in order to support
   8 multiple simultaneous architectures on the same processor (or per hart
   9 or harts) permits not only backwards and forwards compatibility with
  10 existing implementations of the RISC-V Standard, not only permits seamless
  11 transitions to future versions of the RISC-V Standard (something that is
  12 not possible at the moment), but fixes the problem of clashes in Custom
  13 Extension opcodes on a global worldwide permanent and ongoing basis.
  14
  15 Summary of impact and benefits:
  16
  17 * Implementation impact for existing implementations (even though
  18   the Standard is not finalised) is zero.
  19 * Impact for future implementations compliant with (only one) version of the
  20   RISC-V Standard is zero.
  21 * Benefits for implementations complying with (one or more) versions
  22   of the RISC-V Standard is: increased customer acceptance due to
  23   a smooth upgrade path at the customer's pace and initiative vis-a-vis
  24   legacy proprietary software.
  25 * Benefits for implementations deploying multiple Custom Extensions
  26   are a massive reduction in NREs and the hugely reduced ongoing software
  27   toolchain maintenance costs plus the benefit of having security updates
  28   from upstream software sources due to
  29   *globally unique identifying information* resulting in zero binary
  30   encoding conflicts in the toolchains and resultant binaries
  31   *even for Custom Extensions*.
  32
  33 ## Introduction
  34
  35 In a lengthy thread that ironically was full of conflict indicative
  36 of the future direction in which RISC-V will go if left unresolved,
  37 multiple Custom Extensions were noted to be permitted free rein to
  38 introduce global binary-encoding conflict with no means of resolution
  39 described or endorsed by the RISC-V Standard: a practice that has known
  40 disastrous and irreversible consequences for any architecture that
  41 permits such practices (1).
  42
  43 Much later on in the discussion it was realised that there is also no way
  44 within the current RISC-V Specification to transition to improved versions
  45 of the standard, regardless of whether the fixes are absolutely critical
  46 show-stoppers or whether they are just keeping the standard up-to-date (2).
  47
  48 With no transition path there is guaranteed to be tension and conflict
  49 within the RISC-V Community over whether revisions should be made:
  50 should existing legacy designs be prioritised, mutually-exclusively over
  51 future designs (and what happens during the transition period is absolute
  52 chaos, with the compiler toolchain, software ecosystem and ultimately
  53 the end-users bearing the full brunt of the impact).  If several
  54 overlapping revisions are required that have not yet transitioned out
  55 of use (which could take well over two decades to occur) the situation
  56 becomes disastrous for the credibility of the entire RISC-V ecosystem.
  57
  58 It was also pointed out that Compliance is an extremely important factor
  59 to take into consideration, and that Custom Extensions (as being optional)
  60 effectively and quite reasonably fall entirely outside of the scope of
  61 Compliance Testing.  At this point in the discussion however it was not
  62 yet noted the stark problem that the *mandatory* RISC-V Specification
  63 also faces, by virtue of there being no transitional way to bring in
  64 show-stopping critical alterations.
  65
  66 To put this into perspective, just taking into account hardware costs
  67 alone: with production mask charges for 28nm being around USD $1.5m,
  68 engineering development costs and licensing of RTLs for peripherals
  69 being of a similar magnitude, no manufacturer is going to back away
  70 from selling a "flawed" or "legacy" product (whether it complies with
  71 the RISC-V Specification or not) without a bitter fight.
  72
  73 It was also pointed out that there will be significant software tool
  74 maintenance costs for manufacturers, meaning that the probability will
  75 be extremely high that they will refuse to shoulder such costs, and
  76 will publish and continue to publish (and use) hopelessly out-of-date
  77 unpatched tools.  This practice is well-known to result in security
  78 flaws going unpatched, with one of many immediate undesirable consequences
  79 being that product in extremely large volume gets discarded into landfill.
  80
  81 **All and any of the issues that were discussed, and all of those that
  82 were not, can be avoided by providing a hardware-level runtime-enabled
  83 forwards and backwards compatible transition path between *all* parts
  84 (mandatory or not) of current and future revisions of the RISC-V ISA
  85 Standard.**
  86
  87 The rest of the discussion - indicative as it was of the stark mutually
  88 exclusive gap being faced by the RISC-V ISA Standard given that it does
  89 not cope with the problem - was an effort by two groups in two clear
  90 camps: one that wanted things to remain as they are, and another that
  91 made efforts to point out that the consequences of not taking action
  92 are clearly extreme and irreversible (which, unfortunately, given the
  93 severity, some of the first group were unable to believe, despite there
  94 being clear historical precedent for the exact same mistake being made in
  95 other architectures, and the consequences on the same being absolutely
  96 clear).
  97
  98 However after a significant amount of time, certain clear requirements came
  99 out of the discussion:
 100
 101 * Any proposal must be a minimal change with minimal (or zero) impact
 102 * Any proposal should place no restriction on existing or future
 103   ISA encoding space
 104 * Any proposal should take into account that there are existing implementors
 105   of the (yet to be finalised but still "partly frozen") Standard who may
 106   resist, for financial investment reasons, efforts to make any change
 107   (at all) that could cost them immediate short-term profits.
 108
 109 Several proposals were put forward (and some are still under discussion)
 110
 111 * "Do nothing": problem is not severe: no action needed.
 112 * "Do nothing": problem is out-of-scope for RISC-V Foundation.
 113 * "Do nothing": problem complicates Compliance Testing (and is out of scope)
 114 * "MISA": the MISA CSR enables and disables extensions already: use that
 115 * "MISA-like": a new CSR which switches in and out new encodings
 116   (without destroying state)
 117 * "mvendorid/marchid WARL": switching the entire "identity" of a machine
 118 * "ioctl-like": a OO proposal based around the linux kernel "ioctl" system.
 119
 120 Each of these will be discussed below in their own sections.
 121
 122 # Do nothing (no problem exists)
 123
 124 (Summary: not an option)
 125
 126 There were several solutions offered that fell into this category.
 127 A few of them are listed in the introduction; more are listed below,
 128 and it was exhaustively (and exhaustingly) established that none of
 129 them are workable.
 130
 131 Initially it was pointed out that Fabless Semiconductor companies could
 132 simply license multiple Custom Extensions and a suitable RISC-V core, and
 133 modify them accordingly.  The Fabless Semi Company would be responsible
 134 for paying the NREs on re-developing the test vectors (as the extension
 135 licensers would be extremely unlikely to do that without payment), and
 136 given that said Companies have an "integration" job to do, it would
 137 be reasonable to expect them to have such additional costs as well.
 138
 139 The costs of this approach were outlined and discussed as being
 140 disproportionate and extreme compared to the actual likely cost of
 141 licensing the Custom Extensions in the first place.  Additionally it
 142 was pointed out that not only hardware NREs would be involved but
 143 custom software tools (compilers and more) would also be required
 144 (and maintained separately, on the basis that upstream would not
 145 accept them except under extreme pressure, and then only with
 146 prejudice).
 147
 148 All similar schemes involving customisation of the custom extensions
 149 were likewise rejected, but not before the customisation process was
 150 mistakenly conflated with tne *normal* integration process of developing
 151 a custom processor (Bus Architectures, Cache layouts, peripheral layouts).
 152
 153 The most compelling hardware-related reason (excluding the severe impact on
 154 the software ecosystem) for rejecting the customisation-of-customisation
 155 approach was the case where Extensions were using an instruction encoding
 156 space (48-bit, 64-bit) *greater* than that which the chosen core could
 157 cope with (32-bit, 48-bit).
 158
 159 Overall, none of the options presented were feasible, and, in addition,
 160 with no clear leadership from the RISC-V Foundation on how to avoid
 161 global world-wide encoding conflict, even if they were followed through,
 162 still would result in the failure of the RISC-V ecosystem due to
 163 irreversible global conflicting ISA binary-encoding meanings (POWERPC's
 164 Altivec / SPE nightmare).
 165
 166 This in addition to the case where the RISC-V Foundation wishes to
 167 fix a critical show-stopping update to the Standard, post-release,
 168 where billions of dollars have been spent on deploying RISC-V in the
 169 field.
 170
 171 # Do nothing (out of scope)
 172
 173 (Summary: may not be RV Foundation's "scope", still results in
 174 problem, so not an option)
 175
 176 This was one of the first arguments presented: The RISC-V Foundation
 177 considers Custom Extensions to be "out of scope"; that "it's not their
 178 problem, therefore there isn't a problem".
 179
 180 The logical errors in this argument were quickly enumerated: namely that
 181 the RISC-V Foundation is not in control of the uses to which RISC-V is
 182 put, such that public global conflicts in binary-encoding are a hundred
 183 percent guaranteed to occur (*outside* of the control and remit of the
 184 RISC-V Foundation), and a hundred percent guaranteed to occur in
 185 *commodity* hardware where Debian, Fedora, SUSE and other distros will
 186 be hardest hit by the resultant chaos, and that will just be the more
 187 "visible" aspect of the underlying problem.
 188
 189 # Do nothing (Compliance too complex, therefore out of scope)
 190
 191 (Summary: may not be RV Foundation's "scope", still results in
 192 problem, so not an option)
 193
 194 The summary here was that Compliance testing of Custom Extensions is
 195 not just out-of-scope, but even if it was taken into account that
 196 binary-encoding meanings could change, it would still be out-of-scope.
 197
 198 However at the time that this argument was made, it had not yet been
 199 appreciated fully the impact that revisions to the Standard would have,
 200 when billions of dollars worth of (older, legacy) RISC-V hardware had
 201 already been deployed.
 202
 203 Two interestingly diametrically-opposed equally valid arguments exist here:
 204
 205 * Whilst Compliance testing of Custom Extensions is definitely legitimately
 206   out of scope, Compliance testing of simultaneous legacy (old revisions of
 207   ISA Standards) and current (new revisions of ISA Standard) definitely
 208   is not.  Efforts to reduce *Compliance Testing* complexity is therefore
 209   "Compliance Tail Wagging Standard Dog".
 210 * Beyond a certain threshold, complexity of Compliance Testing is so
 211   burdensome that it risks outright rejection of the entire Standard.
 212
 213 Meeting these two diametrically-opposed perspectives requires that the
 214 solution be very, very simple.
 215
 216 # MISA
 217
 218 (Summary: MISA not suitable, leads to better idea)
 219
 220 MISA permits extensions to be disabled by masking out the relevant bit.
 221 Hypothetically it could be used to disable one extension, then enable
 222 another that happens to use the same binary encoding.
 223
 224 *However*:
 225
 226 * MISA Extension disabling is permitted (optionally) to **destroy**
 227   the state information.  Thus it is totally unsuitable for cases
 228   where instructions from different Custom extensions are needed in
 229   quick succession.
 230 * MISA was only designed to cover Standard Extensions.
 231 * There is nothing to prevent multiple Extensions being enabled
 232   that wish to simultaneously interpret the same binary encoding.
 233 * There is nothing in the MISA specification which permits
 234   *future* versions (bug-fixes) of the RISC-V ISA to be "switched in".
 235
 236 Overall, whilst the MISA concept is a step in the right direction it's
 237 a hundred percent unsuitable for solving the problem.
 238
 239 # MISA-like
 240
 241 (Summary: basically same as mvend/march WARL except needs an extra CSR where
 242 mv/ma doesn't. Along right lines, doesn't meet full requirements)
 243
 244 Out of the MISA discussion came a "MISA-like" proposal, which would
 245 take into account the flaws pointed out by trying to use "MISA":
 246
 247 * The MISA-like CSR's meaning would be identified by compilers using the
 248   mvendor-id/march-id tuple as a compiler target
 249 * Each custom-defined bit of the MISA-like CSR would (mutually-exclusively)
 250   redirect binary encoding(s) to specific encodings
 251 * No Extension would *actually* be disabled: its internal state would
 252   be left on (permanently) so that switching of ISA decoding
 253   could be done inside inner loops without adverse impact on
 254   performance.
 255
 256 Whilst it was the first "workable" solution it was also noted that the
 257 scheme is invasive: it requires an entirely new CSR to be added
 258 to the privileged spec (thus making existing implementations redundant).
 259 This does not fulfil the "minimum impact" requirement.
 260
 261 Also interesting around the same time an additional discussion was
 262 raised that covered the *compiler* side of the same equation.  This
 263 revolved around using mvendorid-marchid tuples at the compiler level,
 264 to be put into assembly output (by gcc), preserving the required
 265 *globally* unique identifying information for binutils to successfully
 266 turn the custom instruction into an actual binary-encoding (plus
 267 binary-encoding of the context-switching information).  (**TBD, Jacob,
 268 separate page?  review this para?**)
 269
 270 # mvendorid/marchid WARL
 271
 272 (Summary: the only idea that meets the full requirements.  Needs
 273  toolchain backup, but only when the first chip is released)
 274
 275 Coming out of the software-related proposal by Jacob Bachmeyer, which
 276 hinged on the idea of a globally-maintained gcc / binutils database
 277 that kept and coordinated architectural encodings (curated by the Free
 278 Software Foundation), was to quite simply make the mvendorid and marchid
 279 CSRs have WARL (writeable) characteristics.  For instances where mvendorid
 280 and marchid are readable, that would be taken to be a Standards-mandatory
 281 "declaration" that the architecture has *no* Custom Extensions (and that
 282 it conforms precisely to one and only one specific variant of the
 283 RISC-V Specification).
 284
 285 This incredibly simple non-invasive idea has some unique and distinct
 286 advantages over other proposals:
 287
 288 * Existing designs - even though the specification is not finalised
 289   (but has "frozen" aspects) - would be completely unaffected: the
 290   change is to the "wording" of the specification to "retrospectively"
 291   fit reality.
 292 * Unlike with the MISA idea this is *purely* at the "decode" phase:
 293   no internal Extension state information is permitted to be disabled,
 294   altered or destroyed as a direct result of writing to the
 295   mvendor/march-id CSRs.
 296 * Compliance Testing may be carried out with a different vendorid/marchid
 297   tuple set prior to a test, allowing a vendor to claim *Certified*
 298   compatibility with *both* one (or more) legacy variants of the RISC-V
 299   Specification *and* with a present one.
 300 * With sufficient care taken in the implementation an implementor
 301   may have multiple interpretations of the same binary encoding within
 302   an inner loop, with a single instruction (to the WARL register)
 303   changing the meaning.
 304
 305 A couple of points were made:
 306
 307 * Compliance Testing may **fail** any system that has mvendorid/marchid
 308   as WARL.  This however is a clear case of "Compliance Tail Wagging Standard
 309   Dog".
 310 * The redirection of meaning of certain binary encodings to multiple
 311   engines was considered extreme, eyebrow-raising, and also (importantly)
 312   potentially expensive, introducing significant latency at the decode
 313   phase.
 314
 315 On this latter point, it was observed that MISA already switches out entire
 316 sets of instructions (interacts at the "decode" phase).  The difference
 317 between what MISA does and the mvendor/march-id WARL idea is that whilst
 318 MISA only switches instruction decoding on (or off), the WARL idea
 319 *redirects* encoding, to *different* engines, fortunately in a deliberately
 320 mutually-exclusive fashion.
 321
 322 Implementations would therefore, in each Extension (assuming one separate
 323 "decode" engine per Extension), simply have an extra (mutually-exclusively
 324 enabled) wire in the AND gate for any given binary encoding, and in this
 325 way there would actually be very little impact on the latency.  The assumption
 326 here is that there are not dozens of Extensions vying for the same binary
 327 encoding (at which point the Fabless Semi Company has other much more
 328 pressing issues to deal with that make resolving encoding conflicts trivial
 329 by comparison).
 330
 331 Also pointed out was that in certain cases pipeline stalls could be introduced
 332 during the switching phase, if needed, just as they may be needed for
 333 correct implementation of (mandatory) support for MISA.
 334
 335 **This is the only one of the proposals that meet the full requirements**
 336
 337 # ioctl-like
 338
 339 (Summary: good solid orthogonal idea.  See [[ioctl]] for full details)
 340
 341 ==RB===
 342
 343 This proposal adds a standardised extension interface to the RV instruction set by introducing a fixed small number (e.g. 8) of "overloadable" R-type opcodes ext_ctl0, .. ext_ctl7. Each takes a process local interface cookie in rs1. Based on the cookie, the CPU routes the "overloaded" instructions to a "device" on or off the CPU that implements the actual semantics.
 344
 345 The cookie is "opened" with an additional r-type instruction ext_open that takes a 20 bit identifier and "closed" with an  ext_close instruction. The implementing hardware device can use the cookie to reference internal state. Thus, interfaces may be statefull.
 346
 347 CPU's and devices may implement several interfaces, indeed, are expected to. E.g. a single hardware device might expose a functional interface with 6 overloaded instructions, expose configuration with two highly device specific management interfaces with 8 resp. 4 overloaded instructions, and respond to a standardised save state interface with 4 overloaded instructions.
 348
 349 Having a standardised overloadable interface simply avoids much of the need for isa extensions for hardware with non standard interfaces and semantics. This is analogous to the way that the standardised overloadable ioctl interface of the kernel almost completely avoids the need for extending the kernel with syscalls for the myriad of hardware devices with their specific interfaces and semantics.
 350
 351 Since the rs1 input of the overloaded  ext_ctl instruction's are taken by the interface cookie, they are restricted in use compared to a normal R-type instruction (it is possible to pass 12 bits of additional info by or ing it with the cookie). Delegation is also expected to come at a small additional performance price compared to a "native" instruction. This should be an acceptable tradeoff in most cases.
 352
 353 The expanded flexibility comes at the cost: the standard can specify the semantics of the delegation mechanism and the interfacing with the rest of the cpu, but the actual semantics of the overloaded instructions can only be defined by the designer of the interface. Likewise, a device can be conforming as far as delegation and interaction with the CPU is concerned, but whether the hardware is conforming to the semantics of the interface is outside the scope of spec. Being able to specify that semantics using the methods used for RV itself is clearly very valuable. One impetus for doing that is using it for purposes of its own, effectively freeing opcode space for other purposes. Also, some interfaces may become de facto or de jure standards themselves, necessitating hardware to implement competing interfaces. I.e., facilitating a free for all, may lead to standards proliferation. C'est la vie.
 354
 355 The only "ISA-collisions" that can still occur are in the 20 bit (~10^6) interface identifier space, with 12 more bits to identify a device on a hart that implements the interface. One suggestion is setting aside 2^19 id's that are handed out for a small fee by a central (automated) registration (making sure the space is not just claimed), while the remaining 2^19 are used as a good hash on a long, plausibly globally unique human readable interface name. This gives implementors the choice between a guaranteed private identifier paying a fee, or relying on low probabilities. The interface identifier could also easily be extended to 42 bits on RV64.
 356
 357
 358 ====End RB==
 359
 360 This proposal basically mirrors the concept of POSIX ioctls, providing
 361 (arbitrarily) 8 functions (opcodes) whose meaning may be over-ridden
 362 in an object-orientated fashion by calling an "open handle" (and close)
 363 function (instruction) that switches (redirects) the 8 functions over to
 364 different opcodes.
 365
 366
 367 The "open handle" opcode takes a GUID (globally-unique identifier)
 368 and an ioctl number, and stores the UUID in a table indexed by the
 369 ioctl number:
 370
 371     handle_global_state[8] # stores UUID or index of same
 372
 373     def open_handle(uuid, ioctl_num):
 374           handle_global_state[ioctl_num] = uuid
 375
 376     def close_handle(ioctl_num):
 377           handle_global_state[ioctl_num] = -1 # clear table entry
 378
 379
 380 "Ioctls" (arbitrarily 8 separate R-type opcodes) then perform a redirect
 381 based on what the global state for that numbered "ioctl" has been set to:
 382
 383     def ioctl_fn0(*rargs): # star means "take all arguments as a tuple"
 384         if handle_global_state[0] == CUSTOMEXT1UUID:
 385            CUSTOMEXT1_FN0(*rargs) # apply all arguments to function
 386         elif handle_global_state[0] == CUSTOMEXT2UUID:
 387            CUSTOMEXT2_FN0(*rargs) # apply all arguments to function
 388         else:
 389             raise Exception("undefined opcode")
 390
 391 === RB ==
 392
 393 not quite I think. It is more like
 394
 395 // Hardware, implementing interface with UUID 0xABCD
 396
 397     def A_shutdown(cookie, data):
 398        ...
 399
 400     def A_init(data)
 401
 402     def A_do_stuff(cookie, data):
 403        ...
 404
 405     def A_do_more_stuff(cookie, data):
 406        ...
 407
 408     interfaceA = {
 409                   "shutdown": A_shutdown,
 410                   "init":     A_init,
 411                   "ctl0":     A_do_stuff,
 412                   "ctl1":     A_do_more_stuff
 413                  }
 414
 415 // hardware implementing interface with UUID = 0x1234
 416
 417     def B_do_things(cookie, data):
 418        ...
 419     def B_shutdown(cookie, data)
 420        ...
 421
 422     interfaceB = {
 423                   "shutdown": B_shutdown,
 424                   "ctl0":     B_do_things
 425                  }
 426
 427
 428 // The CPU being wired to the devices
 429
 430     cpu_interfaces = {
 431                   0xABCD: interfaceA,
 432                   0x1234: interfaceB
 433                  }
 434
 435 // The functionality that the CPU must implement to use the extension interface
 436
 437     cpu_open_handles = {}
 438
 439     __handleId = 0
 440     def new_unused_handle_id()
 441         __handleId = __handleId + 1
 442         return __handleId
 443
 444     def ext_open(uuid, data):
 445         interface = cpu_interface[uuid]
 446         if interface == NIL:
 447             raise Exception("No such interface")
 448
 449         handleId = new_unused_handle_id()
 450         cpu_open_handles[handleId] = (interface, CurrentVirtualMemoryAddressSpace)
 451
 452         cookie = A_init(data)                      # Here device takes over
 453
 454         return (handle_id, cookie)
 455
 456     def ext_close(handle, data):
 457         (handleId, cookie) = handle
 458         intf_VMA = cpu_open_handles[handleId]
 459         if intf_VMA == NIL:
 460              return -1
 461
 462         (interface, VMA) = intf_VMA
 463         if VMA != CurrentVirtualMemoryAddressSpace:
 464              return -1
 465         assert(interface != NIL)
 466         shutdown = interface["shutdown"]
 467         if shutdown != NIL:
 468
 469              err = interface.shutdown(cookie, data)  # Here device takes over
 470
 471              if err != 0:
 472                  return err
 473         cpu_open_handles[handleId] = NIL
 474         return 0
 475
 476     def ext_ctl0(handle, data):
 477         (handleId, cookie) = handle
 478         intf_VMA = cpu_open_handles[handleId]
 479         if intf_VMA == NIL:
 480              raise Exception("No such interface")
 481
 482         (interface, VMA) = intf_VMA
 483         if VMA != CurrentVirtualMemoryAddressSpace:
 484              raise Exception("No such interface")  #Disclosing that the interface exists in different address is security hole
 485
 486         assert(interface != NIL)
 487         ctl0 = interface["ctl0"]
 488         if ctl0 == NIL:
 489             raise Exception("No such Instruction")
 490
 491         return ctl0(cookie, data)                  # Here device takes over
 492
 493
 494 The other ext_ctl's are similar.
 495
 496 ==End RB==
 497
 498
 499
 500
 501 The proposal is functionally near-identical to that of the mvendor/march-id
 502 except extended down to individual opcodes.  As such it could hypothetically
 503 be proposed as an independent Standard Extension in its own right that extends
 504 the Custom Opcode space *or* fits into the brownfield spaces within the
 505 existing ISA opcode space *or* is used as the basis of an independent
 506 Custom Extension in its own right.
 507
 508 ==RB==
 509 I really think it should be in browncode
 510 ==RB==
 511
 512 One of the reasons for seeking an extension of the Custom opcode space is
 513 that the Custom opcode space is severely limited: only 2 opcodes are free
 514 within the 32-bit space, and only four total remain in the 48 and 64-bit
 515 space.
 516
 517 Despite the proposal (which is still undergoing clarification)
 518 being worthwhile in its own right, and standing on its own merits and
 519 thus definitely worthwhile pursuing, it is non-trivial and much more
 520 invasive than the mvendor/march-id WARL concept.
 521
 522
 523
 524 # Comments, Discussion and analysis
 525
 526 TBD: placeholder as of 26apr2018
 527
 528 # Summary and Conclusion
 529
 530 In the early sections (those in the category "no action") it was established
 531 in each case that the problem is not solved.  Avoidance of responsibility,
 532 or conflation of "not our problem" with "no problem" does not make "problem"
 533 go away.  Even "making it the Fabless Semiconductor's design problem" resulted
 534 in a chip being *more costly to engineer as hardware **and** more costly
 535 from a software-support perspective to maintain*... without actually
 536 fixing the problem.
 537
 538 The first idea considered which could fix the problem was to just use
 539 the pre-existing MISA CSR, however this was determined not to have
 540 the right coverage (Standard Extensions only), and also crucially it
 541 destroyed state.  Whilst unworkable it did lead to the first "workable"
 542 solution, "MISA-like".
 543
 544 The "MISA-like" proposal, whilst meeting most of the requirements, led to
 545 a better idea: "mvendor/march-id WARL", which, in combination with an offshoot
 546 idea related to gcc and binutils, is the only proposal that fully meets the
 547 requirements.
 548
 549 The "ioctl-like" idea *also* solves the problem, but, unlike the WARL idea
 550 does not meet the full requirements to be "non-invasive" and "backwards
 551 compatible" with pre-existing (pre-Standards-finalised) implementations.
 552 It does however stand on its own merit as a way to extend the extremely
 553 small Custom Extension opcode space, even if it itself implemented *as*
 554 a Custom Extension into which *other* Custom Extensions are subsequently
 555 shoe-horned.  This approach has the advantage that it requires no "approval"
 556 from the RISC-V Foundation... but without the RISC-V Standard "approval"
 557 guaranteeing no binary-encoding conflicts, still does not actually solve the
 558 problem (if deployed as a Custom Extension for extending Custom Extensions).
 559
 560 Overall the mvendor/march-id WARL idea meets the three requirements,
 561 and is the only idea that meets the three requirements:
 562
 563 * **Any proposal must be a minimal change with minimal (or zero) impact**
 564   (met through being purely a single backwards-compatible change to the
 565   wording of the specification: mvendor/march-id changes from read-only
 566   to WARL)
 567 * **Any proposal should place no restriction on existing or future
 568   ISA encoding space**
 569   (met because it is just a change to one pre-existing CSR, as opposed
 570   to requiring additional CSRs or requiring extra opcodes or changes
 571   to existing opcodes)
 572 * **Any proposal should take into account that there are existing implementors
 573   of the (yet to be finalised but still "partly frozen") Standard who may
 574   resist, for financial investment reasons, efforts to make any change
 575   (at all) that could cost them immediate short-term profits.**
 576   (met because existing implementations, with the exception of those
 577   that have Custom Extensions, come under the "vendor/arch-id read only
 578   is a formal declaration of an implementation having no Custom Extensions"
 579   fall-back category)
 580
 581 So to summarise:
 582
 583 * The consequences of not tackling this are severe: the RISC-V Foundation
 584   cannot take a back seat.  If it does, clear historical precedent shows
 585   100% what the outcome will be (1).
 586 * Making the mvendorid and marchid CSRs WARL solves the problem in a
 587   minimal to zero-disruptive backwards-compatible fashion that provides
 588   indefinite transparent *forwards*-compatibility.
 589 * The retro-fitting cost onto existing implementations (even though the
 590   specification has not been finalised) is zero to negligeable
 591   (only changes to words in the specification required at this time:
 592   no vendor need discard existing designs, either being designed,
 593   taped out, or actually in production).
 594 * The benefits are clear (pain-free transition path for vendors to safely
 595   upgrade over time; no fights over Custom opcode space; no hassle for
 596   software toolchain; no hassle for GNU/Linux Distros)
 597 * The implementation details are clear (and problem-free except for
 598   vendors who insist on deploying dozens of conflicting Custom Extensions:
 599   an extreme unlikely outlier).
 600 * Compliance Testing is straightforward and allows vendors to seek and
 601   obtain *multiple* Compliance Certificates with past, present and future
 602   variants of the RISC-V Standard (in the exact same processor,
 603   simultaneously), in order to support end-customer legacy scenarios and
 604   provide the same with a way to avoid "impossible-to-make" decisions that
 605   throw out ultra-costly multi-decade-investment in proprietary legacy
 606   software at the same as the (legacy) hardware.
 607
 608 -------
 609
 610 # Conversation Exerpts
 611
 612 The following conversation exerpts are taken from the ISA-dev discussion
 613
 614 ## (1) Albert Calahan on SPE / Altiven conflict in POWERPC
 615
 616 > Yes. Well, it should be blocked via legal means. Incompatibility is
 617 > a disaster for an architecture.
 618 >
 619 > The viability of PowerPC was badly damaged when SPE was
 620 > introduced. This was a vector instruction set that was incompatible
 621 > with the AltiVec instruction set. Software vendors had to choose,
 622 > and typically the choice was "neither". Nobody wants to put in the
 623 > effort when there is uncertainty and a market fragmented into
 624 > small bits.
 625 >
 626 > Note how Intel did not screw up. When SSE was added, MMX remained.
 627 > Software vendors could trust that instructions would be supported.
 628 > Both MMX and SSE remain today, in all shipping processors. With very
 629 > few exceptions, Intel does not ship chips with missing functionality.
 630 > There is a unified software ecosystem.
 631 >
 632 > This goes beyond the instruction set. MMU functionality also matters.
 633 > You can add stuff, but then it must be implemented in every future CPU.
 634 > You can not take stuff away without harming the architecture.
 635
 636 ## (2) Luke Kenneth Casson Leighton on Standards backwards-compatibility
 637
 638 > For the case where "legacy" variants of the RISC-V Standard are
 639 > backwards-forwards-compatibly supported over a 10-20 year period in
 640 > Industrial and Military/Goverment-procurement scenarios (so that the
 641 > impossible-to-achieve pressure is off to get the spec ABSOLUTELY
 642 > correct, RIGHT now), nobody would expect a seriously heavy-duty amount
 643 > of instruction-by-instruction switching: it'd be used pretty much once
 644 > and only once at boot-up (or once in a Hypervisor Virtual Machine
 645 > client) and that's it.
 646
 647 ## (3) Allen Baum on Standards Compliance
 648
 649 > Putting my compliance chair hat on: One point that was made quite
 650 > clear to me is that compliance will only test that an implementation
 651 > correctly implements the portions of the spec that are mandatory, and
 652 > the portions of the spec that are optional and the implementor claims
 653 > it is implementing. It will test nothing in the custom extension space,
 654 > and doesn't monitor or care what is in that space.
 655
 656 # References
 657
 658 * <https://groups.google.com/a/groups.riscv.org/forum/#!topic/isa-dev/7bbwSIW5aqM>
 659 * <https://groups.google.com/a/groups.riscv.org/forum/#!topic/isa-dev/InzQ1wr_3Ak%5B1-25%5D>