(no commit message)
[libreriscv.git] / resources.mdwn
1 # Resources and Specifications
2
3 This page aims to collect all the resources and specifications we need
4 in one place for quick access. We will try our best to keep links here
5 up-to-date. Feel free to add more links here.
6
7 [[!toc ]]
8
9 # Getting Started
10
11 This section is primarily a series of useful links found online
12
13 * [FSiC2019](https://wiki.f-si.org/index.php/FSiC2019)
14 * Fundamentals to learn to get started [[3d_gpu/tutorial]]
15
16 ## Is Open Source Hardware Profitable?
17 [RaptorCS on FOSS Hardware Interview](https://www.youtube.com/watch?v=o5Ihqg72T3c&feature=youtu.be)
18
19 # OpenPOWER ISA
20
21 * [3.0 PDF](https://openpowerfoundation.org/?resource_lib=power-isa-version-3-0)
22 * [2.07 PDF](https://openpowerfoundation.org/?resource_lib=ibm-power-isa-version-2-07-b)
23
24 ## Overview of the user ISA:
25
26 [Raymond Chen's PowerPC series](https://devblogs.microsoft.com/oldnewthing/20180806-00/?p=99425)
27
28 ## OpenPOWER OpenFSI Spec (2016)
29
30 * [OpenPOWER OpenFSI Spec](http://openpowerfoundation.org/wp-content/uploads/resources/OpenFSI-spec-100/OpenFSI-spec-20161212.pdf)
31
32 * [OpenPOWER OpenFSI Compliance Spec](http://openpowerfoundation.org/wp-content/uploads/resources/openpower-fsi-thts-1.0/openpower-fsi-thts-20180130.pdf)
33
34 ## JTAG
35
36 * [Useful JTAG implementation reference: Design Of IEEE 1149.1 TAP Controller IP Core by Shelja, Nandakumar and Muruganantham, DOI:10.5121/csit.2016.60910](https://airccj.org/CSCP/vol6/csit65610.pdf)
37
38 Abstract
39
40 "The objective of this work is to design and implement a TAP controller IP core compatible with IEEE 1149.1-2013 revision of the standard. The test logic architecture also includes the Test Mode Persistence controller and its associated logic. This work is expected to serve as a ready to use module that can be directly inserted in to a new digital IC designs with little modifications."
41
42 # RISC-V Instruction Set Architecture
43
44 **PLEASE UPDATE** - we are no longer implementing full RISCV, only user-space
45 RISCV
46
47 The Libre RISC-V Project is building a hybrid CPU/GPU SoC. As the name
48 of the project implies, we will be following the RISC-V ISA I due to it
49 being open-source and also because of the huge software and hardware
50 ecosystem building around it. There are other open-source ISAs but none
51 of them have the same momentum and energy behind it as RISC-V.
52
53 To fully take advantage of the RISC-V ecosystem, it is important to be
54 compliant with the RISC-V standards. Doing so will allow us to to reuse
55 most software as-is and avoid major forks.
56
57 * [Official compiled PDFs of RISC-V ISA Manual]
58 (https://github.com/riscv/riscv-isa-manual/releases/latest)
59 * [Working draft of the proposed RISC-V Bitmanipulation extension](https://github.com/riscv/riscv-bitmanip/blob/master/bitmanip-draft.pdf)
60 * [RISC-V "V" Vector Extension](https://riscv.github.io/documents/riscv-v-spec/)
61 * [RISC-V Supervisor Binary Interface Specification](https://github.com/riscv/riscv-sbi-doc/blob/master/riscv-sbi.md)
62
63 Note: As far as I know, we aren't using the RISC-V V Extension directly
64 at the moment. However, there are many wiki pages that make a reference
65 to the V extension so it would be good to include it here as a reference
66 for comparative/informative purposes with regard to Simple-V.
67
68 # Radix MMU
69 - [Qemu emulation](https://github.com/qemu/qemu/commit/d5fee0bbe68d5e61e2d2beb5ff6de0b9c1cfd182)
70
71 # D-Cache
72
73 ## D-Cache Possible Optimizations papers and links
74 - [ACDC: Small, Predictable and High-Performance Data Cache](https://dl.acm.org/doi/10.1145/2677093)
75
76 # BW Enhancing Shared L1 Cache Design research done in cooperation with AMD
77 - [Youtube video PACT 2020 - Analyzing and Leveraging Shared L1 Caches in GPUs](https://m.youtube.com/watch?v=CGIhOnt7F6s)
78 - [Url to PDF of paper on author's website (clicking will download the pdf)](https://adwaitjog.github.io/docs/pdf/sharedl1-pact20.pdf)
79
80
81 # RTL Arithmetic SQRT, FPU etc.
82
83 ## Sqrt
84 * [Fast Floating Point Square Root](https://pdfs.semanticscholar.org/5060/4e9aff0e37089c4ab9a376c3f35761ffe28b.pdf)
85 * [Reciprocal Square Root Algorithm](http://www.acsel-lab.com/arithmetic/arith15/papers/ARITH15_Takagi.pdf)
86
87 ## CORDIC and related algorithms
88
89 * <https://bugs.libre-soc.org/show_bug.cgi?id=127> research into CORDIC
90 * <https://bugs.libre-soc.org/show_bug.cgi?id=208>
91 * [BKM (log(x) and e^x)](https://en.wikipedia.org/wiki/BKM_algorithm)
92 * [CORDIC](http://www.andraka.com/files/crdcsrvy.pdf)
93 - Does not have an easy way of computing tan(x)
94 * [zipcpu CORDIC](https://zipcpu.com/dsp/2017/08/30/cordic.html)
95 * [Low latency and Low error floating point TCORDIC](https://ieeexplore.ieee.org/document/7784797) (email Michael or Cole if you don't have IEEE access)
96 * <http://www.myhdl.org/docs/examples/sinecomp/> MyHDL version of CORDIC
97
98 ## IEEE Standard for Floating-Point Arithmetic (IEEE 754)
99
100 Almost all modern computers follow the IEEE Floating-Point Standard. Of
101 course, we will follow it as well for interoperability.
102
103 * IEEE 754-2019: <https://standards.ieee.org/standard/754-2019.html>
104
105 Note: Even though this is such an important standard used by everyone,
106 it is unfortunately not freely available and requires a payment to
107 access. However, each of the Libre RISC-V members already have access
108 to the document.
109
110 * [Lecture notes - Floating Point Appreciation](http://pages.cs.wisc.edu/~markhill/cs354/Fall2008/notes/flpt.apprec.html)
111
112 Among other things, has a nice explanation on arithmetic, rounding modes and the sticky bit.
113
114 * [What Every Computer Scientist Should Know About Floating-Point Arithmetic](https://docs.oracle.com/cd/E19957-01/806-3568/ncg_goldberg.html)
115
116 Nice resource on rounding errors (ulps and epsilon) and the "table maker's dilemma".
117
118 ## Past FPU Mistakes to learn from
119
120 * [Intel Underestimates Error Bounds by 1.3 quintillion on
121 Random ASCII – tech blog of Bruce Dawson ](https://randomascii.wordpress.com/2014/10/09/intel-underestimates-error-bounds-by-1-3-quintillion/)
122 * [Intel overstates FPU accuracy 06/01/2013](http://notabs.org/fpuaccuracy)
123
124 # Khronos Standards
125
126 The Khronos Group creates open standards for authoring and acceleration
127 of graphics, media, and computation. It is a requirement for our hybrid
128 CPU/GPU to be compliant with these standards *as well* as with IEEE754,
129 in order to be commercially-competitive in both areas: especially Vulkan
130 and OpenCL being the most important. SPIR-V is also important for the
131 Kazan driver.
132
133 Thus the [[zfpacc_proposal]] has been created which permits runtime dynamic
134 switching between different accuracy levels, in userspace applications.
135
136 [**SPIR-V Main Page Link**](https://www.khronos.org/registry/spir-v/)
137
138 * [SPIR-V 1.5 Specification Revision 1](https://www.khronos.org/registry/spir-v/specs/unified1/SPIRV.html)
139 * [SPIR-V OpenCL Extended Instruction Set](https://www.khronos.org/registry/spir-v/specs/unified1/OpenCL.ExtendedInstructionSet.100.html)
140 * [SPIR-V GLSL Extended Instruction Set](https://www.khronos.org/registry/spir-v/specs/unified1/GLSL.std.450.html)
141
142 [**Vulkan Main Page Link**](https://www.khronos.org/registry/vulkan/)
143
144 * [Vulkan 1.1.122](https://www.khronos.org/registry/vulkan/specs/1.1-extensions/html/index.html)
145
146 [**OpenCL Main Page**](https://www.khronos.org/registry/OpenCL/)
147
148 * [OpenCL 2.2 API Specification](https://www.khronos.org/registry/OpenCL/specs/2.2/html/OpenCL_API.html)
149 * [OpenCL 2.2 Extension Specification](https://www.khronos.org/registry/OpenCL/specs/2.2/html/OpenCL_Ext.html)
150 * [OpenCL 2.2 SPIR-V Environment Specification](https://www.khronos.org/registry/OpenCL/specs/2.2/html/OpenCL_Env.html)
151
152 * OpenCL released the proposed OpenCL 3.0 spec for comments in april 2020
153
154 * [Announcement video](https://youtu.be/h0_syTg6TtY)
155 * [Announcement video slides (PDF)](https://www.khronos.org/assets/uploads/apis/OpenCL-3.0-Launch-Apr20.pdf)
156
157 Note: We are implementing hardware accelerated Vulkan and
158 OpenCL while relying on other software projects to translate APIs to
159 Vulkan. E.g. Zink allows for OpenGL-to-Vulkan in software.
160
161 # Graphics and Compute API Stack
162
163 I found this informative post that mentions Kazan and a whole bunch of
164 other stuff. It looks like *many* APIs can be emulated on top of Vulkan,
165 although performance is not evaluated.
166
167 <https://synappsis.wordpress.com/2017/06/03/opengl-over-vulkan-dev/>
168
169 * Pixilica is heading up an initiative to create a RISC-V graphical ISA
170
171 * [Pixilica 3D Graphical ISA Slides](https://b5792ddd-543e-4dd4-9b97-fe259caf375d.filesusr.com/ugd/841f2a_c8685ced353b4c3ea20dbb993c4d4d18.pdf)
172
173
174 # Various POWER Communities
175 - [An effort to make a 100% Libre POWER Laptop](https://www.powerpc-notebook.org/en/)
176 The T2080 is a POWER8 chip.
177 - [Power Progress Community](https://www.powerprogress.org/campaigns/donations-to-all-the-power-progress-community-projects/)
178 Supporting/Raising awareness of various POWER related open projects on the FOSS
179 community
180 - [OpenPOWER](https://openpowerfoundation.org)
181 Promotes and ensure compliance with the Power ISA amongst members.
182 - [OpenCapi](https://opencapi.org)
183 High performance interconnect for POWER machines. One of the big advantages
184 of the POWER architecture. Notably more performant than PCIE Gen4, and is
185 designed to be layered on top of the physical PCIE link.
186 - [OpenPOWER “Virtual Coffee” Calls](https://openpowerfoundation.org/openpower-virtual-coffee-calls/)
187 Truly open bi-weekly teleconference lines for anybody interested in helping
188 advance or adopting the POWER architecture.
189
190 # Conferences
191
192 ## Free Silicon Conference
193
194 The conference brought together experts and enthusiasts who want to build
195 a complete Free and Open Source CAD ecosystem for designing analog and
196 digital integrated circuits. The conference covered the full spectrum of
197 the design process, from system architecture, to layout and verification.
198
199 * <https://wiki.f-si.org/index.php/FSiC2019#Foundries.2C_PDKs_and_cell_libraries>
200
201 * LIP6's Coriolis - a set of backend design tools:
202 <https://www-soc.lip6.fr/equipe-cian/logiciels/coriolis/>
203
204 Note: The rest of LIP6's website is in French, but there is a UK flag
205 in the corner that gives the English version.
206
207 * KLayout - Layout viewer and editor: <https://www.klayout.de/>
208
209 # The OpenROAD Project
210
211 OpenROAD seeks to develop and foster an autonomous, 24-hour, open-source
212 layout generation flow (RTL-to-GDS).
213
214 * <https://theopenroadproject.org/>
215
216 # Other RISC-V GPU attempts
217
218 * <https://fossi-foundation.org/2019/09/03/gsoc-64b-pointers-in-rv32>
219
220 * <http://bjump.org/manycore/>
221
222 * <https://resharma.github.io/RISCV32-GPU/>
223
224 TODO: Get in touch and discuss collaboration
225
226 # Tests, Benchmarks, Conformance, Compliance, Verification, etc.
227
228 ## RISC-V Tests
229
230 RISC-V Foundation is in the process of creating an official conformance
231 test. It's still in development as far as I can tell.
232
233 * //TODO LINK TO RISC-V CONFORMANCE TEST
234
235 ## IEEE 754 Testing/Emulation
236
237 IEEE 754 has no official tests for floating-point but there are
238 well-known third party tools to check such as John Hauser's TestFloat.
239
240 There is also his SoftFloat library, which is a software emulation
241 library for IEEE 754.
242
243 * <http://www.jhauser.us/arithmetic/>
244
245 Jacob is also working on an IEEE 754 software emulation library written
246 in Rust which also has Python bindings:
247
248 * Source: <https://salsa.debian.org/Kazan-team/simple-soft-float>
249 * Crate: <https://crates.io/crates/simple-soft-float>
250 * Autogenerated Docs: <https://docs.rs/simple-soft-float/>
251
252 A cool paper I came across in my research is "IeeeCC754++ : An Advanced
253 Set of Tools to Check IEEE 754-2008 Conformity" by Dr. Matthias Hüsken.
254
255 * Direct link to PDF:
256 <http://elpub.bib.uni-wuppertal.de/servlets/DerivateServlet/Derivate-7505/dc1735.pdf>
257
258 ## Khronos Tests
259
260 OpenCL Conformance Tests
261
262 * <https://github.com/KhronosGroup/OpenCL-CTS>
263
264 Vulkan Conformance Tests
265
266 * <https://github.com/KhronosGroup/VK-GL-CTS>
267
268 MAJOR NOTE: We are **not** allowed to say we are compliant with any of
269 the Khronos standards until we actually make an official submission,
270 do the paperwork, and pay the relevant fees.
271
272 ## Formal Verification
273
274 Formal verification of Libre RISC-V ensures that it is bug-free in
275 regards to what we specify. Of course, it is important to do the formal
276 verification as a final step in the development process before we produce
277 thousands or millions of silicon.
278
279 Some learning resources I found in the community:
280
281 * ZipCPU: <http://zipcpu.com/> ZipCPU provides a comprehensive
282 tutorial for beginners and many exercises/quizzes/slides:
283 <http://zipcpu.com/tutorial/>
284 * Western Digital's SweRV CPU blog (I recommend looking at all their
285 posts): <https://tomverbeure.github.io/>
286 * <https://tomverbeure.github.io/risc-v/2018/11/19/A-Bug-Free-RISC-V-Core-without-Simulation.html>
287 * <https://tomverbeure.github.io/rtl/2019/01/04/Under-the-Hood-of-Formal-Verification.html>
288
289 ## Automation
290
291 * <https://www.ohwr.org/project/wishbone-gen>
292
293 # LLVM
294
295 ## Adding new instructions:
296
297 * <https://archive.fosdem.org/2015/schedule/event/llvm_internal_asm/>
298
299 # Branch Prediction
300
301 * <https://danluu.com/branch-prediction/>
302
303 # Python RTL Tools
304
305 * [Migen - a Python RTL](https://jeffrey.co.in/blog/2014/01/d-flip-flop-using-migen/)
306 * [LiTeX](https://github.com/timvideos/litex-buildenv/wiki/LiteX-for-Hardware-Engineers)
307 An SOC builder written in Python Migen DSL. Allows you to generate functional
308 RTL for a SOC configured with cache, a RISCV core, ethernet, DRAM support,
309 and parameterizeable CSRs.
310 * [Migen Tutorial](http://blog.lambdaconcept.com/doku.php?id=migen:tutorial>)
311 * There is a great guy, Robert Baruch, who has a good
312 [tutorial](https://github.com/RobertBaruch/nmigen-tutorial) on nMigen.
313 He also build an FPGA-proven Motorola 6800 CPU clone with nMigen and put
314 [the code](https://github.com/RobertBaruch/n6800) and
315 [instructional videos](https://www.youtube.com/playlist?list=PLEeZWGE3PwbbjxV7_XnPSR7ouLR2zjktw)
316 online.
317 * [Minerva](https://github.com/lambdaconcept/minerva)
318 An SOC written in Python nMigen DSL
319 * [Using our Python Unit Tests(old)](http://lists.libre-riscv.org/pipermail/libre-riscv-dev/2019-March/000705.html)
320 * <https://chisel.eecs.berkeley.edu/api/latest/chisel3/util/DecoupledIO.html>
321 * <http://www.clifford.at/papers/2016/yosys-synth-formal/slides.pdf>
322
323 # Other
324
325 * <https://wiki.f-si.org/index.php/FSiC2019>
326 * <https://fusesoc.net>
327 * <https://www.lowrisc.org/open-silicon/>
328 * <http://fpgacpu.ca/fpga/Pipeline_Skid_Buffer.html> pipeline skid buffer
329 * <https://pyvcd.readthedocs.io/en/latest/vcd.gtkw.html> GTKwave
330 * <http://www.sunburst-design.com/papers/CummingsSNUG2002SJ_Resets.pdf>
331 Synchronous Resets? Asynchronous Resets? I am so confused! How will I
332 ever know which to use? by Clifford E. Cummings
333 * <http://www.sunburst-design.com/papers/CummingsSNUG2008Boston_CDC.pdf>
334 Clock Domain Crossing (CDC) Design & Verification Techniques Using
335 SystemVerilog, by Clifford E. Cummings
336 In particular, see section 5.8.2: Multi-bit CDC signal passing using
337 1-deep / 2-register FIFO synchronizer.
338 * <http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-143.pdf>
339 Understanding Latency Hiding on GPUs, by Vasily Volkov
340 * Efabless "Openlane" <https://github.com/efabless/openlane>
341 * Co-simulation plugin for verilator, transferring to ECP5
342 <https://github.com/vmware/cascade>
343 * Multi-read/write ported memories
344 <https://tomverbeure.github.io/2019/08/03/Multiport-Memories.html>
345 * Data-dependent fail-on-first aka "Fault-tolerant speculative vectorisation"
346 <https://arxiv.org/pdf/1803.06185.pdf>
347 * OpenPOWER Foundation Membership
348 <https://openpowerfoundation.org/membership/how-to-join/membership-kit-9-27-16-4/>
349
350
351 # Real/Physical Projects
352
353 * [Samuel's KC5 code](http://chiselapp.com/user/kc5tja/repository/kestrel-3/dir?ci=6c559135a301f321&name=cores/cpu)
354 * <https://chips4makers.io/blog/>
355 * <https://hackaday.io/project/7817-zynqberry>
356 * <https://github.com/efabless/raven-picorv32>
357 * <https://efabless.com>
358 * <https://efabless.com/design_catalog/default>
359 * <https://wiki.f-si.org/index.php/The_Raven_chip:_First-time_silicon_success_with_qflow_and_efabless>
360 * <https://mshahrad.github.io/openpiton-asplos16.html>
361
362 # ASIC tape-out pricing
363
364 * <https://europractice-ic.com/wp-content/uploads/2020/05/General-MPW-EUROPRACTICE-200505-v8.pdf>
365
366 # Funding
367
368 * <https://toyota-ai.ventures/>
369 * [NLNet Applications](http://bugs.libre-riscv.org/buglist.cgi?columnlist=assigned_to%2Cbug_status%2Cresolution%2Cshort_desc%2Ccf_budget&f1=cf_nlnet_milestone&o1=equals&query_format=advanced&resolution=---&v1=NLnet.2019.02)
370
371 # Good Programming/Design Practices
372
373 * [Liskov Substitution Principle](https://en.wikipedia.org/wiki/Liskov_substitution_principle)
374 * [Principle of Least Astonishment](https://en.wikipedia.org/wiki/Principle_of_least_astonishment)
375 * <https://peertube.f-si.org/videos/watch/379ef007-40b7-4a51-ba1a-0db4f48e8b16>
376 * [Rust-Lang Philosophy and Consensus](http://smallcultfollowing.com/babysteps/blog/2019/04/19/aic-adventures-in-consensus/)
377
378 * <https://youtu.be/o5Ihqg72T3c>
379 * <http://flopoco.gforge.inria.fr/>
380 * Fundamentals of Modern VLSI Devices
381 <https://groups.google.com/a/groups.riscv.org/d/msg/hw-dev/b4pPvlzBzu0/7hDfxArEAgAJ>
382
383 # 12 skills summary
384
385 * <https://www.crnhq.org/cr-kit/>
386
387 # Analog Simulation
388
389 * <https://github.com/Isotel/mixedsim>
390 * <http://www.vlsiacademy.org/open-source-cad-tools.html>
391 * <http://ngspice.sourceforge.net/adms.html>
392 * <https://en.wikipedia.org/wiki/Verilog-AMS#Open_Source_Implementations>
393
394 # Libre-SOC Standards
395
396 This list auto-generated from a page tag "standards":
397
398 [[!inline pages="tagged(standards)" actions="no" archive="yes" quick="yes"]]
399
400 # Server setup
401
402 * [[resources/server-setup/web-server]]
403 * [[resources/server-setup/git-mirroring]]
404 * [[resources/server-setup/nagios-monitoring]]
405
406 # Testbeds
407
408 * <https://www.fed4fire.eu/testbeds/>
409
410 # Really Useful Stuff
411
412 * <https://github.com/im-tomu/fomu-workshop/blob/master/docs/requirements.txt>
413 * <https://github.com/im-tomu/fomu-workshop/blob/master/docs/conf.py#L39-L47>
414
415 # Digilent Arty
416
417 * https://store.digilentinc.com/pmod-sf3-32-mb-serial-nor-flash/
418 * https://store.digilentinc.com/arty-a7-artix-7-fpga-development-board-for-makers-and-hobbyists/
419 * https://store.digilentinc.com/pmod-vga-video-graphics-array/
420 * https://store.digilentinc.com/pmod-microsd-microsd-card-slot/
421 * https://store.digilentinc.com/pmod-rtcc-real-time-clock-calendar/
422 * https://store.digilentinc.com/pmod-i2s2-stereo-audio-input-and-output/
423
424 # CircuitJS experiments
425
426 * [[resources/high-speed-serdes-in-circuitjs]]