add link to wikipedia test-driven development
[libreriscv.git] / HDL_workflow.mdwn
1 # HDL workflow
2
3 This section describes the workflow and some best practices for developing
4 the Libre-SOC hardware. We use nmigen, yosys and symbiyosys, and this
5 page is intended not just to help you get set up, it is intended to
6 help advise you of some tricks and practices that will help you become
7 effective team contributors.
8
9 It is particularly important to bear in mind that we are not just
10 "developing code", here: we are creating a "lasting legacy educational
11 resource" for other people to learn from, and for businesses and students
12 alike to be able to use, learn from and augment for their own purposes.
13
14 It is also important to appreciate and respect that we are funded under
15 NLNet's Privacy and Enhanced Trust Programme <http://nlnet.nl/PET>. Full
16 transparency, readability, documentation, effective team communication
17 and formal mathematical proofs for all code at all levels is therefore
18 paramount.
19
20 Therefore, we need not only to be "self-sufficient" (absolutely under no circumstances critically reliant on somebody else's servers **or protocols**) we also need to ensure that everything (including all communication such as the mailing list archives) are recorded, replicable, and accessible in perpetuity. Use of slack or a "forum" either actively prevents or makes that much harder.
21
22 # Collaboration resources
23
24 The main message here: **use the right tool for the right job**.
25
26 * mailing list: general communication and discussion.
27 * bugtracker: task-orientated, goal-orientated *focussed* discussion.
28 * ikiwiki: document store, information store, and (editable) main website
29 * git repositories: code stores (**not binary or auto-generated output store**)
30 * ftp server (<http://ftp.libre-riscv.org>): large file store.
31
32 we will add an IRC channel at some point when there are enough people
33 to warrant having one (and it will be publicly archived)
34
35 note also the lack of a "forum" in the above list. this is very deliberate. forums are a serious distraction when it comes to technical heavily goal-orientated development. recent internet users may enjoy looking up the "AOL metoo postings" meme.
36
37 note also the complete lack of "social platforms". if we wanted to tell everybody how much better each of us are than anyone else in the team, how many times we made a commit (look at me, look at me, i'm so clever), and how many times we went to the bathroom, we would have installed a social media based project "management" system.
38
39 ## Main contact method: mailing list
40
41 To respect the transparency requirements, conversations need to be
42 public and archived (i.e not skype, not telegram, not discord,
43 and anyone seriously suggesting slack will be thrown to the
44 lions). Therefore we have a mailing list. Everything goes through
45 there. <http://lists.libre-riscv.org/mailman/listinfo/libre-riscv-dev>
46 therefore please do google "mailing list etiquette" and at the very
47 minimum look up and understand the following:
48
49 * This is a technical mailing list with complex topics. Top posting
50 is completely inappropriate. Don't do it unless you have mitigating
51 circumstances, and even then please apologise and explain ("hello sorry
52 using phone at airport flight soon, v. quick reply: ....")
53 * Always trim context but do not cut excessively to the point where people
54 cannot follow the discussion. Especially do not cut the attribution
55 ("On monday xxx wrote") of something that you are actually replying
56 to.
57 * Use inline replies i.e. reply at the point in the relevant part of
58 the conversation, as if you were actually having a conversation.
59 * Follow standard IETF reply formatting, using ">" for cascaded
60 indentation of other people's replies. If using gmail, please: SWITCH
61 OFF RICH TEXT EDITING.
62 * Please for god's sake do not use "my replies are in a different
63 colour". Only old and highly regarded people still using AOL are allowed
64 to get away with that (such as Mitch).
65 * Start a new topic with a relevant subject line. If an existing
66 discussion changes direction, change the subject line to reflect the
67 new topic (or start a new conversation entirely, without using the
68 "reply" button)
69 * DMARC is a pain on the neck. Try to avoid GPG signed messages. sigh.
70 * Don't send massive attachments. Put them online (no, not on facebook or
71 google drive or anywhere else that demands privacy violations) and provide
72 the link. Which should not require any kind of login to access. ask the
73 listadmin if you don't have anywhere suitable: FTP access can be arranged.
74
75 ### Actionable items from mailing list
76
77 If discussions result in any actionable items, it is important not to
78 lose track of them. Create a bugreport, find the discussion in the
79 archives <http://lists.libre-riscv.org/pipermail/libre-riscv-dev/>,
80 and put the link actually in the bugtracker as one of the comments.
81
82 At some point in any discussion, the sudden realisation may dawn on one
83 or more people that this is an "actionable" discussion. at that point
84 it may become better to use <http://bugs.libre-riscv.org>
85 itself to continue the discussion rather than to keep on dropping copies
86 of links into the bugtracker. The bugtracker sends copies of comments
87 *to* the list however this is 'one-way' (note from lkcl: because this
88 involves running an automated perl script from email, on every email,
89 on the server, that is a high security risk, and i'm not doing it. sorry.)
90
91 ### Mailing list != editable document store
92
93 Also, please do not use the mailing list as an "information or document
94 store or poor-man's editor". We have the wiki for that. Edit a page and
95 tell people what you did (summarise rather than drop the entire contents
96 at the list) and include the link to the page.
97
98 Or, if it is more appropriate, commit a document (or source code)
99 into the relevant git repository then look up the link in the gitweb
100 source tree browser and post that (in the bugtracker or mailing list)
101 See <http://git.libre-riscv.org>
102
103 ### gmail "spam"ifying the list
104
105 see <https://blog.kittycooper.com/2014/05/keeping-my-mailing-list-emails-out-of-gmails-spam-folder/>
106
107 basically it is possible to select any message from the list, create a "filter" (under "More"),
108 and, on the 2nd dialog box, click the "never send this to Spam" option.
109
110 ## Bugtracker
111
112 bugzilla. old and highly effective. sign up in the usual way. any
113 problems, ask on the list.
114
115 Please do not ask for the project to be transferred to github or other
116 proprietary nonfree service "because it's soooo convenient", as the
117 lions are getting wind and gout from overfeeding on that one.
118
119 ## ikiwiki
120
121 Runs the main libre-riscv.org site (including this page). effective,
122 stunningly light on resources, and uses a git repository not a database.
123 That means it can be edited offline.
124
125 Usual deal: register an account and you can start editing and contributing
126 straight away.
127
128 Hint: to create a new page, find a suitable page that would link to it,
129 first, then put the link in of the page you want to create, as if the
130 page already exists. Save that page, and you will find a questionmark
131 next to the new link you created. click that link, and it will fire up a
132 "create new page" editor.
133
134 Hint again: the wiki is backed by a git repository. Don't go overboard
135 but at the same time do not be afraid that you might "damage" or "lose"
136 pages. Although it would be a minor pain, the pages can always be
137 reverted or edited by the sysadmins to restore things if you get in a tiz.
138
139 Assistance in creating a much better theme greatly appreciated. e.g.
140 <http://www.math.cmu.edu/~gautam/sj/blog/20140720-ikiwiki-navbar.html>
141
142 ## git
143
144 we use git. more on this below. we also use gitolite3 running on a
145 dedicated server. again, it is extremely effective and low resource
146 utilisation. reminder: lions are involved if github is mentioned.
147
148 gitweb is provided which does a decent job. <http://git.libre-riscv.org>
149
150 ## ftp server
151
152 <http://ftp.libre-riscv.org> is available for storing large files
153 that do not belong in a git repository, if we have (or ever need)
154 any. images (etc.) if small and appropriate should go into the
155 wiki, however .tgz archives (etc.) and, at some point, binaries,
156 should be on the ftp server.
157
158 ask on the list if you have a file that belongs on the ftp server.
159
160 ## server
161
162 as an aside: all this is "old school" and run on a single core 512MB
163 VM with only a 20GB HDD allocation. it costs only 8 GBP per month from
164 mythic-beasts and means that the project is in no way dependent on anyone
165 else - not microsoft, not google, not facebook, not amazon.
166
167 we tried gitlab. it didn't go well. please don't ask to replace the
168 above extremely resource-efficient services with it.
169
170 # Hardware
171
172 RAM is the biggest requirement. Minimum 16GB, the more the better (32
173 or 64GB starts to reach "acceptable" levels. Disk space is not hugely
174 critical: 256GB SSD should be more than adequate. Simulations and
175 FPGA compilations however are where raw processing power is a must.
176 High end Graphics Cards are nonessential.
177
178 What is particularly useful is to have hi-res screens (curved is
179 *strongly* recommended if the LCD is over 24in wide, to avoid eyeballs
180 going "prism" through longterm use), and to have several of them: the
181 more the better. Either a DisplayLink UD160A (or more modern variant)
182 or simply using a second machine (lower spec hardware because it will
183 run editors) is really effective.
184
185 Also it is really recommended to have a UHD monitor (4k - 3840x2160),
186 or at least 2560x1200. If given a choice, 4:3 aspect ratio is better
187 than 16:9 particularly when using several of them. However, caveat
188 (details below): please when editing do not assume that everyone will
189 have access to such high resolution screens.
190
191 # Operating System
192
193 First install and become familiar with Debian (ubuntu if you absolutely
194 must) for standardisation cross-team and so that toolchain installation
195 is greatly simplified. yosys in particular warns that trying to use
196 Windows, BSD or MacOS will get you into a world of pain.
197
198 Only a basic GUI desktop is necessary: fvwm2, xfce4, lxde are perfectly
199 sufficient (alongside wicd-gtk for network management). Other more
200 complex desktops can be used however may consume greater resources.
201
202 # editors and editing
203
204 Whilst this is often a personal choice, the fact that many editors are
205 GUI based and run fullscreen with the entire right hand side *and* middle
206 *and* the majority of the left side of the hi-res screen entirely unused
207 and bereft of text leaves experienced developers both amused and puzzled.
208
209 At the point where such fullscreen users commit code with line lengths
210 well over 160 characters, that amusement quickly evaporates.
211
212 Where the problems occur with fullscreen editor usage is when a project
213 is split into dozens if not hundreds of small files (as this one is). At
214 that point it becomes pretty much essential to have as many as six to
215 eight files open *and on-screen* at once, without overlaps i.e. not in
216 hidden tabs, next to at least two if not three additional free and clear
217 terminals into which commands are regularly and routinely typed (make,
218 git commit, nosetests3 etc). Illustrated with the following 3840x2160
219 screenshot (click to view full image), where *every one* of those 80x70
220 xterm windows is *relevant to the task at hand*.
221
222 [[!img 2020-01-24_11-56.png size=640x ]]
223
224 (hint/tip: fvwm2 set up with "mouse-over to raise focus, rather than
225 additionally requiring a mouseclick, can save a huge amount of cumulative
226 development time here, switching between editor terminal(s) and the
227 command terminals).
228
229 Once this becomes necessary, it it turn implies that having greater
230 than 80 chars per line - and running editors fullscreen - is a severe
231 hindance to an essential *and highly effective* workflow technique.
232
233 Additionally, care should be taken to respect that not everyone will have
234 200+ column editor windows and the eyesight of a hawk. They may only have
235 a 1280 x 800 laptop which barely fits two 80x53 xterms side by side.
236 Consequently, having excessively long functions is also a hindrance to
237 others, as such developers with limited screen resources would need to
238 continuously page-up and page-down to read the code even of a single
239 function, in full.
240
241 This helps explain in part, below, why compliance with pep8 is enforced,
242 including its 80 character limit. In short: not everyone has the same
243 "modern" GUI workflow or has access to the same computing resources as
244 you, so please do respect that.
245
246 More on this concept is
247 [here](https://www.linuxjournal.com/content/line-length-limits).
248 Note *very pointedly* that Linus Torvalds *specifically* states that
249 he does not want Linux kernel development to become the exclusive
250 domain of the "wealthy". That means **no** to assumptions about
251 access to ultra-high resolution screens.
252
253 # Software prerequisites
254
255 Whilst many resources online advocate "sudo" in front of all root-level
256 commands below, this quickly becomes tiresome. run "sudo bash", get a
257 root prompt, and save yourself some typing.
258
259 * sudo bash
260 * apt-get install vim exuberant-ctags
261 * apt-get install build-essential
262 * apt-get install git python3.7 python3.7-dev python-nosetest3
263 * apt-get install graphviz xdot gtkwave
264 * apt-get install python3-venv
265 * apt-get install python-virtualenv # this is an alternative to python3-venv
266 * return to user prompt (ctrl-d)
267
268 This will get you python3 and other tools that are needed. graphviz is
269 essential for showing the interconnections between cells, and gtkwave
270 is essential for debugging.
271
272 If you would like to save yourself a lot more typing, check out the
273 [dev-env-setup](https://git.libre-soc.org/?p=dev-env-setup.git;a=summary)
274 repository, examine the scripts there and use them to automate much of
275 the process below.
276
277 ## git
278
279 Look up good tutorials on how to use git effectively. There are so many
280 it is hard to recommend one. This is however essential. If you are not
281 comfortable with git, and you let things stay that way, it will seriously
282 impede development progress.
283
284 If working all day you should expect to be making at least two commits per
285 hour, so should become familiar with it very quickly. If you are *not*
286 doing around 2 commits per hour, something is wrong and you should read
287 the workflow instructions below more carefully, and also ask for advice
288 on the mailing list.
289
290 Worth noting: *this project does not use branches*. All code is committed
291 to master and we *require* that it be either zero-impact additions or that
292 relevant unit tests pass 100%. This ensures that people's work does not
293 get "lost" or isolated and out of touch due to major branch diversion,
294 and that people communicate and coordinate with each other.
295
296 ## yosys
297
298 Follow the source code (git clone) instructions here:
299 <http://www.clifford.at/yosys/download.html>
300
301 Or, alternatively, use the
302 [yosys-et-al](https://git.libre-soc.org/?p=dev-env-setup.git;a=blob;f=yosys-et-al;hb=HEAD)
303 script (which also installs symbiyosys and its dependencies)
304
305 Do not try to use a fixed revision of yosys (currently 0.9), nmigen is evolving
306 and frequently interacts with yosys.
307
308 ## symbiyosys
309
310 Follow the instructions here:
311 <https://symbiyosys.readthedocs.io/en/latest/quickstart.html#installing>
312
313 You do not have to install all of those (avy, boolector can be left
314 out if desired) however the more that are installed the more effective
315 the formal proof scripts will be (less resource utilisation in certain
316 circumstances).
317
318 ## nmigen
319
320 nmigen may be installed as follows:
321
322 * mkdir ~/src
323 * cd !$
324 * git clone https://github.com/nmigen/nmigen.git
325 * cd nmigen
326 * sudo bash
327 * python3 setup.py develop
328 * ctrl-d
329
330 testing can then be carried out with "python3 setup.py test"
331
332 ## Softfloat and sfpy
333
334 These are a test suite dependency for the ieee754fpu library, and
335 will be changed in the future to use Jacob's [simple-soft-float](https://crates.io/crates/simple-soft-float)
336 library. In the meantime, sfpy can be built as follows:
337
338 git clone --recursive https://github.com/billzorn/sfpy.git
339 cd sfpy
340 cd SoftPosit
341 git apply ../softposit_sfpy_build.patch
342 git apply /path/to/ieee754fpu/SoftPosit.patch
343 cd ../berkely-softfloat-3
344 # Note: Do not apply the patch included in sfpy for berkely-softfloat,
345 # it contains the same changes as this one
346 git apply /path/to/ieee754fpu/berkeley-softfloat.patch
347 cd ..
348
349 # prepare a virtual environment for building
350 python3 -m venv .env
351
352 # or, if you prefer the old way:
353 # virtualenv -p python3 .env
354
355 # install dependencies
356 source .env/bin/activate
357 pip3 install --upgrade -r requirements.txt
358
359 # build
360 make lib -j8
361 make cython
362 make inplace -j8
363 make wheel
364
365 # install
366 deactivate # deactivates venv, optional
367 pip3 install dist/sfpy*.whl
368
369 You can test your installation by doing the following:
370
371 python3
372 >>> from sfpy import *
373 >>> Posit8(1.3)
374
375 It should print out `Posit8(1.3125)`
376
377 ## qemu, cross-compilers, gdb
378
379 As we are doing POWER ISA, POWER ISA compilers, toolchains and
380 emulators are required.
381
382 Install powerpc64 gcc:
383
384 apt-get install gcc-9-powerpc64-linux-gnu
385
386 Install qemu:
387
388 apt-get install qemu-system-ppc
389
390 Install gdb from source. Obtain the latest tarball, unpack it, then:
391
392 cd gdb-9.1 (or other location)
393 mkdir build
394 cd build
395 ../configure --srcdir=.. --host=x86_64-linux --target=powerpc64-linux-gnu
396 make -j16
397 make install
398
399 ## Coriolis2
400
401 See [[coriolis2]] page, for those people doing layout work.
402
403 # Registering for git repository access
404
405 After going through the onboarding process and having agreed to take
406 responsibility for certain tasks, ask on the mailing list for git
407 repository access, sending in a public key (id_rsa.pub). If you do
408 not have one then generate it with ssh-keygen -t rsa. You will find it
409 in ~/.ssh
410
411 NEVER SEND ANYONE THE PRIVATE KEY. By contrast the public key, on
412 account of being public, is perfectly fine to make... err... public.
413
414 Create a file ~/.ssh/config with the following lines:
415
416 Host git.libre-riscv.org
417 Port 922
418
419 Wait for the Project Admin to confirm that the ssh key has been added
420 to the required repositories. Once confirmed, you can clone any of the
421 repos at http://git.libre-riscv.org:
422
423 git clone gitolite3@git.libre-riscv.org:REPONAME.git
424
425 Alternatively, the .ssh/config can be skipped and this used:
426
427 git clone ssh://gitolite3@git.libre-riscv.org:922/REPONAME.git
428
429 # git configuration
430
431 Although there are methods online which describe how (and why) these
432 settings are normally done, honestly it is simpler and easier to open
433 ~/.gitconfig and add them by hand.
434
435 core.autocrlf is a good idea to ensure that anyone adding DOS-formatted
436 files they don't become a pain. pull.rebase is something that is greatly
437 preferred for this project because it avoids the mess of "multiple
438 extra merge git tree entries", and branch.autosetuprebase=always will,
439 if you want it, always ensure that a new git checkout is set up with rebase.
440
441 [core]
442 autocrlf = input
443 [push]
444 default = simple
445 [pull]
446 rebase = true
447 [branch]
448 autosetuprebase = always
449
450 # Checking out the HDL repositories
451
452 * mkdir ~/src
453 * cd !$
454 * git clone gitolite3@git.libre-riscv.org:nmutil.git
455 * git clone gitolite3@git.libre-riscv.org:ieee754fpu.git
456 * git clone gitolite3@git.libre-riscv.org:soc.git
457
458 In each of these directories, in the order listed, track down the
459 setup.py file, then, as root (sudo bash), run the following:
460
461 * python3 setup.py develop
462
463 The reason for using "develop" mode is that the code may be edited
464 in-place yet still imported "globally". There are variants on this theme
465 for multi-user machine use however it is often just easier to get your
466 own machine these days.
467
468 The reason for the order is because soc depends on ieee754fpu, and
469 ieee754fpu depends on nmutil
470
471 If "python3 setup.py install" is used it is a pain: edit, then
472 install. edit, then install. It gets extremely tedious, hence why
473 "develop" was created.
474
475 # Development Rules
476
477 team communication:
478
479 * new members, add yourself to the [[about_us]] page and create yourself a home page using someone else's page as a template.
480 * communicate on the mailing list or the bugtracker an intent to take
481 responsibility for a particular task.
482 * assign yourself as the bug's owner
483 * *keep in touch* about what you are doing, and why you are doing it.
484 * edit your home page regularly, particularly to track tasks so that they can be paid by NLNet.
485 * if you cannot do something that you have taken responsibility for,
486 then unless it is a dire personal emergency please say so, on-list. we
487 won't mind. we'll help sort it out.
488
489 regarding the above it is important that you read, understand, and agree
490 to the [[charter]] because the charter is about ensuring that we operate
491 as an effective organisation. It's *not* about "setting rules and meting
492 out punishment".
493
494 ## Coding
495
496 for actual code development
497
498 ### Plan unit tests
499
500 * plan in advance to write not just code but a full test suite for
501 that code. **this is not optional**. large python projects that do not
502 have unit tests **FAIL** (see separate section below).
503 * Prioritise writing formal proofs and a single clear unit test that is more like a "worked example".
504 We receive NLNet funds for writing formal proofs, plus they
505 cover corner cases and take far less time to write
506
507 ### Commit tested or zero-dependent code
508
509 * only commit code that has been tested (or is presently unused). other
510 people will be depending on you, so do take care not to screw up.
511 not least because, as it says in the [[charter]] it will be your
512 responsibility to fix. that said, do not feel intimidated: ask for help
513 and advice, and you'll get it straight away.
514
515 ### Commit often
516
517 * commit often. several times a day, and "git push" it. this is
518 collaboration. if something is left even overnight uncommitted and not
519 pushed so that other people can see it, it is a red flag. if you find
520 yourself thinking "i'll commit it when it's finished" or "i don't want to
521 commit something that people might criticise" *this is not collaboration*,
522 it is making yourself a bottleneck. pair-programming is supposed to help
523 avoid this kind of thing however pair-programming is difficult to organise
524 for remote collaborative libre projects (suggestions welcomed here)
525
526 ### Absolutely no auto-generated output
527
528 * **do not commit autogenerated output**. write a shell script and commit
529 that, or add a Makefile to run the command that generates the output, but
530 **do not** add the actual output of **any** command to the repository.
531 ever. this is really important. even if it is a human-readable file
532 rather than a binary object file.
533 it is very common to add pdfs (the result of running latex2pdf) or configure.in (the result of running automake), they are an absolute nuisance and interfere hugely with git diffs, as well as waste hard disk space *and* network bandwidth. don't do it.
534
535 ### Write commands that do tasks and commit those
536
537 * if the command needed to create any given autogenerated output is not
538 currently in the list of known project dependencies, first consult on
539 the list if it is okay to make that command become a hard dependency of
540 the project (hint: java, node.js php and .NET commands may cause delays
541 in response time due to other list participants laughing hysterically),
542 and after a decision is made, document the dependency and how its source
543 code is obtained and built (hence why it has to be discussed carefully)
544 * if you find yourself repeating commands regularly, chances are high
545 that someone else will need to run them, too. clearly this includes
546 yourself, therefore, to make everyone's lives easier including your own,
547 put them into a .sh shell script (and/or a Makefile), commit them to
548 the repository and document them at the very minimum in the README,
549 INSTALL.txt or somewhere in a docs folder as appropriate. if unsure,
550 ask on the mailing list for advice.
551
552 ### Keep commits single-purpose
553
554 * edit files making minimal *single purpose* modifications (even if
555 it involves multiple files. Good extreme example: globally changing
556 a function name across an entire codebase is one purpose, one commit,
557 yet hundreds of files. miss out one of those files, requiring multiple
558 commits, and it actually becomes a nuisance).
559
560 ### Run unit tests prior to commits
561
562 * prior to committing make sure that relevant unit tests pass, or that
563 the change is a zero-impact addition (no unit tests fail at the minimum)
564
565 ### Do not break existing code
566
567 * keep working code working **at all times**. find ways to ensure that this is the case. examples include writing alternative classes that replace existing functionality and adding runtime options to select between old and new code.
568
569 ### Small commits with relevant commit message
570
571 * commit no more than around 5 to 10 lines at a time, with a CLEAR message
572 (no "added this" or "changed that").
573 * if as you write you find that the commit message involves a *list* of
574 changes or the word "and", then STOP. do not proceed: it is a "red flag"
575 that the commit has not been properly broken down into separate-purpose
576 commits. ask for advice on-list on how to proceed.
577
578 ### Exceptions to small commit: atomic single purpose commit
579
580 * if it is essential to commit large amounts of code, ensure that it
581 is **not** in use **anywhere** by any other code. then make a *small*
582 (single purpose) followup commit which actually puts that code into use.
583
584 this last rule is kinda flexible, because if you add the code *and* add
585 the unit test *and* added it into the main code *and* ran all relevant
586 unit tests on all cascade-impacted areas by that change, that's perfectly
587 fine too. however if it is the end of a day, and you need to stop and
588 do not have time to run the necessary unit tests, do *not* commit the
589 change which integrates untested code: just commit the new code (only)
590 and follow up the next day *after* running the full relevant unit tests.
591
592 ### Why such strict rules?
593
594 the reason for all the above is because python is a dynamically typed language.
595 make one tiny change at the base level of the class hierarchy and the
596 effect may be disastrous.
597
598 it is therefore worth reiterating: make absolutely certain that you *only*
599 commit working code or zero-impact code.
600
601 therefore, if you are absolutely certain that a new addition (new file,
602 new class, new function) is not going to have any side-effects, committing
603 it (a large amount of code) is perfectly fine.
604
605 as a general rule, however, do not use this an an excuse to write code
606 first then write unit tests as an afterthought. write *less* code *in
607 conjunction* with its (more basic) unit tests, instead. then, folliw up with
608 additions and improvements.
609
610 the reason for separating out commits to single purpose only becomes
611 obvious (and regretted if not followed) when, months later, a mistake
612 has to be tracked down and reverted. if the commit does not have an
613 easy-to-find message, it cannot even be located, and once found, if the
614 commit confuses several unrelated changes, not only the diff is larger
615 than it should be, the reversion process becomes extremely painful.
616
617 ### PEP8 format
618
619 * all code needs to conform to pep8. use either pep8checker or better
620 run autopep8. however whenever committing whitespace changes, *make a
621 separate commit* with a commit message "whitespace" or "autopep8 cleanup".
622 * pep8 REQUIRES no more than 80 chars per line. this is non-negotiable. if
623 you think you need greater than 80 chars, it *fundamentally* indicates
624 poor code design. split the code down further into smaller classes
625 and functions.
626
627 ### Docstring checker
628
629 * TBD there is a docstring checker. at the minimum make sure to have
630 an SPD license header, module header docstring, class docstring and
631 function docstrings on at least non-obvious functions.
632
633 ### Clear code commenting and docstrings
634
635 * make liberal but not excessive use of comments. describe a group of
636 lines of code, with terse but useful comments describing the purpose,
637 documenting any side-effects, and anything that could trip you or other
638 developers up. unusual coding techniques should *definitely* contain
639 a warning.
640
641 ### Only one class per module (ish)
642
643 * unless they are very closely related, only have one module (one class)
644 per file. a file only 25 lines long including imports and docstrings
645 is perfectly fine however don't force yourself. again, if unsure,
646 ask on-list.
647
648 ### File and Directory hierarchy
649
650 * *keep files short and simple*. see below as to why
651 * create a decent directory hierarchy but do not go mad. ask for advice
652 if unsure
653
654 ### No import star!
655
656 * please do not use "from module import \*". it is extremely bad practice,
657 causes unnecessary resource utilisation, makes code readability and
658 tracking extremely difficult, and results in unintended side-effects.
659
660 example: often you want to find the code from which a class was imported.
661 nirmally you go to the top of the file, check the imports, and you know
662 exactly which file has the class because of the import path. by using
663 wildcards, you have absolutely *no clue* which wildcard imported which
664 class or classes.
665
666 example: sometimes you may accidentally have duplicate code maintained
667 in two or more places. editing one of them you find, puzzlingly, that
668 the code behaves in some files with the old behaviour, but in others it
669 works. after a nassive amount of investigation, you find that the working
670 files happen to have a wildcard import of the newer accidental duplicate
671 class **after** the wildcard import of the older class with exactly the
672 same name. if you had used explicit imports, you would have spotted
673 the double import of the class from two separate locations, immediately.
674
675 really. don't. use. wildcards.
676
677 ### Keep file and variables short but clear
678
679 * try to keep both filenames and variable names short but not ridiculously
680 obtuse. an interesting compromise on imports is "from ridiculousfilename
681 import longsillyname as lsn", and to assign variables as well: "comb =
682 m.d.comb" followed by multiple "comb += nmigen_stmt" lines is a good trick
683 that can reduce code indentation by 6 characters without reducing clarity.
684
685 Additionally, use comments just above an obtuse variable in order to
686 help explain what it is for. In combination with keeping the the module
687 itself short, other readers will not need to scroll back several pages
688 in order to understand the code.
689
690 Yes it is tempting to actually use the variables as
691 self-explanatory-comments and generally this can be extremely good
692 practice. the problem comes when the variable is so long that a function
693 with several parameters csn no longer fit on a single line, and takes
694 up five to ten lines rather than one or two. at that point, the length
695 of the code is adversely affected and thus so is readability by forcing
696 readers to scroll through reams of pages.
697
698 it is a tricky balance: basically use your common sense, or just ask
699 someone else, "can you understand this code?"
700
701 ### Reasons for code structure
702
703 regarding code structure: we decided to go with small modules that are
704 both easy to analyse, as well as fit onto a single page and be readable
705 when displayed as a visual graph on a full UHD monitor. this is done
706 as follows:
707
708 * using the capability of nmigen (TODO crossref to example) output the
709 module to a yosys ilang (.il) file
710 * in a separate terminal window, run yosys
711 * at the yosys prompt type "read_ilang modulename.il"
712 * type "show top" and a graphviz window should appear. note that typing
713 show, then space, then pressing the tab key twice will give a full list
714 of submodules (one of which will be "top")
715
716 you can now fullsize the graphviz window and scroll around. if it looks
717 reasonably obvious at 100% zoom, i.e the connections can be clearly
718 related in your mind back to the actual code (by matching the graph names
719 against signals and modules in the original nmigen code) and the words are
720 not tiny when zoomed out, and connections are not total incomprehensible
721 spaghetti, then congratulations, you have well-designed code. If not,
722 then this indicates a need to split the code further into submodules
723 and do a bit more work.
724
725 The reasons for doing a proper modularisation job are several-fold:
726
727 * firstly, we will not be doing a full automated layout-and-hope
728 using alliance/coriolis2, we will be doing leaf-node thru tree node
729 half-automated half-manual layout, finally getting to the floorplan,
730 then revising and iteratively adjusting.
731 * secondly, examining modules at the gate level (or close to it) is just
732 good practice. poor design creeps in by *not* knowing what the tools
733 are actually doing (word to experienced developers: yes, we know that
734 the yosys graph != final netlist).
735 * thirdly, unit testing, particularly formal proofs, is far easier on
736 small sections of code, and complete in a reasonable time.
737
738 ## Special warning / alert to vim users!
739
740 Some time around the beginning of 2019 some bright spark decided that
741 an "auto-recommend-completion-of-stuff" option would be a nice, shiny
742 idea to enable by default from that point onwards.
743
744 This incredibly annoying "feature" results in tabs (or spaces) being
745 inserted "on your behalf" when you press return on one line, for your
746 "convenience" of not needing to type lots of spaces/tabs just to get
747 to the same indentation level.
748
749 Of course, this "feature", if you press return on one line in edit
750 mode and then press "escape", leaves a bundle-of-joy extraneous
751 whitespace **exactly** where you don't want it, and didn't ask for it,
752 pooped all over your file.
753
754 Therefore, *please*: **before** running "git commit", get into the
755 habit of always running "git diff", and at the very minimum
756 speed-skim the entire diff, looking for tell-tale "red squares"
757 (these show up under bash diff colour-syntax-highlighting) that
758 inform you that, without your knowledge or consent, vim has
759 "helpfully" inserted extraneous whitespace.
760
761 Remove them **before** git committing because they are not part
762 of the actual desired code-modifications, and committing them
763 is a major and constant distraction for reviewers about actual
764 important things like "the code that actually *usefully* was
765 modified for that commit"
766
767 This has the useful side-effect of ensuring that, right before
768 the commit, you've got the actual diff right in front of you
769 in the xterm window, on which you can base the "commit message".
770
771 ## Unit tests
772
773 For further reading, see the wikipedia page on
774 [Test-driven Development](https://en.wikipedia.org/wiki/Test-driven_development)
775
776 This deserves its own special section. It is extremely important to
777 appreciate that without unit tests, python projects are simply unviable.
778 Python itself has over 25,000 individual tests.
779
780 This can be quite overwhelming to a beginner developer, especially one
781 used to writing scripts of only 100 lines in length.
782
783 Thanks to Samuel Falvo we learned that writing unit tests as a formal
784 proof is not only shorter, it's also far more readable and also, if
785 written properly, provides 100% coverage of corner-cases that would
786 otherwise be overlooked or require tens to hundreds of thousands of
787 tests to be run.
788
789 No this is not a joke or even remotely hypothetical, this is an actual
790 real-world problem.
791
792 The ieee754fpu requires several hundreds of thousands of tests to be
793 run (currently needing several days to run them all), and even then we
794 cannot be absolutely certain that all possible combinations of input have
795 been tested. With 2^128 permutations to try with 2 64 bit FP numbers
796 it is simply impossible to even try.
797
798 This is where formal proofs come into play.
799
800 Samuel illustrated to us that "ordinary" unit tests can then be written
801 to *augment* the formal ones, serving the purpose of illustrating how
802 to use the module, more than anything.
803
804 However it is appreciated that writing formal proofs is a bit of a
805 black art. This is where team collaboration particularly kicks in,
806 so if you need help, ask on the mailing list.
807
808 ## Don't comment out unit tests: add them first (as failures) and fix code later
809
810 Unit tests serve an additional critical purpose of keeping track of code
811 that needs to be written. In many cases, you write the unit test *first*,
812 despite knowing full well that the code doesn't even exist or is completely
813 broken. The unit test then serves as a constant and important reminder
814 to actually fix (or write) the code.
815
816 Therefore, *do not* comment out unit tests just because they "don't work".
817 If you absolutely must stop a unit test from running, **do not delete it**.
818 Simply mark it with an appropriate
819 ["skip" decorator](https://docs.python.org/3/library/unittest.html#skipping-tests-and-expected-failures),
820 preferably with a link to a URL in the [bugtracker](http://bugs.libre-riscv.org)
821 with further details as to why the unit test should not be run.
822
823 # TODO Tutorials
824
825 Find appropriate tutorials for nmigen and yosys, as well as symbiyosys.
826
827 * Robert Baruch's nmigen tutorials look really good:
828 <https://github.com/RobertBaruch/nmigen-tutorial>
829 * Although a verilog example this is very useful to do
830 <https://symbiyosys.readthedocs.io/en/latest/quickstart.html#first-step-a-simple-bmc-example>
831 * This tutorial looks pretty good and will get you started
832 <http://blog.lambdaconcept.com/doku.php?id=nmigen:nmigen_install> and
833 walks not just through simulation, it takes you through using gtkwave
834 as well.
835 * There exist several nmigen examples which are also executable
836 <https://github.com/m-labs/nmigen/tree/master/examples/> exactly as
837 described in the above tutorial (python3 filename.py -h)
838