Literate ProgrammingLiterate programming is a philosophy of computer programming pioneered by Donald Knuth. The main idea of literate programming is that computer programs should be written to be read by people, and not just by the computer. Programs should be works of literature, intended to be understood and enjoyed by others. Most computer languages allow commentary to be included in programs, but with literate programming the commentary is the focus. For this reason:
The two main things you may want to do to a literate program are tangling and weaving. (Knuth's original system was called WEB; these terms are a pun on “What a tangled web we weave...”.) Tangling takes the literate source and extracts the program code, replacing placeholders with the source code they refer to. Tangled output can then be fed to the compiler and linked to build a library, component, or executable. Weaving takes the same program and prepares it for typesetting or online display. Depending on the system, this may involve:
Literate programming requires a bit more discipline than “seat-of-the-pants” programming, and requires a time investment to learn the literate programming system, but the potential rewards are great. Explaining the program to others ensures that the programmer is clear in his own mind about how the program works, contributing to the quality of the result. The knowledge that other people will probably be reading the program and the joy of producing a “work of art” discourage shoddy work and make programming more fun. Monday's SystemThe Monday literate programming system is based around a fairly simple XML architecture. This architecture can be embedded any of several different document architectures using their extension mechanisms. Currently the TEI P4 DTD is used for marking up commentary in the existing Monday libraries and tools, but other architectures such as DocBook would be equally suitable. (In fact, a DocBook customization is planned in the near future.) In the Monday literate programming system, a single document corresponds to a library or executable program/application. This was inspired by the fact that the Dylan compilers compile programs one complete library at a time, though it is also a good fit for many other programming languages. In particular, most libraries are organized around a single “theme”, suitable for exposition in a single article-sized document. A library contains one or more modules. This corresponds exactly to Dylan language program organization; in C, C++ or Java a module generally corresponds to a single source file. Modules consist of program source code fragments and placeholders for references to sections, which are smaller fragments of source code. Sections can, in turn, contain placeholders referencing other sections. It is simple to extract (tangle) such a structure into source code files. For portability and simplicity, the Monday literate programming system currently uses a short XSLT script to do the job. Additional XSLT transforms are used to weave literate source programs into documents for printing or online viewing. For printing, documents are normally transformed to XSL (formatting objects) for printing using FOP or another XSL formatter. For online viewing, a transform to HTML will be used. Currently, no program indexing or cross-referencing is done, though hopefully integration with the compiler can provide this in the future. The Monday build tool mmk is designed to provide specialized build support for literate programs. It uses a completely declarative project description (as opposed to make's partly declarative, partly procedural description), improving build parallelism and making it possible to use project descriptions for many different purposes. Other Systems
|