-
Notifications
You must be signed in to change notification settings - Fork 9
Tool to convert plain text to LaTeX, HTML, and other formats
License
nfeske/gosh
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
GOSH Documentation Norman Feske Introduction ############ ; This document demonstrates GOSH and all its implemented features. ; ; BTW: Comments start with ';' and are ignored by GOSH GOSH is a tool for converting plain ASCII text to LaTeX and other document formats. Its main design criteria was to use a syntax that is perfectly readable as plain ASCII text and the support of different target formats. The source files of GOSH have a very simple syntax that is similar to the usenet style. Everyone, who ever wrote a mail using an plain ASCII editor will be able to write GOSH texts. ; Actually, the file you are reading right now is a GOSH text file. GOSH supports multiple target formats by different backends, which are available as separate files. By now, there exists the built-in LaTeX backend, a simple HTML backend and a man-page backend. Originally, GOSH was meant as an alternative to writing LaTeX files by hand. In the meanwhile I do all kind of textual work, papers, documentation, websites and slides using GOSH. As it is a big help for myself, it might be useful for other people, too. Anyhow, this is an early version of GOSH. Its features and usage may change in the future. How to get GOSH? ################ :GOSH is available at GitHub: [https://github.com/nfeske/gosh] Licence ####### GOSH and its backends are released under the terms of the GNU General Public Licence. For more information about the GNU General Public Licence visit the official GNU website [http://www.gnu.org/licenses]. Usage ##### GOSH is written in the script language 'Tcl/Tk'. Make sure that you have 'Tcl/Tk' installed on your computer before trying out GOSH. Just check, if you have a program called 'tclsh'. GOSH must be called with the source text file as argument and uses standard output for printing its result. For example. a pdf-file of the this text can be created via: ! > gosh gosh.txt > gosh.tex ! > pdflatex gosh.tex In this example, GOSH generates LaTeX output that is stored in the file 'gosh.tex'. This file is then used as input file for 'pdflatex'. A HTML-version of the text can be created by using the HTML-backend: ! > gosh --style html.gosh gosh.txt > gosh.html The backend to use is specified via the '--style' argument. The 'html.gosh' file contains the rules of how to produce the HTML output. Text style ########## Paragraphs are separated from each other be leaving an empty line between them. If you want to insert verbatim text passages (with monospaced font) - for example source codes, you can mark these lines with a '!' at the beginning of line. For example: ! void main(int argc, char **argv) { ! return 0; ! } Items, Enumations and Descriptions ================================== GOSH supports items by a leading '*', followed by a space: * This is an item. * Items can span over multiple lines. Each subsequent line must be indented by two spaces. Even paragraphs within items are possible. * Nested items are supported as well. * Items can be separated by empty lines to make them better readable in the GOSH text. Enumerations are marked by a leading '#' character and behave like items. # They can be nested. # They can span multiple lines. # They can contain multiple paragraphs. GOSH supports descriptions in a very similar way as items and enumerations. The text to describe is enclosed by colons (':'), followed by the description text: :This text: needs some description. The description can span multiple lines and paragraphs. All lines that belong to the description must be indented by two spaces. Descriptions, Items and Enumerations can be mixed and nested as you like. Accentuations ============= GOSH supports accentuations for marking *bold* and _italic_ text. It uses not the slash ('/') character to mark italic text because slashes are used in pathnames, which are very likely to appear in GOSH texts. Additionally, underlined text is rarely used. If you want to _mark multiple words_ you do not need to (_but_you_can_) place '_' in-between the words. All words 'that are written in apostrophes' will be considered as monospaced text. This is useful for 'filenames' and the like. The beginning and the end of the accentuated text fragment must be on the same line. _If you need to span accentuations over multiple lines, you need to apply_ _the accentuation at each line. This, way the accentuation is also visible_ _in the GOSH text._ GOSH detects hexadecimal numbers by a leading '0x' and prints them in a monospaced font automatically. Text structure ############## Head of a GOSH text =================== The document title is the first text in the document. It is meant to be written centered in the GOSH text. There must be at least one space at the beginning of a title line. Otherwise, GOSH will consider the line as the first paragraph of the document. The title can span multiple lines. The title is followed by one or more empty lines and the author's names. As for the title, the author's names should be written in a centered way as well. Example of a header of a GOSH text: ! This is the title ! of a gosh the ! document ! ! Bernd Ullrich ! Uli Berndrich You do not need to specify the author but it is recommended. You can also specify neither the title nor the author. In this case the document is just untitled. Sections ======== The chapter's names are underlined with the '#' character: ! This is the name of a chapter ! ############################# Sections are underlined by '=' characters: ! Section ! ======= Subsections are underlined by '~' characters: ! Subsection ! ~~~~~~~~~~ Paragraphs are underlined by '-' characters: ! Paragraph ! --------- I choose these underline characters based on their different heights. A '#' is higher than '=' that is higher than '~' that is higher than '-'. This way, the different levels of document structure can be easily differentiated from each other. Images ====== Images can be inserted into the document this way: ! [image filename] This is the caption of the image. ! The caption can span multiple lines. Each line ! must be indented by two spaces. You can control the width of the image in relation to the page width via: ! [image filename 50%] This image will appear with the ! size of a half page-width. This works only with the ! LaTeX-backend of GOSH. The image can be rotated via: ! [image filename 5°] Of course, you can specify both parameters for the same image, too. Tikz images =========== Tikz images in the form of individual files can be incorporated into the document. A tikz file contains one 'tikzpicture'. It can be inserted as follows: ![tikz filename] ! The optional caption describes the tikz image. Note that there should be an empty line above and below the statement. Otherwise, the statement will be interpreted as a reference that appears within the surrounding paragraph. The filename must be specified without the '.tikz' file extension. When no caption is provided, the image will be inserted at the current text position. In contrast, when providing a caption, the image will appear as a (potentially floating) figure that can be referenced via the filename. Tables ====== The table feature of GOSH is very preliminary, but it is still usable for a lot of cases. Tables are drawn by '-' and '|' characters. Each line of a table must be led by at least one space. The caption of a table can be written similar to the caption of an image. directory | filename | size -------------------------------- /etc | crontab | 651 | csh.login | 65 | exports | 114 -------------------------------- /sbin | route | 46680 | portmap | 12016 [table example_table] This is the caption of the table. :Note: Two sequent tables without any text in-between cause problems with the current version of GOSH. References ========== Chapters, sections and subsections can be referenced within the document by enclosing the corresponding name with brackets. For example, read section [Licence] carefully. This makes it very easy to insert references but has the drawback, that all referenced section names need to be different to avoid ambiguous references. Images can be referenced by their filenames (without the extension). I use to store all images of the document in a separate 'img/' folder. This avoids naming conflicts of image filenames and section names. Tables can be referenced by their identifiers. If there is no matching identifier within the document, GOSH assumes the text within the brackets is an external reference (citation). So you can insert citations just the same way as references. This is very practical if you use LaTeX and Bibtex. GOSH detects HTML-links by a heading 'http://'. For example, take a look at Atari.org [http://www.atari.org]. The HTML-backend supports the link text and a title to be optionally specified. The link text is delimited from the _URL_ by a '-', enclosed with spaces: ! [http://www.atari.org - The Atari Headquarter] A title can be optionally specified with parenthesis: ! [http://www.dhs.nu - Dead Hackers Society (click here)] The LaTeX backend ################# The integrated LaTeX backend of GOSH can be tweaked via the following command line arguments: :'--tex-table-floating': This argument makes GOSH to create floating tables. By default, tables are inserted at the position of their definition. When prefixing the name of a chapter or a section with an asterisk '*', then this is translated into the latex syntax !\section*{<name>} That way you can define chapters and sections that are not listed in the contents listing and they receive no index number. By default, GOSH uses plain 'image' and 'table' environments. If you prefer to use a 'image*' or 'table*' environment, you can add the attribute 'full-span' to the image or table tag. For example: ![image img/example 85% 5° full-span] The HTML backend ################ By default, the HTML produces pure HTML code without any fancyness. For the different textual styles, the corresponding HTML tags are used. There exist the following command line options, which take effect on the HTML output: :'--html-toc': This option lets GOSH create a table of contents. The entries of the table link to their corresponding section. :'--html-sec-enum': By default, chapters and sections are not enumerized. By using this option, you can make GOSH to prepend section numbers to the headlines. :'--html-p-colored': This option allows any section type to be colored by a different color. The colors are defined inside the backend and can be over-defined via an additional style as described in section [Tweaking the GOSH output]. :'--html-p-justify': This option sets the alignment of paragraphs to justified. :'--html-keep-tags': This option prevents GOSH from replacing '<' and '>' characters by the corresponding HTML sequences and thus, lets you embed HTML tags directly in your GOSH document. Advanced features ################# Tweaking the GOSH output ======================== The output of GOSH can easily be tweaked by supplying multiple '--style' arguments. The style-files will be processed in the specified order. For example if you want to tweak the HTML ouput to not contain a header and tail, you can tweak the 'html.gosh' style by another 'rawcontent.gosh' style: ! > gosh --style html.gosh --sytle rawcontent.gosh gosh.txt > gosh.html The 'rawcontent.gosh' file contains only the empty versions of the functions for outputting the head and tail: ! proc produce_head_html {} { ! } ! ! proc produce_tail_html {} { ! } The 'slides.gosh' backend is another example of this technique. It slightly modifies the LaTeX output of GOSH to create Foiltex output. This way, you can use GOSH to create 'slides' very easily. File encodings ============== By default, GOSH tries to detect the character encoding by examining the input text via the 'file' command. As this Unix command is not always available, it is possible to skip the detection mechanism and explicitely select the use of UTF-8 encoding via the '--utf8' command-line argument. Raw text ======== There are things like formulars, which are not supported by GOSH but by the target format (such as LaTeX). You can insert source code fragments of your target format directly into your GOSH text file by marking such lines by a colon, followed by a space at the beginning of line: ! : This text will not be touched by ! : GOSH. It will be directly written ! : out in its original form. When using this method, you loose the feature of GOSH to create different target formats of your document because the raw content will certainly conflict with the syntax of other formats (such as HTML). For example the LaTeX sequence "': $e=mc^2$'" produces: : $e=mc^2$ Annotations =========== When writing papers, one often wants to make annotations to preliminary revisions that should be printed in a sligtly accentuated way - so that it is easy to differentiate annotations from real text. As GOSH comments are completely ignored by GOSH, you will not be able to make printable annotations via GOSH comments. Instead, you can use a pipe symbol, followed by a space to mark annotated lines. Within such annotations, you can use items, enumerations, descriptions and accentuations. This makes it easy to convert annotations to real text by just clearing the leading pipe symbol and space. ! | This is an *annotation* and will ! | be written in italic style. Troubleshooting ############### In this section you will find some hints for the use of GOSH. It will grow as soon as people will report problems to me. "Error: cannot figure out what you mean with" --------------------------------------------- When GOSH is unable to parse its input file correctly, it outputs an error message "Error: cannot figure out what you mean with", followed by the trouble-making text. You should revisit this text passage. Mostly, this message is caused by wrong indentation. You should make sure that you indented items, enumerations, descriptions and captions with two spaces and no TABS. Do not use TABS. GOSH does not recognize TABS. Known bugs and limitations ########################## * All references must be written completely on one line, including the brackets. Contact ####### If you have comments, tips or bug reports regarding GOSH, please do not hesitate to contact me via :Email: norman.feske@genode-labs.com
About
Tool to convert plain text to LaTeX, HTML, and other formats
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published