diff --git a/Makefile b/Makefile new file mode 100644 index 0000000..8467993 --- /dev/null +++ b/Makefile @@ -0,0 +1,2 @@ +all: + @$(MAKE) -C grammar all diff --git a/grammar/Makefile b/grammar/Makefile new file mode 100644 index 0000000..34efeb0 --- /dev/null +++ b/grammar/Makefile @@ -0,0 +1,4 @@ +.PHONY: all +all: main.pdf +main.pdf: main.tex main.aux + latexmk -xelatex main -interaction=nonstopmode diff --git a/grammar/main.tex b/grammar/main.tex index 821e928..ff79a9a 100644 --- a/grammar/main.tex +++ b/grammar/main.tex @@ -2,29 +2,49 @@ % vim: set et tw=80 ts=4 sw=4: \documentclass{article} \usepackage{fontspec} -\setmainfont{Andika}[StylisticSet=13] -\setsansfont{Minion 3}[Scale=MatchUppercase] + +\setmainfont{Minion 3}[ + Script=Latin, + Script=Greek, + Script=Cyrillic +] \setmonofont{Iosevka}[ Scale=MatchUppercase, CharacterVariant={99:8} ] +\newfontfamily\h{Cousine}[Scale=MatchUppercase] + \usepackage[margin=25mm]{geometry} \usepackage{nguhslides/nguhgloss} +\usepackage{xcolor} + \usepackage{hyperref} +\hypersetup{colorlinks=true,allcolors=blue!50!black} + \let\nf\normalfont \def\z#1{\texttt{#1}} -\def\Clong{\z{811\#0*034C534*4\#C004-*\#3*75} } -\title{\Clong\\{\large A CCC3 Submission}} -\author{Annwan} -\date{2024} +\def\Clong{\z{811\#0*034C534*4\#C004-*\#3*75}} + +\def\doctitle{\Clong} +\def\docstitle{A CCC3 Submission} +\def\docauthor{Annwan} +\def\docdate{\today} \setlength\parskip{1ex} \setlength\parindent{0ex} \begin{document} -\maketitle -% \tableofcontents -\section{Presentation} +\topskip0pt +\vspace*{\fill} +\begin{center} + {\Large \doctitle}\\ + {\large \docstitle}\\ + {\docauthor}\\ + {\docdate} +\end{center} +\vspace*{\fill} +\clearpage\tableofcontents +\clearpage\section{Presentation} \Clong is the language of telephone exchanges @@ -45,8 +65,6 @@ Here is a table of DTMF tones for reference. \end{tabular} \end{center} - - \section{Units of meaning -- Packets} Semantic information is encoded into packets. A packet can be of 5 types: {\bf @@ -54,19 +72,19 @@ Semantic}, {\bf Proper Noun}, {\bf Syntactic}, {\bf Continuation} and {\bf Variable}. A packet is a sequences of coniguous tones surrounded by blanks. additionally individual packets have a maximal length of 20 tones. If a packet's information would exceed that length limit, the extra data shall be -included in one or more continuation packets. +included in one or more continuation packets (see \ref{continuation}). \subsection{Semantic Packet} A semantic packet is a packet encoding a concept. It uses the Universal Decimal Clasification to encode the topics. Numeric values are encoded with their -appropriate digit, dots are encoded with \z*, slashes are encoded with \z A. -dashes are encoded with \z{BA}, equals with \z{BB}, column with \z C and quotes -with \z D and parens with \z\#. This has the side effect that semantic packets -cannot start with \z*. +appropriate digit, dots are encoded with \z*, colons are encoded with \z{\#\#}, double colons are encoded with \z{\#*}, square brackets are encoded with \z{A} and \z{B}, parens are encoded with \z{C} and \z{D}, plus is encoded as \z{\#A}, slash is encoded as \z{\#B}, star is encoded as \z{\#C}, quote is ecoded as \z{\#D}, dash is encoded as \z{\#0} and equals is encoded as \z{\#1}. -A reference to a variable (see \ref{variables}) starts with \z{B*} and ends with -\z{B\#}. The variable prefix is omitted in that use. +A reference to a variable (see \ref{variables}) is to be included in between \z{**}. The variable prefix is omitted in that use. + +Non-UDC notation is to be included as a variable refering to a proper noun representing the string to be included and with the qualifier of \z{*}. + +An abbriged summary of the UDC classification containing every topic number and every construct used in the examples and the translation is available in Apendix \ref{udcsummary}. A more complete summary is available online at \url{https://udcsummary.info/}. The full specification can be obtained from the UDC consortium (by paying a substantial amount of money) or can probably be consulted at your nearest library. \newpage \subsection{Proper Nouns} @@ -90,6 +108,8 @@ normalised decomposed form then subsequently encoded to EBCDIC page 00803 A proper noun must be used in the context of a Proper Noun Qualifier Packet (see \ref{references}). +A copy of the EBCDIC code page can be found in Appendix \ref{ebcdic} + \subsection{Syntactic Packet} Syntactic Packets start with a \z* followed by one or more digit or letters. If @@ -140,6 +160,7 @@ and the fourth as Verb. \end{description} \subsection{Continuation Packet} +\label{continuation} A continuation packet is used whenever a packet goes over it's maximum size (3+16 tones for proper noun packets, 20 tones for semantic packets) to contain @@ -226,14 +247,79 @@ thine humanity: There are no politics in real life. {\nf male child of} var[0] | cont | gnr | govern *3 | *D | **0*44*#0#0#0#0#0#0 | *#0#5#4*44*#0#0#0#0 | *#0#0#0#6#9*44*#0#0 - trans | ppn | ``Tian'' | cont | cont + trans | ppn | \nf``Tian'' | cont | cont *#0#0#0#0#0#6#1*44* | *#0#0#0#0#0#0#0#6C5 | *1 | *2A | - | - cont|cont|assert|ntrans|\nf prohibition/cosmetic:[silence+restraint]|\nf beset - [Tian (beset by cosmetic prohibitions of silence and restraint)] [practices against] - [the lesser side of the invisible origin of light], + - | - + \nf[side:lesser]/[origin:light]:invisible | \nf fight +5} } +\appendix +\def\thesubsubsection{\arabic{subsubsection}} +\clearpage\section{EBCDIC Code Page 803} +\label{ebcdic} +{\setmainfont{Iosevka}\def\arraystretch{1.25}\begin{center} + \def\s#1{\footnotesize\itshape\bfseries #1} + \begin{tabular}{c|cccc|cccc|cccc|cccc} + & \bf x0 & \bf x1 & \bf x2 & \bf x3 & \bf x4 & \bf x5 & \bf x6 & \bf x7 & \bf x8 & \bf x9 & \bf xA & \bf xB & \bf xC & \bf xD & \bf xE & \bf xF \\\hline + \bf 0x&\s{NUL}&\s{SOH}&\s{STX}&\s{ETX}&\s{ST}&\s{HT}&\s{SSA}&\s{DEL}&\s{SSA}&\s{RI}&\s{SS2}&\s{VT}&\s{FF}&\s{CR}&\s{SO}&\s{SI}\\ + \bf 1x&\s{DLE}&\s{DC1}&\s{DC2}&\s{DC3}&\s{OSC}&\s{NL}&\s{BS}&\s{ESA}&\s{CAN}&\s{EM}&\s{PU2}&\s{SS3}&\s{IFS}&\s{IGS}&\s{IRS}&\s{ITB}\\ + \bf 2x&\s{PAD}&\s{HOP}&\s{BPH}&\s{NBH}&\s{IND}&\s{LF}&\s{ETB}&\s{ESC}&\s{HTS}&\s{HTJ}&\s{VTS}&\s{PLD}&\s{UP}&\s{ENQ}&\s{ACK}&\s{BEL}\\ + \bf 3x&\s{DCS}&\s{BU1}&\s{SYN}&\s{STS}&\s{CCH}&\s{MW}&\s{SPA}&\s{EOT}&\s{SOS}&\s{SGCI}&\s{SCI}&\s{CSI}&\s{DC4}&\s{NAK}&\s{PM}&\s{SUB}\\\hline + \bf 4x& \s{SP} &&&&&&&&&&\$&.&<&(&+&|\\ + \bf 5x&\h א&&&&&&&&&&\h לי֞&*&)&;&¬\\ + \bf 6x&-&/&&&&&&&&&&,&\%&\_&>&?\\ + \bf 7x&&&&&&&&&&&:&\#&@&'&=&"\\\hline + \bf 8x&&\h ב &\h ג &\h ד &\h ה &\h ו &\h ז &\h ח &\h ט &\h י &&&&&&\\ + \bf 9x&&\h ך &\h כ &\h ל &\h ם &\h מ &\h ן &\h נ &\h ס &\h ע &&&&&&\\ + \bf Ax&&&\h ע &\h ף &\h פ &\h ץ &\h צ &\h ק &\h ר &\h ש &\h ת &&&&&\\ + \bf Bx&&&&&&&&&&&&&&&&\\\hline + \bf Cx&&A&B&C&D&E&F&G&H&I&&&&&&\\ + \bf Dx&&J&K&L&M&N&O&P&Q&R&&\s{LRO}&\s{RLO}&\s{PDF}&&\\ + \bf Ex&&&S&T&U&V&W&X&Y&Z&&&&&&\\ + \bf Fx&0&1&2&3&4&5&6&7&8&9&&\s{LRF}&\s{RLF}&\s{LRM}&\s{RLM}&\s{APC}\\ + \end{tabular} +\end{center}} +\clearpage\section{An Abriged Summary of the Universal Decimal Classification} +\label{udcsummary} + +\subsection{Main Tables} +{\setcounter{subsubsection}{-1} +\subsubsection{Science and Knowledge, Organisation, Computer Science, Information, Documentation, Librarianship, Institutions, Publications} +\subsubsection{Philosophy, Psychology} +\subsubsection{Religion, Theology} +\subsubsection{Social Sciences} +\subsubsection{\it Unused} +\subsubsection{Mathematics, Natural Sciences} +\subsubsection{Applied Sciences, Medicine, Technology} +\subsubsection{The Arts, Recreation, Entertainment, Sport} +\subsubsection{Language, Linguistics, Litterature} +\subsubsection{Geography, Biography, History} } +\subsection{Auxiliary Signs} + +\subsubsection*{{\ttfamily +} Coordination, Addition} The coordination sign + (plus) connects two or more separated (non-consecutive, non-related) UDC numbers, to denote a compound subject for which no single number exists +\subsubsection*{{\ttfamily /} Consecutive extension} The extension sign / (stroke or slash) connects the first and last of a series of consecutive UDC numbers to denote a broad subject, or range of concepts. + +If the number following the stroke is more than three digits long and begins with a group of digits in common with the preceding number, it may be abbreviated by omitting the digits common to both, so long as the first item after the stroke is a point. Thus, in the notation built from 629.734 and 629.735, the group 629 is common to both and is not repeated; the group .735 following the stroke is coordinate with .734 preceding it: 629.734/.735; this is equivalent to 629.734+629.735 . +\subsubsection*{{\ttfamily :} Simple relation} The relation sign : (colon) indicates relationship between two or more subjects by connecting their UDC numbers. Unlike the plus and stroke, the colon restricts rather than extends the subjects it connects. + +When the colon is used to link auxiliary numbers with biterminal signs (i.e. parentheses or quotation marks), the resultant compound number is enclosed within a single set of signs, e.g. (44:450). If required, the type of relationship between two concepts, such as bias phase, comparison, influence etc. may be further specified using common auxiliaries of phase relations. + +\subsubsection*{{\ttfamily ::} Order-Fixing} The double colon :: indicates a relationship and fixes the order of the UDC numbers. The symbol is used when there is a need to stress that the number following the double colon is a specification (part of the definition) of the subject by adding attribute(s) to the preceding number and that the reverse order does not represent the same meaning. + +\subsubsection*{{\ttfamily []} Subgrouping} Square brackets [ ] may be used as a subgrouping device within a complex combination of UDC numbers, in order to clarify the relationship of the components. Subgrouping may be needed when a subject denoted by two or more UDC numbers linked by plus sign, stroke or colon is as a whole related to another number by colon, or is modified by a common or special auxiliary + +\subsubsection*{{\ttfamily *} Introduction of non-UDC notation} + +\subsection{Auxiliary Numbers} +\subsubsection*{{\ttfamily =...} Common auxiliaries of langauge} +\subsubsection*{{\ttfamily (0...)} Common auxiliaries of form} +\subsubsection*{{\ttfamily (1/9)} Common auxiliaries of place} +\subsubsection*{{\ttfamily (=...)} Common auxiliaries of human ancestry, etnic grouping and nationality} +\subsubsection*{{\ttfamily "..."} Common auxiliaries of time} +\subsubsection*{{\ttfamily -0...} Common auxiliaries of general characteristics: Properties, Materials, Relations/Processes and Persons} \end{document}