Very simply, a COCOA reference consists of a balanced set of angled brackets (< >) which contains two entities:
For example, the code "A" could be used to refer to the variable "author" and the string would stand for the author's name. Thus COCOA references which indicate the author of a passage of text would look like the following:
<A CHARLES DICKENS>
<A WOLFGANG VON GOETHE>
<A HOMER>
COCOA references only represent an informal trend for encoding specific types of textual information, e.g. authors, dates and titles. Current trends are moving more towards more formalised international standards of encoding. The flagship of this current trend is the Text Encoding Iniative (TEI), a project sponsored by the Association for Computational Linguistics, the Association for Literary and Linguistic Computing and the Association for Computers and the Humanites. Its aim is to provide standardised implementations for machine-readable text interchange.
The TEI uses a form of document markup known as SGML (Standard Generalised Markup Language). SGML has the following advantages:
In the TEI, each text (or document) consists of two parts - a header and the text itself. The header contains information such as the following:
Click here to read more about headers and text
You might also want to read about the EAGLES advisory body in chapter 2 of Corpus Linguistics (page 29).