Pending
- Use of said would require the who attribute and personagraphy.
- Front Matter
- Body Section
- Back Matter
- What to do with content that doesn't fit any of features already described in Prose page
Front
See the Front Matter page for more detailed information.
Body
Prose includes novels, shorts stories, essays, etc.
The body will generally take the following structure (with a few exceptions):
- body
- div type="chapter"
- any of the below tags needed to encode the text
- div
- div type="chapter"
- body
There are several main tags that we use to mark up the structural elements of prose.
They indicate:
Divisions
Divisions are often indicated by a chapter, section, etc. of a book. Nest as many divisions as necessary to properly represent the structure of the text (e.g., chapters, sections, etc.). Be sure to maintain consistency among the levels of division within the body (e.g., all chapters occur as first-level divisions, section as second-level, etc.).
<div type="chapter"> <head>1. Chapter Title</head> <p>Chapter prose ...</p> <div type="section"> <head>1.1 Chapter Section Title</head> <p>Chapter section prose ...</p> </div> </div>
All division tags will have a type attribute. The value of the type attribute will be one of following:
- chapter
- section
- lecture
- letter
- essay
- book
- pamphlet
- notes
- dedication
If none of these value correctly describe the section of text you are encoding, document the nature of the division in the VWWP Encoding Problems page.
<div type=chapter> <pb n="4"/> <head>CHAPTER II</head> <head>THE SEPARATION</head>
Headings
See VWWP TEI P5 Encoding Guidelines for more information about headings.
Paragraphs
Paragraphs are marked with a <p>
tag. Paragraphs can be marked virtually anywhere in the text to mark a prose block. Paragraphs include <pb/>
(page breaks), lists and tables. Paragraphs are extremely versatile and are used in a wide variety of text encoding situations. Generally speaking, if something is written as a paragraph, it can be marked as such. <div>
tags cannot come within paragraphs, but <list>
tags, <figure>
tags, <pb/>
tags, <note>
tags, and many others can come within <p> tags. For instance, if a paragraph is broken up by a blank page and an image, as shown below, you do not need to close the paragraph to include these features. This allows you to faithfully represent the text.
<p> Another feature was boats large and small, and junks, some laboriously tracked or rowed like my own, when the wind failed, against the powerful stream, or descending, keeping the necessary steerage headway by crowds of standing men on the low deck, facing forwards, vigorously working great sweeps or yulows, five or ten at each, the gorge echoing all along its length to the rise and fall of the wild chants to which the rowers keep time and which are only endurable when softened by distance. After some hours of this region of magic and mystery, near sunset we emerged into open water, with broken picturesque shores, and at dusk tied up in a pebbly bay with glorious views of mountain and woodland, not far from the beautiful village of Nan-to, and the "needle" or "pillar" of heaven, well known to the dwellers in Ichang. The Ichang gorge is about twelve miles long; the Niu-kan, grander yet, about three; the Mitan about three and a half; the Wushan about twenty; and the Feng-hsiang, or "Wind Box," the last of the great gorges, about four. These are the great gorges. </p>
<p>With a strong, fair wind our sail was set; the creak and swish of the oars was exchanged for the low music of the river as it parted under our prow; and the deep water (from fifty to a hundred feet), of a striking bottle-green colour, was unbroken by a swirl or ripple, and slid past in a grand, full volume. The stillness was profound, enlivened only as some big junk with lowered mast glided past us at great speed, the fifty or sixty <pb n="107"/> <pb n="108"/> <figure> <figDesc> "Entrance to Ichang Gorge." </figDesc> </figure> <pb n="109"/> men at the sweeps raising a wild chant in keeping with the scene. Scuds of snow, wild, white clouds whirling round pinnacles, and desolate snow-clothed mountains, apparently blocking further progress, added to the enchantment. </p>
Notes
See VWWP TEI P5 Encoding Guidelines for more information about encoding notes (footnote, endnotes, etc.).
Photographs, Graphics, and other Images
See VWWP TEI P5 Encoding Guidelines for more information about photographs, graphics and other images.
Lists
See VWWP TEI P5 Encoding Guidelines for more information about lists.
Tables
See VWWP TEI P5 Encoding Guidelines for more information about tables.
Quotes
Quotes are denoted by quotation marks. Only text that comes within quotation marks will be marked as a quotation for the purposes of encoding. There are two types of quotes: quotes that are external to the text and quotes that are internal. The quote element is used for passages that are external to the text, like a reference to a study or another book. Internal quotes are quotes occur inside the text (e.g., character speeches or thoughts or notes written by characters) and have various TEI elements to represent them.
Quotes that are External to the Text: Outside Sources and Other References
Quotes that come from outside the text are marked by first using a <cit>
tag, to denote an external citation. Within the <cit>
tag there are two smaller parts, <quote>
and <bibl>
. <quote>
encompasses the body of the quote, or actual quoted text. The <bibl>
tag encompasses any bibliographic reference given that identifies the source of the text, such as a title or author. For a more comprehensive discussion of the <bibl>
tag, please see the <bibl> section of the official TEI P5 guidelines. Quotes can also be marked with other tags, for instance, inside the <quote>
tag, you can have an <l>
tag to denote a line of poetry.
<cit> <quote> <l>Parted without the least regret,</l> <l>Except that they had ever met.</l> <l>* * * *</l> <l>Misses, the tale that I relate,</l> <l>This lesson seems to carry:</l> <l>Choose not alone a proper mate,</l> <l>But proper time to marry!</l> </quote> <bibl> <author>Cowper,</author> <title level="a">Pairing Time anticipated</title> </bibl> </cit>
<cit> <quote> "To be or not to be?" </quote> <bibl> <author>Shakespeare,</author> <title level="a">Hamlet</title> </bibl> </cit>
<p> There are three main female characters in The Great Gatsby, Myrtle Wilson, Jordan Baker and Daisy Buchanan. When <cit> <bibl> <author>Fitzgerald</author> says, </bibl> <quote>"it takes two to make an accident,"</quote> </cit> one wonders to which of these women he is referring. </p>
Quotes that are Internal to the Text: Thought, Speech, Writing
Quotations in the text that indicate speech, thought, writing, etc. by one or more characters is marked by the various TEI elements. For instance, dialogue or notes written from one character to another would be indicated using this <q> element. The <q>
tag will generally come inside of a set of <p>
tags, since most dialogue is denoted within the text by setting it apart as a separate paragraph. Quotes can come within quotes, such as when one speaker quotes someone else. If there is an external quote inside an internal quote, for instance, a character quotes the bible, the correct tags will be used to delineate between the two distinct types of quotes.
Specialized tags are provided to indicate the various types of internal quotations, but for this project we will only use a subset of the possible tags:
- <foreign>: A word or phrase is in quotation marks, italisized or set apart in some way because it not the predominant language used in the text.
- Attempt to identify the language using the "xml:lang" attribute and a two-letter (as opposed to the three-letter) code according to the ISO 639 standard. See example below.
- <distinct>: A word or phrase is in quotes or set apart in some way because it is linguistically distinct such as slang or regional dialect.
Characters engaged in speech such as a dialogue should retain the quotations marks, but not use specialized tags to represnt spoken text.
<p>Henry blustered, "I know you. <quote>'Thou Shalt Not Kill.'</quote>" </p>
<p>I had four days of <distinct>"hanging on."</distinct>
<p> <foreign xml:lang="fr">C'est la vie</foreign>, said Jack. </p>
Closers
See VWWP TEI P5 Encoding Guidelines for more information about letter closers.
Page Breaks
For more information on how to encode page breaks see the page break section of the general guidelines.
Back Matter
For how to encode the back matter of the text, see the back matter section.
Problems
If a part of the prose text that you are trying to encode does not fit one of the above described features, document the problem in the VWWP Encoding Problems page.