- Use of said would require the who attribute and personagraphy.
See the Front Matter page for more detailed information.
Prose includes novels, shorts stories, essays, etc.
The body will generally take the following structure (with a few exceptions):
There are several main tags that we use to mark up the structural elements of prose.
Divisions are often indicated by a chapter, section, etc. of a book. Nest as many divisions as necessary to properly represent the structure of the text (e.g., chapters, sections, etc.). Be sure to maintain consistency among the levels of division within the body (e.g., all chapters occur as first-level divisions, section as second-level, etc.).
All division tags will have a type attribute. The value of the type attribute will be one of following:
If none of these value correctly describe the section of text you are encoding, document the nature of the division in the VWWP Encoding Problems page.
See VWWP TEI P5 Encoding Guidelines for more information about headings.
Paragraphs are marked with a
<p> tag. Paragraphs can be marked virtually anywhere in the text to mark a prose block. Paragraphs include
<pb/> (page breaks), lists and tables. Paragraphs are extremely versatile and are used in a wide variety of text encoding situations. Generally speaking, if something is written as a paragraph, it can be marked as such.
<div> tags cannot come within paragraphs, but
<note> tags, and many others can come within <p> tags. For instance, if a paragraph is broken up by a blank page and an image, as shown below, you do not need to close the paragraph to include these features. This allows you to faithfully represent the text.
See VWWP TEI P5 Encoding Guidelines for more information about encoding notes (footnote, endnotes, etc.).
Photographs, Graphics, and other Images
See VWWP TEI P5 Encoding Guidelines for more information about photographs, graphics and other images.
See VWWP TEI P5 Encoding Guidelines for more information about lists.
See VWWP TEI P5 Encoding Guidelines for more information about tables.
Quotes are denoted by quotation marks. Only text that comes within quotation marks will be marked as a quotation for the purposes of encoding. There are two types of quotes: quotes that are external to the text and quotes that are internal. The quote element is used for passages that are external to the text, like a reference to a study or another book. Internal quotes are quotes occur inside the text (e.g., character speeches or thoughts or notes written by characters) and have various TEI elements to represent them.
Quotes that are External to the Text: Outside Sources and Other References
Quotes that come from outside the text are marked by first using a
<cit> tag, to denote an external citation. Within the
<cit> tag there are two smaller parts,
<quote> encompasses the body of the quote, or actual quoted text. The
<bibl> tag encompasses any bibliographic reference given that identifies the source of the text, such as a title or author. For a more comprehensive discussion of the
<bibl> tag, please see the <bibl> section of the official TEI P5 guidelines. Quotes can also be marked with other tags, for instance, inside the
<quote> tag, you can have an
<l> tag to denote a line of poetry.
Quotes that are Internal to the Text: Thought, Speech, Writing
Quotations in the text that indicate speech, thought, writing, etc. by one or more characters is marked by the various TEI elements. For instance, dialogue or notes written from one character to another would be indicated using this <q> element. The
<q> tag will generally come inside of a set of
<p> tags, since most dialogue is denoted within the text by setting it apart as a separate paragraph. Quotes can come within quotes, such as when one speaker quotes someone else. If there is an external quote inside an internal quote, for instance, a character quotes the bible, the correct tags will be used to delineate between the two distinct types of quotes.
Specialized tags are provided to indicate the various types of internal quotations, but for this project we will only use a subset of the possible tags.
- <foreign>: A word or phrase is in quotation marks, italisized or set apart in some way because it not the predominant language used in the text.
- <distinct>: A word or phrase is in quotes or set apart in some way because it is linguistically distinct such as slang or regional dialect.
Characters engaged in speech such as a dialogue should retain the quotations marks, but not
Quick Reference, Quote Type
*spoken: A quote is spoken out loud by a character in the text. Use said,
*thought: A character thinks a quote, rather than saying it out loud. Use said.
*written: A character internal to the text has written something that is quoted within the text. Use quotation.
*emph: A word or phrase is in quotation marks in order to emphasize it. Use emph.
*mentioned: A word or phrase is in quotes because the author refers to the word itself, such as in a discussion of its part of speech, rather than using the word. Use mentioned.
*term: A word or phrase is in quotes because it is terminology. For instance, computer terms or scientific language. Use term.
*foreign: A word or phrase is in quotation marks because it does not belong to the predominant language used in the text. Use foreign.
*soCalled: An author uses scare quotes to distance him or herself from a word. Use soCalled.
See VWWP TEI P5 Encoding Guidelines for more information about closers.
For more information on how to encode page breaks see the page break section of the general guidelines.
For how to encode the back matter of the text, see the back matter section.
If a part of the prose text that you are trying to encode does not fit one of the above described features, document the problem in the VWWP Encoding Problems page.