Documentation
Presentation of the Material
The Auden Musulin Papers project follows the approach of digital documentary edition (as outlined by Pierazzo 2014).
By default, the edited documents are presented in the form of a reading version with minor editing. Optionally, original text features (such as revisions) can be displayed, and users can modify presentation preferences (including enabling/disabling display of entities, comments, and intertextuality information).
-
The default view offers the following features:
- one view per page recto and verso (with the exception of photographs glued onto pages of photo albums)
- vertical arrangements of texts that approximate the original layout
- different fonts to represent handwriting (serif font), typewriting (monospaced slab-serif font), as well as print and stamps (sans-serif font)
- reading versions of texts that do not contain deletions or indications of unclear characters
- indications of undecipherable (illegible) characters (if not deleted)
-
The following display options are available:
- revisions (deletions and additions)
- indications of unclear characters which cannot be transcribed with certainty
- labeling of entities (persons, places, institutions, works, events)
- editorial commentary, including information about responsibility and uncertainty
- intertextuality information (on quotations from other sources)
- choice of fonts and font sizes
- facsimiles displayed through a plugin viewer
- full-screen view
- detailed editorial metadata
- correspondence metadata
Image Digitization
The documents have been reproduced at 600 dpi and 24 bit/RGB through an Image Access WideTEK®25 color flatbed scanner and saved in an uncompressed TIFF format. Selected materials (images 3r and 3v in Autograph Letter Signed W. H. Auden to Stella Musulin with Typescript W. H. Auden "Joseph Weinheber" and with Typescript W. H. Auden Translation "Joseph Weinheber" 1965-04-28, images 2r and 2v in Autograph Letter Signed W. H. Auden to Stella Musulin 1969-06-10, as well as all photographs) have been digitized at 603.1 dpi and 24 bit by means of a Phase One XF IQ3 100MP medium-format camera with a Schneider Kreuznach 80mm f/2.8 leaf shutter lens. The high-resolution digital reproductions can be accessed via https://hdl.handle.net/21.11115/0000-000F-3256-8. On the website, images are provided through IIIF.
The images of Autograph Letter Signed Stella Musulin to Christine Busta 1962-05-23 have been created by the Literary Archive of the Austrian National Library.
In collaboration with the Computer Vision Lab of TU Wien, the surface of selected source documents (images 3r-cv and 3v-cv in Autograph Letter Signed W. H. Auden to Stella Musulin with Typescript W. H. Auden "Joseph Weinheber" and with Typescript W. H. Auden Translation "Joseph Weinheber" 1965-04-28, images 2r-cv and 2v-cv in Autograph Letter Signed W. H. Auden to Stella Musulin 1969-06-10) has been visualized by means of Photometric Stereo, using a PhaseOne IQ260 Achromatic with an 8,964 × 6,716 pixel medium-format sensor and a Schneider-Kreuznach 120 mm LS Macro lens (see Auden Musulin Papers through Computer Vision).
Transcription and Markup Guidelines amp-data v1.0
Source transcriptions as encoded in TEI/XML represent the spelling, punctuation, and revisions as they appear, aiming for minimal editorial intervention or interpretation.
-
Spelling
- The original spelling in both manuscripts and typescripts is retained, even if irregular.
- Characters that cannot be transcribed with certainty (for whichever reason) are marked up as <unclear> in XML and can be highlighted in grey on the web page.
- Handwritten characters are not marked up as <unclear> in XML when they can be identified as a consistent and typical feature of the respective author's handwriting.
- Signatures are never marked up as <unclear>.
- Characters that cannot be transcribed are marked up as <gap> in XML and represented by means of square brackets on the web page.
- Superscripts in handwritten documents have not been marked up and are represented in standard script on the web page.
- Numericals
- For typescripts, the characters 'I' (uppercase i) and 'O' (uppercase o) designating the numbers 1 and 0 (zero), respectively, are preserved.
- For manuscripts, the anglophone variant writing of the numeral 1 is represented by '1'.
- Punctuation and Spaces
- The original punctuation is retained, even if irregular.
- Punctuation signs that cannot be transcribed with certainty are rendered by UTF-8/Unicode symbols that represent a graphical approximation, and are marked up as <unclear> in XML.
- For handwritten documents, when spaces before or after punctuation marks are significantly smaller than the spaces separating words in the surrounding text, no spaces are transcribed.
- For handwritten documents, the length of spaces between words is regularized and represented as either single space or no space.
- For typescripts, the exact number of spaces before and within sentences is preserved and marked up via the whitespace character as well as (especially in the case of multiple spaces) through <space> in TEI/XML.
- Indentations of typed lines (of poetry) without reference value in relation to the left-hand margin are marked up as <space extent="undefined quantity of characters" min="1" unit="char">.
- Changes in the left-hand margins of typescripts are not marked up as indentations (by means of <space>) if they appear to result from reinsertions of the sheets of paper or resets of margins on the typewriter that do not appear meaningful.
- Spaces that precede printed text have been disregarded.
- Blank lines between lines of writing are disregarded in the markup. However, structural units of text have been marked up (see 4.1), and, in the web-application display, these textual components (such as paragraphs) are represented as distinct blocks of writing.
- Page Layout
- Original line breaks are retained by means of </lb> within structural elements of text, but not in between block-level elements (such as in between <p>, <salute>, or <signed> elements). Within structural elements, the TEI <lb> element represents standard line spacing (most likely produced by operating the typewriter's carriage-return lever); in between elements, spacing (possibly produced by means of operating the typewriter's variable-line-spacing knob) can vary, which is not represented in the markup. When prose paragraphs (<p>) are interrupted by page beginnings (<pb>), the continuing paragraph on the new page is marked up as <p prev="true">.
- For prose documents (notably, Stella Musulin's memoirs of W. H. Auden), a TEI-conformant container structure is used. The key structural elements are the following: the <div type="transcription"> node contains a <div type="prose">, in which <head>, <p>, and <fw type="pageNum"> are nested; writing that does not form part of the prose text proper is contained in separate <div> nodes, which can include <ab> elements.
- For correspondence documents, a specific and semantically expressive TEI-conformant container structure is used. The most important structural elements are the following: <div type="transcription"> can contain <div type="envelope"> and <div type="letter">; <div type="letter"> can include <div type="letter_message"> and <div type="poem">; <div type="letter_message"> can contain <opener> and <closer> containers (in addition to <p>, which can be subdivided by means of <seg>); <opener> groups together dateline and salutation information from the start of the correspondence item; <closer> groups together salutation and signature information from the end of the correspondence item; <div type="poem"> can be further structured through the <cb> milestone element and can contain <head> and <lg> elements; <lg> nodes include <space> and <l> elements.
- The vertical succession of text elements is represented; their horizontal arrangement has been disregarded.
- Underline
- Underlines are marked up as <hi rend="underline"> and are graphically represented on the web page.
- Underlines are not represented if they form part of signatures.
- Underlines in typescripts are rendered precisely and include, for instance, empty spaces before or after words if they are underlined in the source document.
- Revisions
- Deletions of text are marked up as <del> in XML, and can be represented on the web page (highlighted in light red).
- For revisions in handwriting and typewriting, we have accepted the orthographically or grammatically accurate, or otherwise plausible, versions as corrections, even if the actual sequences of textual layers cannot be reconstructed.
- For transcriptions of typescripts - in addition to handwritten deletions - we accept the use of the typed signs 'x', 'm', 's', 'z', 'y', '-', and '/' as indicating deletions where such deletion seems plausible.
- Additions have been marked up through <add>. They are always represented on the website, but can be highlighted (in light green).
- When additions are inserted into manuscripts or typescripts, surrounding spaces are included in ways that are coherent with the general use of spaces in the respective texts.
- Revision processes that involve (single or multiple) deletions and additions have been nested within a <subst> element.
- When additions replace deleted words in typescripts, they are inserted directly after the deletions without additional spaces in between.
- When it is uncertain whether characters have been deleted, the passage in question is marked up as '<unclear><del>[...]<del/><unclear/>'.
- For typescripts by Stella Musulin, we accept handwritten underlines of deleted passages as cancelled deletions.
- Pre-printed Components and Postal Additions
- Pre-printed letter heads and sender addresses are transcribed.
- Postmarks and stamps are not transcribed in the <text> part. Organization (e.g., post office), place, date, and time (captured in terms of @notBefore-iso and @notAfter-iso) information has been extracted and included in the <correspDesc> section of the <teiHeader>. In accordance with the TEI P5 Guidelines, the individual stages in the correspondence process are marked up as <correspAction>. The following values of the @type attribute are used: "sent" for information that refers to the first processing of the correspondence item by the outgoing post office as retrieved from the postmark on the front of the envelope; "transmitted" for information on the further processing of the correspondence item by intermediate post office(s) as retrieved from the backstamp(s); "redirected" for information on the place (as well as, if applicable, on the person or organization) related to redirecting the unopened correspondence item to another destination; "received" for information on the person (and, if applicable, place) related to the last indicated destination on the front of the envelope. Information on the place and date of writing is encoded within the <fileDesc> node of the <teiHeader>. Where no envelope has been preserved, only the <correspAction> of @type "received" is documented. Copies of correspondence items are not treated as correspondence items and no <correspDesc> has been recorded.
- Pre-printed postal texts and images in telegrams and aerograms are not rendered in the transcriptions.
- Watermarks have not been transcribed.
- Writing technologies
- Writing technologies (handwriting, typewriting, print) are marked up by means of the @hand attribute in XML and are represented through different fonts on the web page. Inked stamp marks (apart from postmarkes) are marked up by means of the <stamp> element and are represented on the website in the font as printed writing.
- When parts of writing have been attributed to specific authors, such identification has not been encoded via @hand. However, different authors are documented via the <author> element in the <titleStmt> of the <fileDesc> node in the TEI header.
- Photographs
- All texts pertaining to the photograph documents are transcribed within <div type="photo"> (inside <div type="transcription">).
- Persons depicted in the photographs or referred to in the accompanying texts are recorded in the <particDesc> section of the <teiHeader>. Photographers are recorded both in <particDesc> and through the TEI <author> element.
- Markup of contents
- Extra-textual entities (persons, places, institutions, works, and events) are marked up in XML by means of <rs> and can optionally be displayed on the website, where that display is enriched with additional information and authority-file identifiers.
- Editorial comments are marked up in XML within the <interpGrp> element of the <teiHeader>. The respective elements within the textual data or metadata to which the editorial interpretations refer are linked to these via the @ana attribute. The <desc> element within each <interp> node contains a human-readable comment; furthermore, TEI elements have been used to formalize that comment in a machine-readable format (such as bibliographic references).
- Editorial interpretation has been rendered transparent in machine-readable format using the <respons> and <certainty> elements. Within <certainty>, the @cert attribute has the following values: "high" (interpretation based on concordant information from multiple external sources or on information supplied by one or more reliable external sources), "medium" (some indication based on external sources, but inconclusive or disputable), and "low" (interpretation based on contradictory external information or on no external evidence).
- Information retrieved from the correspondence items (such as related to other correspondence, telephone calls, meetings, and Auden's professional activities) has been marked up via <event>.
- Language information is encoded in XML by means of the TEI <foreign> element and the @xml:lang attribute.
- In the <teiHeader> section, metadata related to the source material are encoded within the <sourceDesc> node; edition metadata are represented in the <publicationStmt>, <editionStmt>, and <titleStmt> nodes.