Documentation

Presentation of the Material

By default, the edited documents are presented in the form of a reading version with minor editing. Optionally, original text features (such as deletions) can be displayed, and users can modify presentation preferences (including enabling/disabling display of entities, comments, and intertextuality information)..

  • The default view offers the following features:
    • one view per page recto and verso (with the exception of photographs glued onto pages of photo albums)
    • vertical arrangements of texts that partially represent the original layout
    • different fonts to represent handwriting (Times New Roman), typewriting (Courier New), and print (Arial)
    • reading versions of texts that do not contain deletions, underlines, additional typographic whitespace, or indications of unclear characters
    • italics to represent originally underlined text
    • indications of undecipherable characters (if not deleted)
  • The following display options are available:
    • original text features: deletions, underlines, additional typographic whitespace
    • indications of unclear characters which cannot be transcribed with certainty
    • labeling of entities (persons, places, institutions, works, events)
    • editorial commentary, including information about responsibility and uncertainty (in development)
    • intertextuality information (in development)
    • choice of fonts and font sizes
    • facsimiles displayed through a plugin viewer
    • full-screen view
    • detailed editorial metadata
    • correspondence metadata

Image Digitization

The documents have been reproduced at 600 dpi and 24 bit/RGB through an Image Access WideTEK®25 color flatbed scanner and saved in an uncompressed TIFF format. Selected materials (images 5 and 7 in https://amp.acdh.oeaw.ac.at/amp-transcript__0039.html, images 3 and 5 in https://amp.acdh.oeaw.ac.at/amp-transcript__0040.html, as well as all photographs) have been digitized at 603.1 dpi and 24 bit by means of a Phase One XF IQ3 100MP medium-format camera with a Schneider Kreuznach 80mm f/2.8 leaf shutter lens. The high-resolution digital reproductions can be accessed via https://id.acdh.oeaw.ac.at/auden-musulin-papers/facs. On the website, images are provided through IIIF.

The images of Autograph Letter Signed Stella Musulin to Christine Busta 1962-05-23 have been created by the Literary Archive of the Austrian National Library.

In collaboration with the Computer Vision Lab of TU Wien, the surface of selected source documents (images 6 and 8 in https://amp.acdh.oeaw.ac.at/amp-transcript__0039.html, images 4 and 6 in https://amp.acdh.oeaw.ac.at/amp-transcript__0040.html) has been visualized by means of Photometric Stereo, using a PhaseOne IQ260 Achromatic with an 8,964 × 6,716 pixel medium-format sensor and a Schneider-Kreuznach 120 mm LS Macro lens (see https://amp.acdh.oeaw.ac.at/cv.html).

Transcription and Markup Guidelines amp-data v0.9.0

Source transcriptions as encoded in TEI/XML represent the spelling, punctuation, and, partly, revisions as they appear, aiming for minimal editorial intervention or interpretation.

  1. Spelling
    1. The original spelling in both manuscripts and typescripts is retained, even if irregular.
    2. Characters that cannot be transcribed with certainty (for whichever reason) are marked up as <unclear> in XML and can be highlighted in grey on the web page.
    3. Handwritten characters are not marked up as <unclear> in XML when they can be identified as a consistent and typical feature of the respective author's handwriting.
    4. Signatures are never marked up as <unclear>.
    5. Characters that cannot be transcribed are marked up as <gap> in XML and represented by means of square brackets on the web page.
    6. Superscripts in the original documents have not yet been marked up and are represented in standard script on the web page.
  1. Numericals
    1. For typescripts, the characters 'I' (uppercase i) and 'O' (uppercase o) designating the numbers 1 and 0 (zero), respectively, are preserved.
    2. For manuscripts, the anglophone variant writing of the numeral 1 is represented by '1'.
  1. Punctuation and Spaces
    1. The original punctuation is retained, even if irregular.
    2. Punctuation signs that cannot be transcribed with certainty are rendered by UTF-8/Unicode symbols that represent a graphical approximation, marked up as <unclear> in XML.
    3. For manuscripts, when spaces before or after punctuation marks are significantly smaller than the spaces separating words in the surrounding text, no spaces are transcribed.
    4. For manuscripts, the length of spaces between words is regularized and represented as either single space or no space.
    5. For typescripts, the exact number of spaces before and within sentences is preserved and marked up as <space> in TEI/XML. For spaces in two-column layout pages, most likely generated by means of the tab function or by resetting the typewriter's margin stop, approximative values of spaces have been indicated.
    6. Indentations of typed lines (of poetry) without reference value in relation to the left-hand margin are marked up as <space extent="undefined quantity of characters" min="1" unit="char">.
    7. Changes in the left-hand margins of typescripts are not marked up as indentations (by means of <space>) if they appear to result from reinsertions of the sheets of paper or resets of margins on the typewriter that do not appear meaningful.
    8. Spaces that precede printed text have so far been disregarded.
    9. Blank lines between lines of writing are disregarded.
  1. Page Layout
    1. Original line breaks are retained by means of <lb> within structural elements of text, but not in between block-level elements (such as in between <p>, <salute>, or <signed> elements). Within structural elements, the TEI <lb> element represents standard line spacing (most likely produced by operating the typewriter's carriage-return lever); in between elements, spacing (possibly produced by means of operating the typewriter's variable line spacing knob) can vary, which is not represented in the markup. When prose paragraphs (<p>) are interrupted by page beginnings (<pb>), the continuing paragraph on the new page is marked up as <p prev="true">.
    2. For non-correspondence documents as well as for telegrams, aerograms, and postcards, a non-specific container structure is currently used: the transcription of each document is contained within a TEI <div> element; each transcription is subdivided by means of <pb>, <p>, and <lg> elements.
    3. For correspondence documents, excluding telegrams, aerograms, and postcards, a specific and semantically expressive TEI-conformant container structure is used. The most important structural elements are the following: <div type="transcription"> can contain <div type="envelope"> and <div type="letter">; <div type="letter"> can include <div type="letter_message"> and <div type="poem">; <div type="letter_message"> can contain <opener> and <closer> containers (in addition to <p>, which can be subdivided by means of <seg>); <opener> groups together dateline and salutation information from the start of the correspondence item; <closer> groups together salutation and signature information from the end of the correspondence item; <div type="poem"> can be further structured through the <cb> milestone element and can contain <head> and <lg> elements; <lg> nodes include <space> and <l> elements.
    4. The vertical succession of text elements is represented; their horizontal arrangement has so far been disregarded.
  1. Underline
    1. Underlines are marked up as <hi rend="underline"> and can optionally be graphically represented on the web page (highlighted in lilac). By default, they are represented in italics on the web page.
    2. Underlines are not represented if they form part of signatures.
    3. Underlines in typescripts are rendered precisely and include, for instance, empty spaces before or after words if they are underlined in the source document; transcriptions of underlines in manuscripts ignore underlined spaces before or after words.
  1. Revisions
    1. Deletions of text are marked up as <del> in XML and can be represented on the web page (highlighted in peach).
    2. For revisions in manuscripts and typescripts, we have accepted the orthographically or grammatically accurate versions as corrections, even if the actual sequences of textual layers cannot be reconstructed.
    3. For transcriptions of typescripts - in addition to handwritten deletions - we accept the use of the typed signs 'x', 'm', 's', 'z', 'y', '-', and '/' as indicating deletions where such deletion seems plausible.
    4. Additions have not yet been marked up and are not represented as additions on the website, but are treated as part of the original documents.
    5. When additions are inserted into manuscripts or typescripts, surrounding spaces are included in ways that are coherent with the general use of spaces in the respective texts.
    6. When additions replace deleted words in typescripts, they are inserted directly after the deletions without additional spaces in between.
    7. When it is uncertain whether characters have been deleted, the passage in question is marked up as '<unclear><del>[...]<del/><unclear/>'.
    8. For typescripts by Stella Musulin, we accept handwritten underlines of deleted passages as cancelled deletions.
    9. Multiple overwritings in the same place of a typescript (for instance, when character 'x' is overwritten by character 'y', which, in turn, is overwritten by 'z') are represented as nested deletions: '<del><del>x</del>y</del>z'
  1. Pre-printed Components and Postal Additions
    1. Pre-printed letter heads and sender addresses are transcribed.
    2. Image captions and company information in postcards are transcribed.
    3. Postmarks and stamps are not transcribed in the <text> part. Organization (e.g., post office), place, date, and time (captured in terms of @notBefore-iso and @notAfter-iso) information has been extracted and included in the <correspDesc> section of the <teiHeader>. In accordance with the TEI P5 Guidelines, the individual stages in the correspondence process are marked up as <correspAction>. The following values of the @type attribute are used: "sent" for information that refers to the first processing of the correspondence item by the outgoing post office as retrieved from the postmark on the front of the envelope; "transmitted" for information on the further processing of the correspondence item by the incoming post office(s) as retrieved from the backstamp(s); "redirected" for information on the place (as well as, if applicable, on the person or organization) related to redirecting the unopened correspondence item to another destination; "received" for information on the person (and, if applicable, place) related to the last indicated destination on the front of the envelope. Information on the place and date of writing is encoded within the <fileDesc> node of the <teiHeader>. Where no envelope has been preserved, only the <correspAction> of @type "received" is documented. Copies of correspondence items are not treated as correspondence items and no <correspDesc> has been recorded.
    4. Pre-printed postal texts and images in telegrams and aerograms are currently not rendered in the transcriptions.
    5. Watermarks have not yet been transcribed.
  1. Writing technologies
    1. Writing technologies (handwriting, typewriting, print) are marked up by means of the @hand attribute in XML and are represented through different fonts on the web page.
    2. The <handShift> element indicates an immediate change of writing technologies within a segment of transcription.
  1. Photographs
    1. All texts pertaining to the photograph documents are transcribed.
    2. Persons depicted in the photographs or referred to in the accompanying texts are recorded in the <particDesc> section of the <teiHeader>. Photographers are recorded in the TEI <author> element.
  1. Markup of contents
    1. Extra-textual entities (persons, places, organizations, works, and events) are marked up in XML by means of <rs> and can optionally be displayed on the website, where that display is enriched with additional information and authority-file identifiers.
    2. Editorial comments are marked up in XML within the <interpGrp> element of the <teiHeader>. The respective elements within the textual data or metadata to which the editorial interpretations refer are linked to these via the @ana attribute. The <desc> element within each <interp> node contains a human-readable comment; furthermore, TEI elements have been used to formalize that comment in a machine-readable format (in development).
  1. Language information is encoded in XML by means of the TEI <foreign> element and the @xml:lang attribute (in development).
  1. In the <teiHeader> section, metadata related to the source material are encoded within the <sourceDesc> node; edition metadata are represented in the <publicationStmt>, <editionStmt>, and <titleStmt> nodes.