Dating medieval texts by classification with flexible time intervals

Gregory Toner, Xiwu Han

Research output: Contribution to journalArticle

Abstract

Determining the date of writing is a practical problem often encountered in the study of ancient and medieval texts. The problem is compounded by differences in genre, register and style, but in particular by our frequent dependence on much later copies of lost originals. This article examines how a method of classification with flexible time intervals has been developed for a corpus of medieval Irish annals (c.700–c.1600). It is shown that the method can deal successfully with an unparsed, complex corpus containing contaminated data. A method of extending the model from good quality, uncontaminated data to more complex texts is also demonstrated by identifying temporal characteristics and typical entries in the high-quality source.
Original languageEnglish
Article numberfqz031
Number of pages12
JournalDigital Scholarship in the Humanities
DOIs
Publication statusPublished - 18 Jun 2019

Fingerprint

data quality
genre
time
Medieval Period
Time Interval
Annals

Bibliographical note

Not for REF

Cite this

@article{465906da84d24a7e975d04efc786d13f,
title = "Dating medieval texts by classification with flexible time intervals",
abstract = "Determining the date of writing is a practical problem often encountered in the study of ancient and medieval texts. The problem is compounded by differences in genre, register and style, but in particular by our frequent dependence on much later copies of lost originals. This article examines how a method of classification with flexible time intervals has been developed for a corpus of medieval Irish annals (c.700–c.1600). It is shown that the method can deal successfully with an unparsed, complex corpus containing contaminated data. A method of extending the model from good quality, uncontaminated data to more complex texts is also demonstrated by identifying temporal characteristics and typical entries in the high-quality source.",
author = "Gregory Toner and Xiwu Han",
note = "Not for REF",
year = "2019",
month = "6",
day = "18",
doi = "10.1093/llc/fqz031",
language = "English",
journal = "Digital Scholarship in the Humanities",
issn = "2055-7671",
publisher = "Oxford University Press",

}

Dating medieval texts by classification with flexible time intervals. / Toner, Gregory; Han, Xiwu.

In: Digital Scholarship in the Humanities, 18.06.2019.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Dating medieval texts by classification with flexible time intervals

AU - Toner, Gregory

AU - Han, Xiwu

N1 - Not for REF

PY - 2019/6/18

Y1 - 2019/6/18

N2 - Determining the date of writing is a practical problem often encountered in the study of ancient and medieval texts. The problem is compounded by differences in genre, register and style, but in particular by our frequent dependence on much later copies of lost originals. This article examines how a method of classification with flexible time intervals has been developed for a corpus of medieval Irish annals (c.700–c.1600). It is shown that the method can deal successfully with an unparsed, complex corpus containing contaminated data. A method of extending the model from good quality, uncontaminated data to more complex texts is also demonstrated by identifying temporal characteristics and typical entries in the high-quality source.

AB - Determining the date of writing is a practical problem often encountered in the study of ancient and medieval texts. The problem is compounded by differences in genre, register and style, but in particular by our frequent dependence on much later copies of lost originals. This article examines how a method of classification with flexible time intervals has been developed for a corpus of medieval Irish annals (c.700–c.1600). It is shown that the method can deal successfully with an unparsed, complex corpus containing contaminated data. A method of extending the model from good quality, uncontaminated data to more complex texts is also demonstrated by identifying temporal characteristics and typical entries in the high-quality source.

U2 - 10.1093/llc/fqz031

DO - 10.1093/llc/fqz031

M3 - Article

JO - Digital Scholarship in the Humanities

JF - Digital Scholarship in the Humanities

SN - 2055-7671

M1 - fqz031

ER -