Recent Publications & Digital Projects
Journal Articles
-
(2025) “Navigational lexicography: The model of European phrasebooks in France and the Americas.” Exemplaria, 36(4), 367–390. https://doi.org/10.1080/10412573.2024.2437337
This article examines early modern phrasebooks as models for translingual navigation and cultural contact, focusing on France’s linguistic exchanges with the Americas.
Public Humanities
-
(2024) “A Thousand Scripts, One Model: Transcribing 19th-Century Penn Medical Dissertations using Handwritten Text Recognition.” Essay featured on Unique at Penn.
A public-facing post on applying handwritten text recognition to 19th-century Penn medical dissertations, written for the Unique at Penn blog.
Digital Projects
-
“'Notes de lecture' on André Thevet, Cosmographie universelle (1575).” Hugging Face Repository.
A 15-page excerpt of an anonymous 16th-century commonplace book held at Princeton University Library, containing reading notes and citations drawn from André Thevet’s Cosmographie universelle (1575). Transcribed for research in digital paleography, OCR, and text reuse.
-
“Ms. Moreau 841 — Jacques Cartier, Relation (1534).” Hugging Face Repository.
Line-level transcriptions and cropped manuscript images from the BnF manuscript Ms. Moreau 841, a handwritten copy of Jacques Cartier’s Relation of his 1534 voyage. Provides training and evaluation data for handwritten text recognition and historical French language modeling.
-
“Excerpts from LJS380 - Sextus Empiricus, Pros mathēmatikous.” Hugging Face Repository.
Line-level transcriptions and cropped manuscript images of a 15th-century manuscript copy of the Greek treatise Pros mathēmatikous attributed to Sextus Empiricus, held in the Lawrence J. Schoenberg Collection at Penn Libraries. Manuscript lines were transcribed to match an 1842 public domain edition by August Immanuel Bekker and were used to train a machine learning model for recognizing handwritten Greek in the context of a Hellenic studies workshop.
-
“Ms. 24269 Dataset.”
A forthcoming dataset of early modern manuscript transcriptions, prepared for analysis of text reuse patterns.
-
(In preparation). At the Limits of Legibility.
This monograph explores the dialogical relationship between French language standardization and early modern imperialism. It employs computational methods (computer vision, stylometrics, handwritten text recognition) to examine how technology has shaped legibility from the 16th century to the present.
-
Interactive prototypes created for an NEH-funded project. Map and timeline examples.
Interactive map and timeline prototypes created for an inter-institutional project, a collaboration between Princeton's Department of Spanish and Portuguese, Princeton University Library, and institutions and archives in the UK and the Philippines.