Vague, Incomplete, Subjective, and Uncertain Information in Digital History

Mariani, Fabio

doi:10.48548/pubdata-3038

DissertationFirst publicationDOI: 10.48548/pubdata-3038

Vague, Incomplete, Subjective, and Uncertain Information in Digital History

Downloads

Mariani_Vague_Incomplete_Subjective_and_Uncertain_Information_in_Digital_History_Diss.pdf8 MB

Open Access

Chronological data

Date of first publication2026-02-20

Date of publication in PubData 2026-02-20

Date of defense2025-10-16

Language of the resource

English

Related external resources

Related part

DOI: 10.53269/9780865593152/06
Rother, L., Koss, M. & Mariani, F. (2022). Taking Care of History: Toward a Politics of Provenance Linked Open Data in Museums. In E. L. Fry, & E. Canning (Eds.), Perspectives on Data. Art Institute of Chicago.

Related part

DOI: 10.14361/9783839467107-014
Mariani, F, Rother, L., & Koss, M. (2023). Teaching Provenance to AI. An Annotation Scheme for Museum Data. In S. Thiel, & J. C. Bernhardt (Hrsg.), AI in Museums (S. 163-172). Transcript.

Related part

DOI: 10.1515/jbwg-2023-0005
Rother, L., Mariani, F. & Koss, M. (2023). Hidden Value: Provenance as a Source for Economic and Social History. Jahrbuch für Wirtschaftsgeschichte / Economic History Yearbook, 64(1), 111–142.

Related PubData resources

Supplemented by

Resource

Dataset

Supplementary Research Materials for the PhD Thesis "Vague, Incomplete, Subjective, and Uncertain Information in Digital History"

Mariani, Fabio

2026

Author

Mariani, Fabio

Referee

Rother, Lynn

Usbeck, Ricardo

Daquino, Marilena

Advisor

Rother, Lynn

Usbeck, Ricardo

Abstract

This cumulative dissertation investigates how digital infrastructures can accommodate the epistemic complexity of historical knowledge, focusing on the specific challenges posed by vague, incomplete, subjective, and uncertain (VISU) information. Using art provenance as a focused domain of inquiry, the research addresses how such information can be identified, structured, and preserved in digital formats without flattening its interpretative depth. The central hypothesis underpinning this work is that VISU information can be effectively preserved and meaningfully operationalised in digital history only through workflows that integrate computational methods with interpretive oversight. Specifically, this requires combining automated extraction, adequate data modelling, and expert validation to ensure that the epistemic complexity of historical knowledge is not lost in the process of digitisation. This guiding hypothesis gives rise to three interrelated research questions. First, how can automated extraction processes be implemented to identify and flag VISU information for preservation in historical texts? Second, how can VISU information be formally represented in structured data models without sacrificing interpretive complexity? Third, how can expert validation be operationalised to safeguard VISU information during the transformation of historical data? To answer the first research question, the thesis identifies key natural language processing tasks such as sentence boundary detection and span categorisation, and develops a tailored annotation scheme to capture VISU features. This scheme is used in training and evaluation of models for automatic extraction of structured knowledge from provenance records. Addressing the second question, the research explores modelling strategies that extend CIDOC CRM by aligning CRMinf with the Historical Context Ontology (HiCO) and structuring data using nanopublications. This approach supports the formal representation of historical claims, interpretative assertions, and metadata from the digitisation process. Finally, in response to the third question, the thesis introduces PROV-A, a web based tool that integrates automated extraction with expert validation. It allows historians to refine extracted data, annotate epistemic qualifiers, and publish structured provenance information as linked open data, preserving interpretative depth within scalable workflows. These contributions are presented across seven publications that form the basis of this cumulative dissertation. Together, they establish a methodological approach for preserving the historiographical richness of provenance records as they are transformed into digital formats.

Keywords

Digital History; Provenance; Provenance Data; Artificial Intelligence; Web-based Tool; Linked Open Data

Grantor

Leuphana University Lüneburg

Leuphana Institution

Fakultät Kulturwissenschaften

More information

DDC

707 :: Ausbildung, Forschung, verwandte Themen zur bildenden und angewandten Kunst

Creation Context

Research

Collections

Literaturpublikationen

Vague, Incomplete, Subjective, and Uncertain Information in Digital History

Chronological data

Language of the resource

Related external resources

Related PubData resources

Supplemented by

Editor

Author

Referee

Advisor

Case provider

Other contributors

Abstract

Keywords

Grantor

Leuphana Institution

More information

DDC

Creation Context

Collections