DissertationFirst publicationDOI: 10.48548/pubdata-2823

Machine Learning Canvases for Project Support and Application of Graph Neural Networks in Invoice Recognition

Chronological data

Date of first publication2026-01-16
Date of publication in PubData 2026-01-16
Date of defense2025-12-19

Language of the resource

English

Related external resources

Related part DOI: 10.18420/informatik2021-101
Thiée, L.-W. (2021). A systematic Literature Review of Machine Learning Canvases, Informatik 2021 - Die 51. Jahrestagung der Gesellschaft für Informatik in: Computer Science and Sustainability. Gesellschaft für Informatik e.V. (GI) (Hrsg.). Bonn: Gesellschaft für Informatik e.V., S. 1221-1235. (Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI); Band P-314).
Related part URL: https://aisel.aisnet.org/wi2022/ai/ai/13/
Thiée, L.-W. (2022). Developing an ontology for data science projects to facilitate the design process of a canvas. Wirtschaftsinformatik 2022 Proceedings, 13.
Related part DOI: 10.18420/inf2023_180
Thiée, L., Krieger, F. & Funk, B. (2023). Extraction of Information from Invoices – Challenges in the Extraction Pipeline. Informatik 2023 - Designing Futures: Zukünfte gestalten, 1777-1792. Bonn: Gesellschaft für Informatik e.V.. Wirtschaft, Management Industrie - Joint Workshop IntDig 2023 MOC 2023; Intelligente Digitalisierung, (KI-basiertes) Management und Optimierung komplexer Systeme. Berlin. 26.-29. September 2023

Abstract

Motivation: Modern machine learning (ML) projects are complex in many respects. From an organizational perspective, this concerns, for example, the number and type of stakeholders or the collaboration of development teams. From a technical perspective, high requirements arise from the development of algorithms or the size, availability, and processing of data. These various types of complexity make it difficult for small organizations to implement ML projects purposefully. For this reason, conceptual and technical solutions must be sought to support users, managers, and developers in these contexts. From a conceptual point of view, it is advisable to create clear process descriptions and overviews in order to gain clarity about the project at hand. From a technical perspective, concrete ML use cases should be implemented to overcome technical obstacles. In the specific use case of this thesis, this refers to the improvement of Graph Attention Networks (GAT) for invoice recognition (IR). The research question - which methods and concrete models support Small and medium-sized enterprises in implementing ML - as well as the application context are derived from the collaboration with a partner company. Research Method: In this dissertation, a holistic approach is applied to advance the Data Science domain. Design Science Research and Case Studies are used, to make conceptual as well as technical progress for machine learning projects. A comprehensive literature review builds the foundation for the conceptual section, upon which artifacts are built. Model development in program code as well as evaluation form the central part of the technical section. From the big picture to the detailed view, general collaboration and prioritization aspects in ML projects are first explored and then the ML use case of invoice recognition with Graph Neural Networks is examined in detail. This mixed methods approach allows for a comprehensive view of the field of application. Contribution: Five research papers result in two major contributions for the practice and research community - a catalog and a model. The catalog lists ML-canvases and their corresponding fields and questions. It is derived from a comprehensive review of the related literature. This catalog supports the initiation and consistent implementation of machine learning projects. In particular, specific questions help to understand the problem that needs to be solved, the value that is supposed to be generated through the ML project, and the steps to reach a feasible solution. The second contribution is a specific model for invoice recognition. In this use case, special challenges and information types of invoice documents - visually rich documents - are analyzed and categorized. Subsequently, a pre-trained multi-modal Graph Attention Network is enhanced and tested on different datasets. Integrating an LLM in a Graph Attention Network to provide semantic embeddings is a new research approach in this thesis. Applying the model to English and German invoice documents, a significant improvement in token classification can be achieved. These contributions support machine learning projects on both a conceptual and a technical level.

Keywords

Machine Learning Canva; Invoice Recognition; Graph Attention Network

Grantor

Leuphana University Lüneburg

Study programme

More information

DDC

006 :: Spezielle Computerverfahren

Creation Context

Research