Abstract
The digitalization of asset management within the architecture, engineering and construction (AEC) sector is in need of effective methods for the automatic classification of documents. This study focuses on the development and evaluation of multimodal document classification models, utilizing visual, textual, and layout-related document information. We examine various state-of-the-art machine learning models and combine them through an iterative development process. The performance of these models is evaluated on two different AEC-document datasets. The results demonstrate that each of the modalities is useful in classifying the documents, as well as the integration of the different information types. This study contributes by applying AI techniques, specifically document classification in the AEC sector, setting the initial step to automating information extraction and processing for Intelligent Asset Management, and lastly, by combining and comparing multimodal state-of-the-art classification models on real-life datasets.
| Original language | English |
|---|---|
| Article number | 200609 |
| Number of pages | 18 |
| Journal | Intelligent Systems with Applications |
| Volume | 29 |
| Early online date | 19 Nov 2025 |
| DOIs | |
| Publication status | Published - Mar 2026 |
Keywords
- UT-Gold-D
- Construction 4.0
- Deep learning
- Document classification
- Intelligent asset management
- Machine learning
- Multi-modal classification
- AEC
Fingerprint
Dive into the research topics of 'Multi-modal document classification in AEC asset management'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver