Skip to main navigation Skip to search Skip to main content

Large Language Models for Analyzing Enterprise Architecture Debt in Unstructured Documentation

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

8 Downloads (Pure)

Abstract

Enterprise Architecture Debt (EA Debt) arises from suboptimal design decisions and misaligned components that can degrade an organization’s IT landscape over time. Early indicators, Enterprise Architecture Smells (EA Smells), are currently mainly detected manually or only from structured artifacts, leaving much unstructured documentation under-analyzed. This study proposes an approach using a large language model (LLM) to identify and quantify EA Debt in unstructured architectural documentation. Following a design science research approach, we design and evaluate an LLM-based prototype for automated EA Smell detection. The artifact ingests unstructured documents (e.g., process descriptions, strategy papers), applies fine-tuned detection models, and outputs identified smells. We evaluate the prototype through a case study using synthetic yet realistic business documents, benchmarking against a custom GPT-based model. Results show that LLMs can detect multiple predefined EA Smells in unstructured text, with the benchmark model achieving higher precision and processing speed, and the fine-tuned on-premise model offering data protection advantages. The findings highlight opportunities for integrating LLM-based smell detection into EA governance practice.
Original languageEnglish
Title of host publicationSAC'26
Subtitle of host publicationProceedings of the 41st ACM/SIGAPP Symposium on Applied Computing
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery (ACM)
Pages815-822
Number of pages8
ISBN (Print)979-8-4007-2294-3
DOIs
Publication statusAccepted/In press - 2026
Event41st ACM/SIGAPP Symposium On Applied Computing, SAC 2026 - Grand Hotel Palace, Thessaloniki, Greece
Duration: 23 Mar 202627 Mar 2026
Conference number: 41
https://www.sigapp.org/sac/sac2026/

Conference

Conference41st ACM/SIGAPP Symposium On Applied Computing, SAC 2026
Abbreviated titleSAC 2026
Country/TerritoryGreece
CityThessaloniki
Period23/03/2627/03/26
Internet address

Keywords

  • Enterprise Architecture Debt
  • Enterprise Architecture Smells
  • LLM
  • Unstructured Data Analysis

Fingerprint

Dive into the research topics of 'Large Language Models for Analyzing Enterprise Architecture Debt in Unstructured Documentation'. Together they form a unique fingerprint.

Cite this