TY - GEN
T1 - Effectiveness of neural language models for word prediction of textual mammography reports
AU - Marin, Mihai David
AU - Mocanu, Elena
AU - Seifert, Christin
PY - 2020/12/14
Y1 - 2020/12/14
N2 - Radiologists are required to write free paper text reports for breast screenings in order to assign cancer diagnoses in a later step. The current procedure requires considerable time and needs efficiency. In this paper, to streamline the writing process and keep up with the specific vocabulary, a word prediction tool using neural language models was developed. Consequently, challenges as different languages (English, Dutch), small data sizes and low computational power have been overcome by introducing a novel English-Dutch Radiology Language Modelling process. After defining model architectures, the process involves data preparation, bilevel hyperparameters optimization, configuration transfer and evaluation. The model is able to improve the current workflow and successfully meet the computational constraints, based on both an intrinsic and extrinsic evaluation. Given its flexibility, the model opens the door for future research involving other languages and also an extensive set of real-world applications.
AB - Radiologists are required to write free paper text reports for breast screenings in order to assign cancer diagnoses in a later step. The current procedure requires considerable time and needs efficiency. In this paper, to streamline the writing process and keep up with the specific vocabulary, a word prediction tool using neural language models was developed. Consequently, challenges as different languages (English, Dutch), small data sizes and low computational power have been overcome by introducing a novel English-Dutch Radiology Language Modelling process. After defining model architectures, the process involves data preparation, bilevel hyperparameters optimization, configuration transfer and evaluation. The model is able to improve the current workflow and successfully meet the computational constraints, based on both an intrinsic and extrinsic evaluation. Given its flexibility, the model opens the door for future research involving other languages and also an extensive set of real-world applications.
U2 - 10.1109/SMC42975.2020.9283304
DO - 10.1109/SMC42975.2020.9283304
M3 - Conference contribution
SN - 978-1-7281-8527-9
T3 - IEEE International Conference on Systems, Man, and Cybernetics (SMC)
BT - 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)
PB - IEEE
CY - Piscataway, NJ
T2 - IEEE International Conference on Systems, Man, and Cybernetics, SMC 2020
Y2 - 11 October 2020 through 14 October 2020
ER -