Pre-trained language models in Spanish for health insurance coverage

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Scopus citations

Abstract

The field of clinical natural language processing (NLP) can extract useful information from clinical text. Since 2017, the NLP field has shifted towards using pre-trained language models (PLMs), improving performance in several tasks. Most of the research in this field has focused on English text, but some PLMs are available in Spanish. In this work, we use clinical PLMs to analyze text from admission and medical reports in Spanish for an insurance and health provider to give a probability of no coverage in a labor insurance process. Our results show that fine-tuning a PLM pre-trained on the provider's data leads to better results, but this process is time-consuming and computationally expensive. At least for this task, fine-tuning a publicly available clinical PLM leads to results comparable to those of a custom PLM, but in less time and with fewer resources. Analyzing large volumes of insurance requests is burdensome for employers, and models can ease this task by pre-classifying reports that are likely not to have coverage. Our approach of using only clinical text improves on the current models while reinforcing the idea of clinical support systems that simplify human labor but do not replace it. To our knowledge, the clinical corpus collected for this study is the largest one reported for the Spanish language.

Original language: English
Title of host publication: 5th Workshop on Clinical Natural Language Processing, ClinicalNLP 2023 - Proceedings of the Workshop
Publisher: Association for Computational Linguistics (ACL)
Pages: 433-438
Number of pages: 6
ISBN (Electronic): 9781959429883
State: Published - 2023
Externally published: Yes
Event: 5th Workshop on Clinical Natural Language Processing, ClinicalNLP 2023, held at ACL 2023 - Toronto, Canada
Duration: 14 Jul 2023 → …

Publication series

Name: Proceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print): 0736-587X

Conference

Conference: 5th Workshop on Clinical Natural Language Processing, ClinicalNLP 2023, held at ACL 2023
Country/Territory: Canada
City: Toronto
Period: 14/07/23 → …
