Cross-lingual Approaches for the Detection of Adverse Drug Reactions in German from a Patient's Perspective - Laboratoire Interdisciplinaire des Sciences du Numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

Cross-lingual Approaches for the Detection of Adverse Drug Reactions in German from a Patient's Perspective

Résumé

In this work, we present the first corpus for German Adverse Drug Reaction (ADR) detection in patient-generated content. The data consists of 4,169 binary annotated documents from a German patient forum, where users talk about health issues and get advice from medical doctors. As is common in social media data in this domain, the class labels of the corpus are very imbalanced. This and a high topic imbalance make it a very challenging dataset, since often, the same symptom can have several causes and is not always related to a medication intake. We aim to encourage further multilingual efforts in the domain of ADR detection and provide preliminary experiments for binary classification using different methods of zero-and few-shot learning based on a multilingual model. When fine-tuning XLM-RoBERTa first on English patient forum data and then on the new German data, we achieve an F1-score of 37.52 for the positive class. We make the dataset and models publicly available for the community.
Fichier principal
Vignette du fichier
Raithel_LREC2022.pdf (966.59 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03866409 , version 1 (22-11-2022)

Identifiants

  • HAL Id : hal-03866409 , version 1

Citer

Lisa Raithel, Philippe Thomas, Roland Roller, Oliver Sapina, Sebastian Möller, et al.. Cross-lingual Approaches for the Detection of Adverse Drug Reactions in German from a Patient's Perspective. 13th Conference on Language Resources and Evaluation, Jun 2022, Marseille, France. ⟨hal-03866409⟩
20 Consultations
16 Téléchargements

Partager

Gmail Facebook X LinkedIn More