SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials

Mathilde Aguiar; Pierre Zweigenbaum; Nona Naderi

doi:10.48550/arXiv.2404.03977

Preprint, Working Paper. Year: 2024

Mathilde Aguiar

Role: Author

Laboratoire Interdisciplinaire des Sciences du Numérique

Sciences et Technologies des Langues - LISN

Université Paris-Saclay

Pierre Zweigenbaum

Role: Author

Sciences et Technologies des Langues - LISN

Laboratoire Interdisciplinaire des Sciences du Numérique

Université Paris-Saclay

Nona Naderi

Role: Author

Sciences et Technologies des Langues - LISN

Laboratoire Interdisciplinaire des Sciences du Numérique

Université Paris-Saclay

Abstract

This paper describes our submission to Task 2 of SemEval-2024: Safe Biomedical Natural Language Inference for Clinical Trials. The Multi-evidence Natural Language Inference for Clinical Trial Data (NLI4CT) task is a Textual Entailment (TE) task that evaluates the consistency and faithfulness of Natural Language Inference (NLI) models applied to Clinical Trial Reports (CTRs). We test two distinct approaches: one based on fine-tuning and ensembling Masked Language Models, the other based on prompting Large Language Models with templates, in particular Chain-of-Thought and Contrastive Chain-of-Thought. Our best system prompts Flan-T5-large in a 2-shot setting and achieves an F1 score of 0.57, Faithfulness of 0.64, and Consistency of 0.56.
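
To make the 2-shot prompting setup mentioned in the abstract more concrete, here is a minimal Python sketch of how a few-shot textual entailment prompt could be assembled. The template wording, the field names (`premise`, `statement`, `label`), the label strings, and the example texts are all assumptions made for illustration; this record does not give the actual templates used in the submission, and the sketch does not call any model.

```python
from dataclasses import dataclass


@dataclass
class Example:
    """One labelled demonstration (all texts here are hypothetical)."""
    premise: str    # excerpt from a clinical trial report
    statement: str  # claim to check against the premise
    label: str      # assumed label set: "Entailment" or "Contradiction"


def build_few_shot_prompt(demos, premise, statement):
    """Concatenate labelled demonstrations, then the unlabelled query."""
    blocks = [
        f"Premise: {d.premise}\nStatement: {d.statement}\nAnswer: {d.label}"
        for d in demos
    ]
    # The query repeats the same layout but leaves the answer open.
    blocks.append(f"Premise: {premise}\nStatement: {statement}\nAnswer:")
    return "\n\n".join(blocks)


# Two demonstrations give a 2-shot prompt.
demos = [
    Example("Patients received 10 mg of drug A daily.",
            "Participants took drug A every day.",
            "Entailment"),
    Example("The trial enrolled adults aged 18 to 65.",
            "The trial enrolled children under 12.",
            "Contradiction"),
]
prompt = build_few_shot_prompt(
    demos,
    "The primary outcome was measured at 12 weeks.",
    "The primary outcome was measured at 6 months.",
)
print(prompt)
```

The resulting string would then be passed to a generative model such as Flan-T5-large, whose completion after the final "Answer:" is read as the predicted label.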

Keywords

Computation and Language (cs.CL); FOS: Computer and information sciences

Domains

Artificial Intelligence [cs.AI]; Computer Science [cs]

Contributor: Nona Naderi

https://hal.science/hal-04536600

Submitted on: Monday, April 8, 2024, 12:12:54

Last modified on: Wednesday, April 10, 2024, 03:28:09

Dates and versions

hal-04536600, version 1 (08-04-2024)

Identifiers

HAL Id: hal-04536600, version 1
DOI: 10.48550/arXiv.2404.03977

Cite

Mathilde Aguiar, Pierre Zweigenbaum, Nona Naderi. SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials. 2024. ⟨hal-04536600⟩

Collections

CNRS INRIA CENTRALESUPELEC UNIV-PARIS-SACLAY LISN GS-COMPUTER-SCIENCE
