Unveiling Strengths and Weaknesses of NLP Systems Based on a Rich Evaluation Corpus: the Case of NER in French

Alice Millour; Yoann Dupont; Karën Fort; Liam Duignan

Communication Dans Un Congrès Année : 2024

Unveiling Strengths and Weaknesses of NLP Systems Based on a Rich Evaluation Corpus: the Case of NER in French

(1) , (2) , (3, 4) , (1, 5, 6, 7)

1
2
3
4
5
6
7

Alice Millour

Fonction : Auteur
PersonId : 21553
IdHAL : alice-millour
ORCID : 0009-0005-6920-4196
IdRef : 253127947

Laboratoire d'Informatique Avancée de Saint-Denis

Yoann Dupont

Fonction : Auteur
PersonId : 169798
IdHAL : yoann-dupont
IdRef : 224619225

Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094

Karën Fort

Fonction : Auteur
PersonId : 2215
IdHAL : karen-fort
ORCID : 0000-0002-0723-8850
IdRef : 176299548

Semantic Analysis of Natural Language

Sorbonne Université

Liam Duignan

Fonction : Auteur
PersonId : 1371719
IdHAL : liam-duignan

Laboratoire d'Informatique Avancée de Saint-Denis

Université Paris 8 Vincennes-Saint-Denis

Université Paris Cité

ASTEK

Résumé

Named Entity Recognition (NER) is an applicative task for which annotation schemes vary. To compare the performance of systems which tagsets differ in precision and coverage, it is necessary to assess (i) the comparability of their annotation schemes and (ii) the individual adequacy of the latter to a common annotation scheme. What is more, and given the lack of robustness of some tools towards textual variation, we cannot expect an evaluation led on an homogeneous corpus with low-coverage to provide a reliable prediction of the actual tools performance. To tackle both these limitations in evaluation, we provide a gold corpus for French covering 6 textual genres and annotated with a rich tagset that enables comparison with multiple annotation schemes. We use the flexibility of this gold corpus to provide both: (i) an individual evaluation of four heterogeneous NER systems on their target tagsets, (ii) a comparison of their performance on a common scheme. This rich evaluation framework enables a fair comparison of NER systems across textual genres and annotation schemes.

Mots clés

Named Entity Recognition (NER) Evaluation Textual variation

Domaines

Informatique [cs] Informatique et langage [cs.CL]

Fichier principal

FENEC_LREC_2024.pdf (230.05 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Alice Millour : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04534593

Soumis le : vendredi 5 avril 2024-16:06:25

Dernière modification le : vendredi 19 avril 2024-16:18:57

Dates et versions

hal-04534593 , version 1 (05-04-2024)

Identifiants

HAL Id : hal-04534593 , version 1

Citer

Alice Millour, Yoann Dupont, Karën Fort, Liam Duignan. Unveiling Strengths and Weaknesses of NLP Systems Based on a Rich Evaluation Corpus: the Case of NER in French. LREC-COLING 2024, May 2024, Turin, Italy. ⟨hal-04534593⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS UNIV-PARIS8 CNRS INRIA UNIV-PARIS3 LATTICE LIASD UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD PSL SORBONNE-UNIVERSITE UNIV-PARIS-LUMIERES UNIV-PARIS8-OA

241 Consultations

53 Téléchargements

Unveiling Strengths and Weaknesses of NLP Systems Based on a Rich Evaluation Corpus: the Case of NER in French

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager