Black-box language model explanation by context length probing

Ondřej Cífka; Antoine Liutkus

doi:10.18653/v1/2023.acl-short.92

Communication Dans Un Congrès Année : 2023

Black-box language model explanation by context length probing

(1) , (1)

Ondřej Cífka

Fonction : Auteur
PersonId : 179459
IdHAL : ondrej-cifka
ORCID : 0000-0002-6268-6445

Scientific Data Management

Antoine Liutkus

Fonction : Auteur
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Scientific Data Management

Résumé

The increasingly widespread adoption of large language models has highlighted the need for improving their explainability. We present context length probing, a novel explanation technique for causal language models, based on tracking the predictions of a model as a function of the length of available context, and allowing to assign differential importance scores to different contexts. The technique is model-agnostic and does not rely on access to model internals beyond computing token-level probabilities. We apply context length probing to large pre-trained language models and offer some initial analyses and insights, including the potential for studying long-range dependencies. The source code and a demo of the method are available.

Domaines

Informatique et langage [cs.CL] Intelligence artificielle [cs.AI] Apprentissage [cs.LG]

Fichier principal

main.pdf (749.85 Ko)

Origine	Fichiers produits par l'(les) auteur(s)
Licence	Copyright (Tous droits réservés)

Ondřej Cífka : Connectez-vous pour contacter le contributeur

https://hal.umontpellier.fr/hal-03917930

Soumis le : lundi 13 novembre 2023-10:58:25

Dernière modification le : jeudi 7 novembre 2024-16:14:03

Dates et versions

hal-03917930 , version 1 (13-11-2023)

Licence

Identifiants

HAL Id : hal-03917930 , version 1
ARXIV : 2212.14815
DOI : 10.18653/v1/2023.acl-short.92

Citer

Ondřej Cífka, Antoine Liutkus. Black-box language model explanation by context length probing. ACL 2023 - 61st Annual Meeting of the Association for Computational Linguistics, Jul 2023, Toronto, Canada. pp.1067--1079, ⟨10.18653/v1/2023.acl-short.92⟩. ⟨hal-03917930⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA ZENITH LIRMM INRIA2 UNIV-MONTPELLIER ANR NUMEV

75 Consultations

36 Téléchargements

Black-box language model explanation by context length probing

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager