On the Relationship Between RNN Hidden-State Vectors and Semantic Structures

Edi Muškardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

We examine the assumption that hidden-state vectors of recurrent neural networks (RNNs) tend to form clusters of semantically similar vectors, which we dub the clustering hypothesis. While this hypothesis has been assumed in RNN analyses in recent years, its validity has not been studied thoroughly on modern RNN architectures. We first consider RNNs that were trained to recognize regular languages. This enables us to draw on perfect ground-truth automata in our evaluation, against which we can compare the RNN's accuracy and the distribution of the hidden-state vectors. Then, we consider context-free languages to examine if RNN states form clusters for more expressive languages. For our analysis, we fit (generalized) linear models to classify RNN states into automata states and we apply different unsupervised clustering techniques. With a new ambiguity score, derived from information entropy, we measure how well an abstraction function maps the hidden state vectors to abstract clusters. Our evaluation supports the validity of the clustering hypothesis for regular languages, especially if RNNs are well-trained, i.e., clustering techniques succeed in finding clusters of similar state vectors. However, the clustering accuracy decreases substantially for context-free languages. This suggests that clustering is not a reliable abstraction technique for RNNs used in tasks like natural language processing.

Originalspracheenglisch
Titel62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference
Redakteure/-innenLun-Wei Ku, Andre Martins, Vivek Srikumar
Herausgeber (Verlag)Association for Computational Linguistics (ACL)
Seiten5641-5658
Seitenumfang18
ISBN (elektronisch)9798891760998
DOIs
PublikationsstatusVeröffentlicht - 2024
VeranstaltungFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Bangkok, Hybrid, Thailand
Dauer: 11 Aug. 202416 Aug. 2024

Publikationsreihe

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Konferenz

KonferenzFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
Land/GebietThailand
OrtBangkok, Hybrid
Zeitraum11/08/2416/08/24

ASJC Scopus subject areas

  • Angewandte Informatik
  • Linguistik und Sprache
  • Sprache und Linguistik

Fingerprint

Untersuchen Sie die Forschungsthemen von „On the Relationship Between RNN Hidden-State Vectors and Semantic Structures“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren