A Neural-based Architecture For Small Datasets Classification

Andi Rexha*, Mauro Dragoni, Roman Kern

*Korrespondierende/r Autor/-in für diese Arbeit

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung


Digital Libraries benefit from the use of text classification strategies since they are enablers for performing many document management tasks like Information Retrieval. The effectiveness of such classification strategies depends on the amount of available data and the classifier used. The former leads to the design of data augmentation solutions where new samples are generated into small datasets based on the semantic similarity between existing samples and concepts defined within external linguistic resources. The latter relates to the capability of finding, which is the best learning principle to adopt for designing an effective classification strategy suitable for the problem. In this work, we propose a neural-based architecture thought for addressing the text classification problem on small datasets. Our architecture is based on BERT equipped with one further layer using the sigmoid function. The hypothesis we want to verify is that by using a BERT-based architecture, the vectors' semantic learned by the BERT model can perform effective classification on small datasets without the use of data augmentation strategies. We observed improvements up to 14% in the accuracy and up to 23% in the f-score with respect to baseline classifiers exploiting data augmentation.

TitelJCDL' 20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020
Herausgeber (Verlag)Association of Computing Machinery
ISBN (elektronisch)9781450375856
ISBN (Print)9781450375856
PublikationsstatusVeröffentlicht - Aug. 2020
Veranstaltung2020 ACM/IEEE Joint Conference on Digital Libraries - Virtuell, China
Dauer: 1 Aug. 20205 Aug. 2020


Konferenz2020 ACM/IEEE Joint Conference on Digital Libraries
KurztitelJCDL 2020

ASJC Scopus subject areas

  • Ingenieurwesen (insg.)


Untersuchen Sie die Forschungsthemen von „A Neural-based Architecture For Small Datasets Classification“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren