Are We Really Making Much Progress in Text Classification? A Comparative Review Paper • 2204.03954 • Published Apr 8, 2022
Efficient Continual Learning for Small Language Models with a Discrete Key-Value Bottleneck Paper • 2412.08528 • Published Dec 11, 2024
CRAWLDoc: A Dataset for Robust Ranking of Bibliographic Documents Paper • 2506.03822 • Published Jun 4 • 2
GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding Paper • 2311.09707 • Published Nov 16, 2023
Transformers are Short Text Classifiers: A Study of Inductive Short Text Classifiers on Benchmarks and Real-world Datasets Paper • 2211.16878 • Published Nov 30, 2022