Also see my Semantic Scholar and Google Scholar profiles.
† indicates core contributors
2024
-
OLMoE: Open Mixture-of-Experts Language Models
Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Jacob Daniel Morrison, and 19 more authors
arXiv preprint, 2024
-
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, and 38 more authors
ACL, 2024
🏆 Best Theme Paper
🏆 Geekwire Innovation of the Year
-
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Luca Soldaini†, Rodney Kinney†, Akshita Bhagia†, Dustin Schwenk†, David Atkinson, and 31 more authors
ACL, 2024
🏆 Best Resource Paper
-
What’s In My Big Data?
Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, and 8 more authors
ICLR, 2024
Spotlight
2023
-
Paloma: A Benchmark for Evaluating Language Model Fit
Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, A. Jha, and 11 more authors
In submission, 2023
-
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets
Dirk Groeneveld, Anas Awadalla, Iz Beltagy, Akshita Bhagia, Ian Magnusson, and 5 more authors
arXiv preprint, 2023
-
Robust Tooling and New Resources for Large Language Model Evaluation via Catwalk (extended abstract)
Kyle Richardson, Ian Magnusson, Oyvind Tafjord, Akshita Bhagia, Iz Beltagy, and 8 more authors
-
HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation
Hamish Ivison, Akshita Bhagia, Yizhong Wang, Hannaneh Hajishirzi, and Matthew Peters
ACL, 2023
2022
-
Continued Pretraining for Better Zero- and Few-Shot Promptability
Zhaofeng Wu, Robert L. Logan IV, Pete Walsh, Akshita Bhagia, Dirk Groeneveld, and 2 more authors
EMNLP, 2022
-
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization
Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W. Black, and 1 more author
Findings of EMNLP, 2022