Publications

Select publications only. See my Google Scholar profile for a full list of publications.

2025

  1. LLM
    consistency.png
    Consistency in Language Models: Current Landscape, Challenges, and Future Directions
    Jekaterina Novikova, Carol Anderson, Borhane Blili-Hamelin, and 2 more authors
    arXiv preprint arXiv:2505.00268, 2025
  2. VLM
    kaleidoscope.png
    Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation
    Israfel Salazar, Manuel Fernández Burda, Shayekh Bin Islam, and 42 more authors
    2025
  3. Eval
    Human-Centered Evaluation and Auditing of Language Models
    Yu Lu Liu, Wesley Hanwen Deng, Michelle S Lam, and 6 more authors
    In Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025

2024

  1. LLM
    include3.png
    INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
    Angelika Romanou, Negar Foroutan, Anna Sotnikova, and 54 more authors
    In The Thirteenth International Conference on Learning Representations, 2024

2023

  1. LLM
    bloom.png
    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
    BigScience Workshop, :, Teven Le Scao, and 391 more authors
    2023
  2. Eval
    bigbench.png
    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
    Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, and 447 more authors
    Transactions on Machine Learning Research, 2023

2021

  1. ML for Health
    comparing.jpg
    Comparing pre-trained and feature-based models for prediction of Alzheimer’s disease based on speech
    Aparna Balagopalan, Benjamin Eyre, Jessica Robin, and 2 more authors
    Frontiers in aging neuroscience, 2021
  2. ML for Health
    acoustic.png
    Comparing Acoustic-Based Approaches for Alzheimer’s Disease Detection
    Aparna Balagopalan and Jekaterina Novikova
    In Interspeech 2021, 2021

2020

  1. NLG
    e2e_nlg.png
    Evaluating the state-of-the-art of End-to-End Natural Language Generation: The E2E NLG challenge
    Ondřej Dušek, Jekaterina Novikova, and Verena Rieser
    Comput. Speech Lang., Jan 2020
  2. ML for Health
    bert_ad.png
    To BERT or not to BERT: Comparing Speech and Language-Based Approaches for Alzheimer’s Disease Detection
    Aparna Balagopalan, Benjamin Eyre, Frank Rudzicz, and 1 more author
    In Interspeech 2020, Jan 2020

2019

  1. ML for Health
    cn_ad.png
    Detecting cognitive impairments by agreeing on interpretations of linguistic features
    Zining Zhu, Jekaterina Novikova, and Frank Rudzicz
    In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Jun 2019

2018

  1. NLG
    e2e_nlg_find.png
    Findings of the E2E NLG Challenge
    Ondřej Dušek, Jekaterina Novikova, and Verena Rieser
    In Proceedings of the 11th International Conference on Natural Language Generation, Nov 2018
  2. NLG
    rankme.png
    RankME: Reliable Human Ratings for Natural Language Generation
    Jekaterina Novikova, Ondřej Dušek, and Verena Rieser
    In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), Jun 2018

2017

  1. Eval
    nlg_eval.png
    Why We Need New Evaluation Metrics for NLG
    Jekaterina Novikova, Ondřej Dušek, Amanda Cercas Curry, and 1 more author
    In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Sep 2017
  2. NLG
    e2e_new_chal.png
    The E2E Dataset: New Challenges For End-to-End Generation
    Jekaterina Novikova, Ondřej Dušek, and Verena Rieser
    In Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue, Aug 2017

2016

  1. NLG
    crowd_nlg.png
    Crowd-sourcing NLG Data: Pictures Elicit Better Data.
    Jekaterina Novikova, Oliver Lemon, and Verena Rieser
    In Proceedings of the 9th International Natural Language Generation conference, Sep 2016

2015

  1. HRI
    art_emot.png
    Towards Artificial Emotions to Assist Social Coordination in HRI
    Jekaterina Novikova and Leon Watts
    International Journal of Social Robotics, Sep 2015

2014

  1. HRI
    robot_emot.png
    A Design Model of Emotional Body Expressions in Non-humanoid Robots
    Jekaterina Novikova and Leon Watts
    In Proceedings of the second international conference on Human-agent interaction, Sep 2014