User opinion modelling in conversations
2022 – Present
Mechanistic Interpretability
2022 – Present
Natural Language Processing
2018 – Present
Large Language Models, Interpretability
2018 – Present
Robustness, text perturbations
2018 – Present
Multilingual NLP
2018 – Present