{Seminar} @ CDS: #102 : 05th January, 2024 : “Multimodality for Multilingual NLP: The need, an application, and open questions”

When

5 Jan 24    
3:00 PM - 4:00 PM

Event Type

Department of Computational and Data Sciences

Department Seminar


Speaker : Ms. Simran Khanuja
Title : “Multimodality for Multilingual NLP: The need, an application, and open questions ”
Date & Time : January 05, 2024, 03:00 PM

Venue : # 102, CDS Seminar Hall


ABSTRACT

Our world is multimodal and multilingual, demanding NLP technology to be reflective of the same. From a cognitive perspective, multimodality is intricately linked with multilingualism in the brain. From an application perspective, this would make models capable of interpreting and generating human-like responses. In this talk, we will first present a bird’s eye view on the benefits of multimodal modeling for multilingual NLP (the need). We will specifically focus on the visual and text modality, and discuss how real-world tasks would benefit from such models. Next, we will narrow down on our focus to one such application, that of translating images across cultures (an application). Specifically, we will discuss our recent work on assessing the state of generative AI to do the task, and highlight existing gaps and biases of these models. Finally, we will broaden the discussion to explore open research questions in this exciting field, and consider potential strategies for addressing these issues (open questions).

BIOGRAPHY

Simran Khanuja is a second-year PhD student at Carnegie Mellon University, working with Prof. Graham Neubig. Her research interests lie in multilingual, multimodal modeling, and downstream applications that would benefit from such models. Prior to this, she worked at Google and Microsoft Research in India for three years, contributing to the development of models and benchmarks for Indian languages.

Host Faculty: Dr. Danish Pruthi


ALL ARE WELCOME