Saloni Potdar

Senior AI/ML Manager @ Apple

I am a Senior Applied Research Manager with 11 years of industry experience and 8 years of management experience. I work in the Siri and Search team at Apple, where I build Apple Intelligence features that power natural interactions across Siri, Spotlight, and Safari. My research interests lie at the intersection of generative AI, post-training of large language models for question answering, dialog systems, and knowledge graphs. I have 30+ publications in top-tier research conferences such as ACL, NAACL, EMNLP, and AAAI, along with 30+ patents and 1000+ citations.

Education

M.S. in Intelligent Information Systems

Carnegie Mellon University, Pittsburgh, USA

Advisor: Jamie Callan

Work Experience

Oct 2022 - Present

Apple Inc., Seattle, WA, USA

Senior AI/ML Manager, Question Answering

I manage and lead the applied research team for Knowledge Graph Machine Learning, where we build features that power interactions across Siri, Safari, and Spotlight Search. I currently work on question answering, semantic annotation, entity linking, and knowledge graphs. My team has shipped features including Knowledge Graph Question Answering; Answer Highlights; Query Understanding models for Siri, Safari, and Spotlight Search; Related People; Safari Highlights; and Apple Intelligence Summarization.

Feb 2015 - Sep 2022

IBM Watson, New York, NY, USA

Senior Staff Applied Scientist and Engineering Manager, Watson Assistant

I managed and led the applied research team that designed and developed algorithms for IBM's conversational AI product, Watson Assistant. I worked on the natural language understanding components of Watson Assistant, including intent classification, entity recognition, spellcheck, and irrelevance detection across multiple languages. The algorithms were designed to be custom-trained for customers globally, were deployed at scale with hundreds of thousands of models in production, and served more than 1.9% of the world's population every month.

Selected Honors and Awards

2024

Outstanding Paper Award at EMNLP 2024

Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs

2023

Women in AI Awards North America

Special Jury Recognition

2022

MIT 35 Innovators under 35 - Semi Finalist

https://www.technologyreview.com/

2022

Women in AI Awards Finalist - Rising Star

VentureBeat

2022

IBM Master Inventor

Title recognizing sustained and outstanding contributions to IP

2021

IBM Corporate Award

Outstanding technical contributions to the Watson Assistant product which resulted in high business impact

2021

IBM Outstanding Technical Achievement Award

State-of-the-art algorithms for intent classification and entity recognition in Watson Assistant

2020

IBM Research AI Accomplishment (A-level)

Meta-Learning for Low-Resource NLP

2020

IBM Corporate Award

Outstanding technical contributions for language enablement for Watson Services which resulted in high business impact

2019

Women in Leadership Program

I was one of the 25 women leaders across IBM selected for the fully funded eCornell certificate program

2018

IBM Corporate Award

Outstanding technical contributions to the Watson Conversation Service product which resulted in high business impact

2018

Best of IBM Award

Awarded to fewer than 1000 employees globally for contributions to IBM’s business

2017

IBM Eminence and Excellence Award

Awarded for exceptional contributions to IBM Watson

2016

IBM Outstanding Technical Achievement Award

Watson Conversation Service

2017, 2018, 2019, 2020, 2021, 2022

IBM Invention Achievement Awards

Publications

DEEPAMBIGQA: Ambiguous Multi-hop Questions for Benchmarking LLM Answer Completeness

Jiabao Ji , Min Li , Priyanshu Kumar , Shiyu Chang , Saloni Potdar

To appear in Findings of ACL. 2026.

PDF

Over-Searching in Search-Augmented Large Language Models

Roy Xie , Deepak Gopinath , David Qiu , Dong Lin , Haitian Sun , Saloni Potdar , Bhuwan Dhingra

EACL (Main Track). 2026.

PDF Code

Leveraging the Power of Large Language Models in Entity Linking via Adaptive Routing and Targeted Reasoning

Yajie Li , Albert Galimov , Mitra Datta Ganapaneni , Pujitha Thejaswi , De Meng , Priyanshu Kumar , Saloni Potdar

EMNLP (Industry Track). 2025.

PDF

AgREE: Agentic Reasoning for Knowledge Graph Completion on Emerging Entities

Ruocheng Zhao , Simone Conia , Eric Peng , Min Li , Saloni Potdar

arXiv preprint. 2025.

PDF

SemEval-2025 Task 2: Entity-Aware Machine Translation

Simone Conia , Min Li , Roberto Navigli , Saloni Potdar

SemEval. 2025.

PDF Code Data

mRAKL: Multilingual Retrieval-Augmented Knowledge Graph Construction for Low-Resourced Languages

Hellina Hailu Nigatu , Min Li , Maartje Ter Hoeve , Saloni Potdar , Sarah Chasins

Findings of ACL. 2025.

PDF

KG-TRICK: Unifying Textual and Relational Information Completion of Knowledge for Multilingual Knowledge Graphs

Zelin Zhou , Simone Conia , Daniel Lee , Min Li , Shenglei Huang , Umar Farooq Minhas , Saloni Potdar , Henry Xiao , Yunyao Li

COLING (Main Track). 2025.

PDF Code

Comprehensive Evaluation for a Large Scale Knowledge Graph Question Answering Service

Saloni Potdar , Daniel Lee , Omar Attia , Varun Embar , De Meng , Ramesh Balaji , Chloe Seivwright , Eric Choi , Mina H. Farid , Yiwen Sun , Yunyao Li

arXiv preprint. 2025.

PDF

Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs

Simone Conia , Daniel Lee , Min Li , Umar Farooq Minhas , Saloni Potdar , Yunyao Li

EMNLP (Main Track). 2024.

PDF Code Outstanding Paper

AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

Revanth Gangi Reddy , Omar Attia , Yunyao Li , Heng Ji , Saloni Potdar

EMNLP (Main Track). 2024.

PDF Code

ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA Datasets with Large Language Models

Ronak Pradeep , Daniel Lee , Ali Mousavi , Jeff Pound , Yisi Sang , Jimmy Lin , Ihab Ilyas , Saloni Potdar , Mostafa Arefiyan , Yunyao Li

EMNLP (Industry Track). 2024.

PDF

Entity Disambiguation via Fusion Entity Decoding

Junxiong Wang , Ali Mousavi , Omar Attia , Saloni Potdar , Alexander Rush , Umar Farooq Minhas , Yunyao Li

NAACL (Main Track). 2024.

PDF Video

Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs

Yanzhu Guo , Simone Conia , Zelin Zhou , Min Li , Saloni Potdar , Henry Xiao

ACL (Main Track). 2025.

PDF

Distinguish Sense from Nonsense: Out-of-Scope Detection for Virtual Assistants

Cheng Qian , Haode Qi , Gengyu Wang , Ladislav Kunc , Saloni Potdar

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2022.

PDF Video

Fast and Light-Weight Answer Text Retrieval in Dialogue Systems

Hui Wan , Siva Sankalp Patel , William Murdock , Saloni Potdar , Sachindra Joshi

NAACL (Industry Track). 2022.

PDF Video Code

Benchmarking Language-agnostic Intent Classification for Virtual Assistant Platforms

Gengyu Wang , Cheng Qian , Lin Pan , Haode Qi , Ladislav Kunc , Saloni Potdar

Proceedings of the Workshop on Multilingual Information Access (MIA). 2022.

PDF

Comparing Model Development Practices in B2B vs B2C Machine Learning Teams

Navneet Rao , Saloni Potdar

Workshop on Applied Machine Learning Management - Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2022.

PDF

Improved Text Classification via Contrastive Adversarial Training

Lin Pan , Chung-Wei Hang , Avirup Sil , Saloni Potdar

Proceedings of the AAAI Conference on Artificial Intelligence. 2022.

PDF

Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study

Xiangyang Mou , Chenghao Yang , Mo Yu , Bingsheng Yao , Xiaoxiao Guo , Saloni Potdar , Hui Su

Transactions of the Association for Computational Linguistics. 2021.

PDF

Benchmarking Commercial Intent Detection Services with Practice-Driven Evaluations

Haode Qi , Lin Pan , Atin Sood , Abhishek Shah , Ladislav Kunc , Mo Yu , Saloni Potdar

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers. 2021.

PDF Video

Multilingual BERT Post-Pretraining Alignment

Lin Pan , Chung-Wei Hang , Haode Qi , Abhishek Shah , Mo Yu , Saloni Potdar

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2021.

PDF Video

Frustratingly Hard Evidence Retrieval for QA Over Books

Xiangyang Mou , Mo Yu , Bingsheng Yao , Chenghao Yang , Xiaoxiao Guo , Saloni Potdar , Hui Su

Proceedings of the 1st Joint Workshop on Narrative Understanding, Storylines, and Events. 2020.

PDF

Diverse Few-Shot Text Classification with Multiple Metrics

Mo Yu , Xiaoxiao Guo , Jinfeng Yi , Shiyu Chang , Saloni Potdar , Gerald Tesauro , Haoyu Wang

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2018.

PDF Code

Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers

Haoyu Wang , Ming Tan , Mo Yu , Shiyu Chang , Dakuo Wang , Kun Xu , Xiaoxiao Guo , Saloni Potdar

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019.

PDF Code

Context-Aware Conversation Thread Detection in Multi-Party Chat

Ming Tan , Dakuo Wang , Yupeng Gao , Haoyu Wang , Saloni Potdar , Xiaoxiao Guo , Shiyu Chang , Mo Yu

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.

PDF Code

Out-of-Domain Detection for Low-Resource Text Classification Tasks

Ming Tan , Yang Yu , Haoyu Wang , Dakuo Wang , Saloni Potdar , Shiyu Chang , Mo Yu

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.

PDF Code

Identifying Student Leaders from MOOC Discussion Forums Through Language Influence

Seungwhan Moon , Saloni Potdar , Lara Martin

Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs. 2014.

PDF

Neural Models for Sequence Chunking

Feifei Zhai , Saloni Potdar , Bing Xiang , Bowen Zhou

AAAI Conference on Artificial Intelligence. 2017.

PDF

Robust Task Clustering for Deep Many-Task Learning

Mo Yu , Xiaoxiao Guo , Jinfeng Yi , Shiyu Chang , Saloni Potdar , Gerald Tesauro , Haoyu Wang , Bowen Zhou

arXiv preprint arXiv:1708.07918. 2017.

PDF

Patents

Configuring Artificial Intelligence-Based Virtual Assistants Using Response Modes

Matthew Richard Arnold , Eric Donald Wayne , Saloni Potdar

US Patent App. 18/085,257. 2024.

Pretraining of Split Layer Portions for Multilingual Model

Lin Pan , Haode Qi , Ladislav Kunc , Saloni Potdar

US Patent App. 18/063,788. 2024.

PDF

Detecting Out-of-Domain Text Data in Dialog Systems Using Artificial Intelligence

Cheng Qian , Haode Qi , Saloni Potdar , Ladislav Kunc

US Patent App. 17/897,887. 2024.

PDF

Out-of-Domain Sentence Detection

Haode Qi , Cheng Qian , Ladislav Kunc , Saloni Potdar , Eric Wayne

US Patent App. 17/815,630. 2024.

PDF

Conversational AI with Multilingual Human Chatlogs

Haode Qi , Lin Pan , Abhishek Shah , Ladislav Kunc , Saloni Potdar

US Patent 11,853,712. 2023.

PDF

Out-of-Domain Encoder Training

Ming Tan , Dakuo Wang , Mo Yu , Haoyu Wang , Yang Yu , Shiyu Chang , Saloni Potdar

US Patent 11,645,514. 2023.

PDF

Domain-Specific Model Compression

Haoyu Wang , Yang Yu , Ming Tan , Saloni Potdar

US Patent 11,620,435. 2023.

PDF

Artificial Intelligence-Based Context-Dependent Spellchecking

Panos Karagiannis , Ladislav Kunc , Saloni Potdar , Haoyu Wang , Navneet Rao

US Patent 11,301,626. 2022.

PDF

Intent Classification Distribution Calibration

Haoyu Wang , Ming Tan , Dakuo Wang , Chuang Gan , Saloni Potdar

US Patent 11,436,528. 2022.

PDF

Contextual Question Answering Using Human Chat Logs

Yang Yu , Ming Tan , Shasha Lin , Saloni Potdar

US Patent 11,443,117. 2022.

PDF

Routing Text Classifications Within a Cross-Domain Conversational Service

Ming Tan , Ladislav Kunc , Yang Yu , Haoyu Wang , Saloni Potdar

US Patent 11,270,077. 2022.

PDF

Evaluating Text Classification Anomalies Predicted by a Text Classification Model

Ming Tan , Saloni Potdar , Lakshminarayanan Krishnamurthy

US Patent 11,537,821. 2022.

PDF

Hybrid Model for Short Text Classification with Imbalanced Data

Yang Yu , Ming Tan , Ravi Nair , Haoyu Wang , Saloni Potdar

US Patent 11,328,221. 2022.

PDF

Unintended Bias Detection in Conversational Agent Platforms with Machine Learning Model

Navneet Rao , Ming Tan , Haode Qi , Yang Yu , Panos Karagiannis , Saloni Potdar

Google Patents. 2022.

Generating Question Answer Pairs

Dakuo Wang , Mo Yu , Chuang Gan , Saloni Potdar

US Patent App. 17/302,550. 2022.

PDF

Intent Classification Using Non-Correlated Features

Abhishek Shah , Ladislav Kunc , Haode Qi , Lin Pan , Saloni Potdar

US Patent App. 17/350,116. 2024.

PDF

Weak Supervised Abnormal Entity Detection

Haode Qi , Ming Tan , Yang Yu , Navneet Rao , Ladislav Kunc , Saloni Potdar

US Patent 11,423,227. 2022.

PDF

Intent Boundary Segmentation for Multi-Intent Utterances

Ming Tan , Haoyu Wang , Saloni Potdar , Yang Yu , Navneet Rao , Haode Qi

US Patent 11,308,944. 2022.

PDF

Mechanisms for Continuous Improvement of Automated Machine Learning

Haode Qi , Ming Tan , Ladislav Kunc , Saloni Potdar

US Patent 11,423,333. 2022.

PDF

Feature Reweighting in Text Classifier Generation Using Unlabeled Data

Yang Yu , Haode Qi , Haoyu Wang , Ming Tan , Navneet Rao , Saloni Potdar , Robert Yates

US Patent 11,216,619. 2022.

PDF

Suggestion of New Entity Types with Discriminative Term Importance Analysis

Haode Qi , Ming Tan , Yang Yu , Navneet Rao , Saloni Potdar , Haoyu Wang

US Patent 11,379,666. 2022.

PDF

Learning Parameter Sampling Configuration for Automated Machine Learning

Haode Qi , Ming Tan , Ladislav Kunc , Saloni Potdar

Google Patents. 2021.

PDF

Privacy Protection Through Template Embedding

Haode Qi , Saloni Potdar , Ming Tan , Navneet Rao

Google Patents. 2021.

PDF

Bias Detection in Conversational Agent Platforms

Navneet Rao , Ming Tan , Haode Qi , Yang Yu , Panos Karagiannis , Saloni Potdar

Google Patents. 2021.

PDF

Adversarial Training Data Augmentation Data for Text Classifiers

Ming Tan , Ruijian Wang , Inkit Padhi , Saloni Potdar

US Patent 11,093,707. 2021.

PDF

Displaying Text Classification Anomalies Predicted by a Text Classification Model

Ming Tan , Saloni Potdar , Lakshminarayanan Krishnamurthy

US Patent 11,068,656. 2021.

PDF

Updating an Online Multi-Domain Sentence Representation Generation Module of a Text Classification System

Ming Tan , Ladislav Kunc , Yang Yu , Haoyu Wang , Saloni Potdar

US Patent 11,120,225. 2021.

PDF

Cross-Domain Multi-Task Learning for Text Classification

Ming Tan , Haoyu Wang , Ladislav Kunc , Yang Yu , Saloni Potdar

US Patent 10,937,416. 2021.

PDF

Displaying Text Classification Anomalies Predicted by a Text Classification Model

Ming Tan , Saloni Potdar , Lakshminarayanan Krishnamurthy

US Patent 11,074,414. 2021.

PDF

Weighting Features for an Intent Classification System

Yang Yu , Ladislav Kunc , Haoyu Wang , Ming Tan , Saloni Potdar

US Patent 10,977,445. 2021.

PDF

Out-of-Domain Sentence Detection

Inkit Padhi , Ruijian Wang , Haoyu Wang , Saloni Potdar

US Patent 11,023,683. 2021.

PDF

Adversarial Training Data Augmentation for Generating Related Responses

Ming Tan , Ruijian Wang , Inkit Padhi , Saloni Potdar

US Patent 11,189,269. 2021.

PDF

Implementing Dynamic Confidence Rescaling with Modularity in Automatic User Intent Detection Systems

Yang Yu , Ladislav Kunc , Saloni Potdar

Google Patents. 2020.

PDF

Services

EMNLP Industry Track (Chair) 2025

Apple Ph.D. Fellowship Selection Committee (Information Retrieval and Knowledge Graph) 2023 - present

Apple Internal AIML Conference (Knowledge Bases and Search Track Chair) 2022 - present

ACL/NAACL/EMNLP ARR (Area Chair) 2025, 2024

EMNLP Industry Track (Area Chair) 2024

NAACL Industry Track (Reviewer) 2024

WAMLM KDD (Co-organizer) 2024

WAMLM KDD (Co-organizer) 2023

EMNLP Industry Track (Reviewer) 2023

ACL Industry Track (Reviewer) 2023

Web Conference Industry Track (Reviewer) 2023

EMNLP Industry Track (Reviewer) 2022

NAACL Industry Track (Reviewer) 2022

ACL/NAACL/EMNLP ARR (Reviewer) 2021

AAAI (Reviewer) 2021

Workshop for Women in Machine Learning (Reviewer) 2019

CCL (Reviewer) 2017

Collaborators and Interns

2026 - present

Zeyu Huang at Apple (AIML Scholar)

PhD, University of Edinburgh

2026 - present

Jiabao Ji at Apple

PhD, UC Santa Barbara

Jan 2025 - present

Roy Xie at Apple (AIML Scholar)

PhD, Duke University

Jan 2023 - present

Simone Conia at Apple

PhD, Sapienza NLP

Mar 2024 - present

Hellina Hailu Nigatu at Apple

PhD, University of California Berkeley

May 2024 - present

Yanzhu Guo at Apple

PhD, École Polytechnique Paris

Jul 2023 - present

Charlie (Zelin) Zhou at Apple

Masters, Shanghai Jiao Tong University

Jun 2023 - Nov 2024

Revanth Gangi Reddy at Apple

PhD, University of Illinois Urbana-Champaign

Jun 2023 - Nov 2024

Ronak Pradeep at Apple

PhD, University of Waterloo

Feb 2023 - Nov 2024

Daniel Lee at Apple

Bachelor, University of Calgary

Jun 2023 - Dec 2023

Junxiong Wang at Apple

PhD, Cornell University

Summer 2021

George Karagiannis at IBM

PhD, Cornell University

Summer 2020

Chenghao Yang at IBM

PhD, University of Chicago

Articles and Blogs

Jan 2026

Over-Searching in Search-Augmented Large Language Models

Apple Machine Learning Research

Oct 2025

Leveraging the Power of Large Language Models in Entity Linking via Adaptive Routing and Targeted Reasoning

Apple Machine Learning Research

Dec 2025

AgREE: Agentic Reasoning for Knowledge Graph Completion on Emerging Entities

Apple Machine Learning Research

Jul 2025

mRAKL: Multilingual Retrieval-Augmented Knowledge Graph Construction for Low-Resourced Languages

Apple Machine Learning Research

May 2025

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Apple Machine Learning Research

May 2025

Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs

Apple Machine Learning Research

Jul 2025

Name Translation for Machine Translation

Apple Machine Learning Research

Jan 2025

KG-TRICK: Unifying Textual and Relational Information Completion of Knowledge for Multilingual Knowledge Graphs

Apple Machine Learning Research

Oct 2024

Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs

Apple Machine Learning Research

Oct 2024

ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA Datasets with Large Language Models

Apple Machine Learning Research

Aug 2024

Spinning Configurable and Scalable Conversational Knowledge Graph QA Datasets with Large Language Models

Apple Machine Learning Research

Jun 2024

AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

Apple Machine Learning Research

Aug 2024

Entity Disambiguation via Fusion Entity Decoding

Apple Machine Learning Research

May 26 2021

Under the Hood - All the Natural Language Understanding Technology That Makes Watson Assistant Powerful

Medium - All the natural language understanding technology that makes Watson Assistant so powerful.

Apr 23 2021

AI Lifecycle for Virtual Assistants

Medium - While deploying virtual assistants, it is important to focus on building conversational experiences that work seamlessly.

Nov 14 2019

Why Zero-Effort Irrelevance is Relevant

Medium - How We Designed the New Zero-Effort Irrelevant Question Detection Feature in Watson Assistant

Jul 29 2019

A New State-of-the-Art Method for Relation Extraction

IBM Research - IBM Research AI and IBM Watson worked together to develop a promising method that achieves state-of-the-art performance on relation extraction.

Press

Aug 17 2023

Finalists and Special Jury Recognitions announced for Women in AI Awards North America 2023

Women in AI - Special Jury Award

Jul 10 2022

Putting more knowledge at the fingertips of non-English speakers

PrimeQA for Non-English Speakers

Jul 8 2022

Meet the nominees for the 2022 VentureBeat Women in AI Awards

VentureBeat - Rising Star Nominee

Apr 19 2021

5 reasons NLP for chatbots improves performance

TechTarget - Natural language processing takes chatbots from order takers to true conversational agents. Find out how NLP for chatbots advances interaction.

Dec 10 2020

Watson Assistant improves intent detection accuracy, leads against AI vendors cited in published study

IBM - Watson Assistant has a new and improved intent detection algorithm, which is more accurate versus commercial and open-source solutions.

Jul 15 2020

Announcing nominees for the second annual Women in AI Awards

VentureBeat - Rising Star Nominee