Kevin Duh at JHU

Contact info:
Johns Hopkins University, HLTCOE
Stieff Building / 810 Wyman Park Drive
Baltimore, MD 21211-2840, USA
Email: x@cs.jhu.edu where x=kevinduh

Hi! I am a senior research scientist at the Johns Hopkins University Human Language Technology Center of Excellence (HLTCOE). I am also an associate research professor in the Department of Computer Science and a member of the Center for Language and Speech Processing (CLSP). Previously, I was an assistant professor at the Nara Institute of Science and Technology (2012-2015) and a research associate at NTT CS Labs (2009-2012). I received my B.S. in 2003 from Rice University and my PhD in 2009 from the University of Washington, both in Electrical Engineering. My research interests lie at the intersection of Natural Language Processing and Machine Learning, particularly in areas relating to machine translation and multilingual applications.

Current Doctoral Advisees: Neha Verma (co-advised with Kenton Murray), Sophia Hager (co-advised with Nick Andrews)
Previous lab members: Xuan Zhang (Meta), Suzanna Sia (Hyundai), Jeremy Gwinnup (AFRL), Shuo Sun (Institute for Infocomm Research), Mitchell Gordon (Startup: Latitude), Pamela Shapiro (Comcast), Sheng Zhang (Microsoft, co-advised with Ben Van Durme), Muhammad Rahman (Children's National Hospital / GWU), Sorami Hisamoto (Startup: MIERUNE), Hiroki Ouchi (NAIST), Fei Cheng (Kyoto University), Xiaoyi Wu, Frances Yung (Saarland University), Masashi Tsubaki (AIST), Xiaodong Liu (Microsoft), Yanyan Luo (Baidu).

Selected Publications [click here for full list, or Google Scholar profile]:

Linguistic Nepotism: Trading-off Quality for Language Preference in Multilingual RAG (ICML2026 spotlight)
Where does In-context Learning Happen in Large Language Models? (NeurIPS2024)
An Extensive Exploration of Back-Translation in 60 Languages (ACL-Findings2023)
Bilingual Lexicon Induction for Low-Resource Languages using Graph Matching via Optimal Transport (EMNLP2022)
Data and Parameter Scaling Laws for Neural Machine Translation (EMNLP2021)
Reproducible and Efficient Benchmarks for Hyperparameter Optimization of NMT Systems (TACL2020)

Quick Links:
- JSALT 2024 LLM Tutorial
- EACL 2023 AutoML for NLP tutorial

To prospective students: Thanks for your interest! I am not able to reply to inquiries individually, due to the large volume of email I receive ("email event horizon"). If you are seeking admission, please refer to the official CS page. I am currently not accepting internship students (including self-funded ones).