I'm a first-year PhD student at the Center for Language and Speech Processing at Johns Hopkins University, advised by Benjamin Van Durme and Reno Kriz.
I study neural information retrieval models and the systems in which they are integrated. I aim to design retrieval systems that efficiently address complex information needs.
This summer, I'll be interning at Mixedbread AI as a Research Scientist.
Previously, I graduated from UT Austin with an M.S. in Computer Science and from Carnegie Mellon University with a B.S. in Artificial Intelligence, where I did retrieval research with Jamie Callan. I also interned at Jina AI, mentored by Bo Wang, to train their multilingual, multi-vector ColBERT retrieval model.
Publications
A Brief Comparison of Training-Free Multi-Vector Sequence Compression Methods
Rohan Jha
, Chunsheng Zuo
, Reno Kriz
, Benjamin Van Durme
1st Late Interaction Workshop @ ECIR 2026
|
paper
Multi-Vector Index Compression in Any Modality
Hanxiang Qin
, Alexander Martin
, Rohan Jha
, Chunsheng Zuo
, Reno Kriz
, Benjamin Van Durme
semantic-features: A User-Friendly Tool for Studying Contextual Word Embeddings in Interpretable Semantic Spaces
Jwalanthi Ranganathan
, Rohan Jha
, Kanishka Misra
, Kyle Mahowald
Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever
Rohan Jha
, Bo Wang
, Michael Günther
, Georgios Mastrapas
, Saba Sturua
, Isabelle Mohr
, Andreas Koukounas
, Mohammad Kalim Akram
, Nan Wang
, Han Xiao
Generalizable Tip-of-the-Tongue Retrieval with LLM Re-ranking
Luís Borges
, Rohan Jha
, Jamie Callan
, Bruno Martins
COILcr: Efficient Semantic Matching in Contextualized Exact Match Retrieval
Zhen Fan
, Luyu Gao
, Rohan Jha
, Jamie Callan