Research

I have attached links to the papers, code, and data where applicable. If anything that you want is missing, please email me.

Research Themes

2025

  1. MedScore: Generalizable Factuality Evaluation of Free-Form Medical Answers by Domain-adapted Claim Decomposition and Verification
    Heyuan Huang, Alexandra DeLucia, Vijay Murari Tiyyala, and 1 more author
    Oct 2025
    Health AI Evaluation
  2. MedExpert: An Expert-Annotated Dataset for Medical Chatbot Evaluation
    Mahsa Yarmohammadi, Alexandra DeLucia, Lillian C Chen, and 8 more authors
    In Machine Learning for Health 2025, 2025
    Health AI Annotation Dataset

2025

  1. Can One Size Fit All?: Measuring Failure in Multi-Document Summarization Domain Transfer
    Alexandra DeLucia and Mark Dredze
    Jul 2025
    LLMs Evaluation

2024

  1. Using Natural Language Inference to Improve Persona Extraction from Dialogue in a New Domain
    Alexandra DeLucia, Mengjie Zhao, Yoshinori Maeda, and 3 more authors
    arXiv preprint arXiv:2401.06742, 2024
    LLMs Decoding
  2. Anti-LM Decoding for Zero-shot In-context Machine Translation
    Suzanna Sia, Alexandra DeLucia, and Kevin Duh
    In Findings of the Association for Computational Linguistics: NAACL 2024, Jun 2024
    LLMs

2023

  1. Common Law Annotations: Investigating the Stability of Dialog System Output Annotations
    Seunggun Lee, Alexandra DeLucia, Nikita Nangia, and 9 more authors
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
    LLMs Annotation Evaluation
  2. Strength in Numbers: Estimating Confidence of Large Language Models by Prompt Agreement
    Gwenyth Portillo Wightman, Alexandra DeLucia, and Mark Dredze
    In Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023), Jul 2023
    LLMs

2021

  1. Decoding Methods for Neural Narrative Generation
    Alexandra DeLucia, Aaron Mueller, Xiang Lisa Li, and 1 more author
    In Proceedings of the First Workshop on Natural Language Generation, Evaluation, and Metrics (GEM), Aug 2021
    LLMs Evaluation Decoding Annotation

2023

  1. A Multi-instance Learning Approach to Civil Unrest Event Detection on Twitter
    Alexandra DeLucia, Mark Dredze, and Anna L. Buczak
    In Proceedings of the 6th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, Sep 2023
    Crisis Informatics Social Media
  2. Geo-Seq2seq: Twitter User Geolocation on Noisy Data through Sequence to Sequence Learning
    Jingyu Zhang, Alexandra DeLucia, Chenyu Zhang, and 1 more author
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
    Crisis Informatics Social Media Decoding

2022

  1. Changes in Tweet Geolocation over Time: A Study with Carmen 2.0
    Jingyu Zhang, Alexandra DeLucia, and Mark Dredze
    In Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022), Oct 2022
    Crisis Informatics Social Media

2021

  1. Study of Manifestation of Civil Unrest on Twitter
    Abhinav Chinta, Jingyu Zhang, Alexandra DeLucia, and 2 more authors
    In Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021), Nov 2021
    Crisis Informatics Social Media Dataset

2025

  1. Histories of Daily Cannabis Use Patterns and Reported Negative Effects: A Content Analysis of Three Cannabis-Centric Communities on Reddit
    Savannah Brenneke, Eloise Estrada, Epifania Ortiz, and 4 more authors
    Drug and Alcohol Dependence, 2025
    Public Health Reddit Social Media
  2. Leveraging Cannabis-Centric Reddit Communities to Explore Patterns of Cannabis Product, Modes and Frequency of Use between 2010-2019
    Savannah Brenneke, Alexandra DeLucia, Gwenyth Portillo Wightman, and 3 more authors
    Drug and Alcohol Dependence, 2025
    Public Health Reddit Social Media

2024

  1. First-Hand Accounts of Structural Stigma toward People Who Use Opioids on Reddit
    Evan L Eschliman, Karen Choe, Alexandra DeLucia, and 6 more authors
    Social Science & Medicine, 2024
    Public Health Annotation Reddit Social Media

2023

  1. R/AskAComputerScientist: Processing Reddit Data for the Social Sciences
    Savannah Brenneke, Meredith Meacham, Amanda Bunting, and 2 more authors
    In 85th Annual Scientific Meeting of College on Problems of Drug Dependence, Jul 2023
    Public Health Reddit Social Media
  2. Automated Discovery of Perceived Health-related Concerns about E-cigarettes from Reddit
    Alexandra DeLucia, Adam Poliak, Zechariah Zu, and 5 more authors
    In 29th Annual Meeting of the Society for Research on Nicotine and Tobacco, Mar 2023
    Public Health Reddit Social Media

2020

  1. Analyzing Hpc Support Tickets: Experience and Recommendations
    Alexandra DeLucia and Elisabeth Moore
    arXiv preprint arXiv:2010.04321, 2020
    Systems

2018

  1. Modeling High Performance Computing System Log Messages for Early Prediction of Job Outcome
    Alexandra DeLucia and Elisabeth Moore
    2018
    Systems
  2. Work in Progress: Topic Modeling for HPC Job State Prediction
    Alexandra DeLucia and Elisabeth Baseman
    In Proceedings of the First Workshop on Machine Learning for Computing Systems, 2018
    Systems

2017

  1. High Performance Computing Job Outcome Prediction by Mining System Logs
    Alexandra DeLucia and Elisabeth Baseman
    2017
    Systems
  2. Markov Chain Modeling for Anomaly Detection in High Performance Computing System Logs
    Abida Haque, Alexandra DeLucia, and Elisabeth Baseman
    In Proceedings of the Fourth International Workshop on HPC User Support Tools, Nov 2017
    Systems

All Publications

2025

  1. Histories of Daily Cannabis Use Patterns and Reported Negative Effects: A Content Analysis of Three Cannabis-Centric Communities on Reddit
    Savannah Brenneke, Eloise Estrada, Epifania Ortiz, and 4 more authors
    Drug and Alcohol Dependence, 2025
    Public Health Reddit Social Media
  2. Leveraging Cannabis-Centric Reddit Communities to Explore Patterns of Cannabis Product, Modes and Frequency of Use between 2010-2019
    Savannah Brenneke, Alexandra DeLucia, Gwenyth Portillo Wightman, and 3 more authors
    Drug and Alcohol Dependence, 2025
    Public Health Reddit Social Media
  3. Can One Size Fit All?: Measuring Failure in Multi-Document Summarization Domain Transfer
    Alexandra DeLucia and Mark Dredze
    Jul 2025
    LLMs Evaluation
  4. MedScore: Generalizable Factuality Evaluation of Free-Form Medical Answers by Domain-adapted Claim Decomposition and Verification
    Heyuan Huang, Alexandra DeLucia, Vijay Murari Tiyyala, and 1 more author
    Oct 2025
    Health AI Evaluation
  5. MedExpert: An Expert-Annotated Dataset for Medical Chatbot Evaluation
    Mahsa Yarmohammadi, Alexandra DeLucia, Lillian C Chen, and 8 more authors
    In Machine Learning for Health 2025, 2025
    Health AI Annotation Dataset

2024

  1. Using Natural Language Inference to Improve Persona Extraction from Dialogue in a New Domain
    Alexandra DeLucia, Mengjie Zhao, Yoshinori Maeda, and 3 more authors
    arXiv preprint arXiv:2401.06742, 2024
    LLMs Decoding
  2. First-Hand Accounts of Structural Stigma toward People Who Use Opioids on Reddit
    Evan L Eschliman, Karen Choe, Alexandra DeLucia, and 6 more authors
    Social Science & Medicine, 2024
    Public Health Annotation Reddit Social Media
  3. Anti-LM Decoding for Zero-shot In-context Machine Translation
    Suzanna Sia, Alexandra DeLucia, and Kevin Duh
    In Findings of the Association for Computational Linguistics: NAACL 2024, Jun 2024
    LLMs

2023

  1. R/AskAComputerScientist: Processing Reddit Data for the Social Sciences
    Savannah Brenneke, Meredith Meacham, Amanda Bunting, and 2 more authors
    In 85th Annual Scientific Meeting of College on Problems of Drug Dependence, Jul 2023
    Public Health Reddit Social Media
  2. A Multi-instance Learning Approach to Civil Unrest Event Detection on Twitter
    Alexandra DeLucia, Mark Dredze, and Anna L. Buczak
    In Proceedings of the 6th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, Sep 2023
    Crisis Informatics Social Media
  3. Automated Discovery of Perceived Health-related Concerns about E-cigarettes from Reddit
    Alexandra DeLucia, Adam Poliak, Zechariah Zu, and 5 more authors
    In 29th Annual Meeting of the Society for Research on Nicotine and Tobacco, Mar 2023
    Public Health Reddit Social Media
  4. Common Law Annotations: Investigating the Stability of Dialog System Output Annotations
    Seunggun Lee, Alexandra DeLucia, Nikita Nangia, and 9 more authors
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
    LLMs Annotation Evaluation
  5. The SIGMORPHON 2022 Shared Task on Cross-lingual and Low-Resource Grapheme-to-Phoneme Conversion
    Arya D. McCarthy, Jackson L. Lee, Alexandra DeLucia, and 7 more authors
    In Proceedings of the 20th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Jul 2023
  6. Strength in Numbers: Estimating Confidence of Large Language Models by Prompt Agreement
    Gwenyth Portillo Wightman, Alexandra DeLucia, and Mark Dredze
    In Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023), Jul 2023
    LLMs
  7. Geo-Seq2seq: Twitter User Geolocation on Noisy Data through Sequence to Sequence Learning
    Jingyu Zhang, Alexandra DeLucia, Chenyu Zhang, and 1 more author
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
    Crisis Informatics Social Media Decoding

2022

  1. Bernice: A Multilingual Pre-trained Encoder for Twitter
    Alexandra DeLucia, Shijie Wu, Aaron Mueller, and 3 more authors
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Dec 2022
    Social Media
  2. Changes in Tweet Geolocation over Time: A Study with Carmen 2.0
    Jingyu Zhang, Alexandra DeLucia, and Mark Dredze
    In Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022), Oct 2022
    Crisis Informatics Social Media

2021

  1. Study of Manifestation of Civil Unrest on Twitter
    Abhinav Chinta, Jingyu Zhang, Alexandra DeLucia, and 2 more authors
    In Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021), Nov 2021
    Crisis Informatics Social Media Dataset
  2. Decoding Methods for Neural Narrative Generation
    Alexandra DeLucia, Aaron Mueller, Xiang Lisa Li, and 1 more author
    In Proceedings of the First Workshop on Natural Language Generation, Evaluation, and Metrics (GEM), Aug 2021
    LLMs Evaluation Decoding Annotation

2020

  1. Analyzing Hpc Support Tickets: Experience and Recommendations
    Alexandra DeLucia and Elisabeth Moore
    arXiv preprint arXiv:2010.04321, 2020
    Systems
  2. Civil Unrest on Twitter (CUT): A Dataset of Tweets to Support Research on Civil Unrest
    Justin Sech, Alexandra DeLucia, Anna L. Buczak, and 1 more author
    In Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020), Nov 2020
    Social Media Dataset Annotation

2018

  1. Modeling High Performance Computing System Log Messages for Early Prediction of Job Outcome
    Alexandra DeLucia and Elisabeth Moore
    2018
    Systems
  2. Work in Progress: Topic Modeling for HPC Job State Prediction
    Alexandra DeLucia and Elisabeth Baseman
    In Proceedings of the First Workshop on Machine Learning for Computing Systems, 2018
    Systems

2017

  1. High Performance Computing Job Outcome Prediction by Mining System Logs
    Alexandra DeLucia and Elisabeth Baseman
    2017
    Systems
  2. Markov Chain Modeling for Anomaly Detection in High Performance Computing System Logs
    Abida Haque, Alexandra DeLucia, and Elisabeth Baseman
    In Proceedings of the Fourth International Workshop on HPC User Support Tools, Nov 2017
    Systems

2015

  1. Self-Driven Service Learning: Community-Student-Faculty Collaboratives Outside of the Classroom
    Verónica A. Segarra, Alexandra A. DeLucia, Alyssa A. DeLucia, and 16 more authors
    Journal of Microbiology & Biology Education, Dec 2015