Lucas H. McCabe

Data Science Fellow @ LMI. CS PhD Candidate @ GW GraphLab.

prof_pic.png

The overarching aim of my research is the development and implementation of mathematical data science and natural language processing techniques to support critical informatics applications. My current focus is trustworthiness in machine learning, work that has examined uncertainty quantification (ICLR ‘26, arXiv ‘26) and obfuscated evil twin prompts (EMNLP ‘25, EMNLP ‘24). I am also interested in interdisciplinary applications of data and network science. Please see my research themes and publications for more.

My PhD studies are supervised by H. Howie Huang, before which I completed my master’s at Johns Hopkins under the late Tom Woolf. I am grateful to have been recognized as a DARPA Riser, Luminary Awardee (LMI), and Bernstein Scholar (Institute for Quantitative Biomedicine, formerly “BioMaPS”).

selected research

  1. Preprint
    seneca_summary.png
    SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass
    L. H. McCabe, and H. H. Huang
    arXiv preprint arXiv:2605.00668, 2026
  2. ICLR
    esa_schema.png
    Estimating Semantic Alphabet Size for LLM Uncertainty Quantification
    L. H. McCabe, R. Melamed, T. Hartvigsen, and H. H. Huang
    In The Fourteenth International Conference on Learning Representations , 2026
  3. EMNLP
    ood-prompts-zipf.png
    Demystifying optimized prompts in language models
    R. Melamed, L. H. McCabe, and H. H. Huang
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing , Nov 2025
  4. Appl. Netw. Sci.
    overdose-network-map.png
    Network analysis of U.S. non-fatal opioid-involved overdose journeys, 2018–2023
    L. H. McCabe, N. Masuda, S. Casillas, N. Danneman, A. Alic, and R. Law
    Applied Network Science, Nov 2024
  5. EMNLP
    propane.png
    Prompts have evil twins
    R. Melamed, L. H. McCabe, T. Wakhare, Y. Kim, H. H. Huang, and E. Boix-Adsera
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , Nov 2024