Lucas H. McCabe

The overarching aim of my research is the development and implementation of mathematical data science and natural language processing techniques to support critical informatics applications. My current focus is trustworthiness in machine learning, work that has examined uncertainty quantification (ICLR ‘26, arXiv ‘26) and obfuscated evil twin prompts (EMNLP ‘25, EMNLP ‘24). I am also interested in interdisciplinary applications of data and network science. Please see my research themes and works for more.

My PhD studies are supervised by H. Howie Huang, before which I completed my master’s at Johns Hopkins under the late Tom Woolf. I am grateful to have been recognized as a DARPA Riser, Luminary Awardee (LMI), and Bernstein Scholar (Institute for Quantitative Biomedicine, formerly “BioMaPS”).

selected research

Preprint
SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass

L. H. McCabe, and H. H. Huang

arXiv preprint arXiv:2605.00668, 2026

Abs Bib HTML PDF

Discrete entropy estimation is a classic information theory problem, wherein the average information content of a discrete random variable is estimated from samples alone. Naive approaches, such as the plugin method, fail to account for the probability mass associated with members of the random variable’s support that are unobserved in a given sample, known as the "missing mass." The resulting systemic underestimation is particularly problematic when data is time-consuming or costly to gather. We propose SENECA, an entropy estimation scheme based on a novel “self-consistent” missing mass calculation. Extensive numerical experiments indicate that our approach outperforms many state-of-the-art alternatives overall in the small-sample setting. We then apply SENECA to two practical use cases, namely biodiversity estimation and the detection of incorrect large language model responses, where our method is competitive with domain-specific approaches. Our work advances SENECA as an effective drop-in replacement for small-sample entropy estimation, with broad utility across several domains.
@article{mccabe2026seneca, title = {SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass}, author = {McCabe, L. H. and Huang, H. H.}, journal = {arXiv preprint arXiv:2605.00668}, year = {2026}, rtheme = {trustworthyml} }
ICLR
Estimating Semantic Alphabet Size for LLM Uncertainty Quantification

L. H. McCabe, R. Melamed, T. Hartvigsen, and H. H. Huang

In The Fourteenth International Conference on Learning Representations , 2026

Abs Bib PDF Code

Many black-box techniques for quantifying the uncertainty of large language models (LLMs) rely on repeated LLM sampling, which can be computationally expensive. Therefore, practical applicability demands reliable estimation from few samples. Semantic entropy (SE) is a popular sample-based uncertainty estimator with a discrete formulation attractive for the black-box setting. Recent extensions of SE exhibit improved LLM hallucination detection, but do so with less interpretable methods that admit additional hyperparameters. For this reason, we revisit the canonical discrete semantic entropy (DSE) estimator, finding that it underestimates the "true" semantic entropy, as expected from theory. We propose a modified semantic alphabet size estimator, and illustrate that using it to adjust DSE for sample coverage results in more accurate SE estimation in our setting of interest. Furthermore, we find that two semantic alphabet size estimators, including our proposed, flag incorrect LLM responses as well or better than many top-performing alternatives, with the added benefit of remaining highly interpretable.
@inproceedings{mccabe2026estimatingsemanticalphabetsize, title = {Estimating Semantic Alphabet Size for LLM Uncertainty Quantification}, author = {McCabe, L. H. and Melamed, R. and Hartvigsen, T. and Huang, H. H.}, year = {2026}, booktitle = {The Fourteenth International Conference on Learning Representations}, rtheme = {trustworthyml} }
EMNLP
Demystifying optimized prompts in language models

R. Melamed, L. H. McCabe, and H. H. Huang

In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing , Nov 2025

Abs Bib HTML PDF Code

Modern language models (LMs) are not robust to out-of-distribution inputs. Machine generated ("optimized") prompts can be used to modulate LM outputs and induce specific behaviors while appearing completely uninterpretable. In this work, we investigate the composition of optimized prompts, as well as the mechanisms by which LMs parse and build predictions from optimized prompts. We find that optimized prompts primarily consist of punctuation and noun tokens which are more rare in the training data. Internally, optimized prompts are clearly distinguishable from natural language counterparts based on sparse subsets of the model’s activations. Across various families of instruction-tuned models, optimized prompts follow a similar path in how their representations form through the network.
@inproceedings{melamed2025demystifying, title = {Demystifying optimized prompts in language models}, author = {Melamed, R. and McCabe, L. H. and Huang, H. H.}, editor = {Christodoulopoulos, Christos and Chakraborty, Tanmoy and Rose, Carolyn and Peng, Violet}, booktitle = {Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing}, month = nov, year = {2025}, address = {Suzhou, China}, publisher = {Association for Computational Linguistics}, doi = {10.18653/v1/2025.emnlp-main.147}, pages = {2983--2999}, isbn = {979-8-89176-332-6}, rtheme = {trustworthyml}, month_numeric = {11} }
Appl. Netw. Sci.
Network analysis of U.S. non-fatal opioid-involved overdose journeys, 2018–2023

L. H. McCabe, N. Masuda, S. Casillas, N. Danneman, A. Alic, and R. Law

Applied Network Science, Nov 2024

Abs Bib HTML PDF

We present a nation-wide network analysis of non-fatal opioid-involved overdose journeys in the United States. Leveraging a unique proprietary dataset of Emergency Medical Services incidents, we construct a journey-to-overdose geospatial network capturing nearly half a million opioid-involved overdose events spanning 2018-2023. We analyze the structure and sociological profile of the nodes, which are counties or their equivalents, characterize the distribution of overdose journey lengths, and investigate changes in the journey network between 2018 and 2023. Our findings include that authority and hub nodes identified by the HITS algorithm tend to be located in urban areas and involved in overdose journeys with particularly long geographical distances.
@article{mccabe2024overdosenetworks, title = {Network analysis of U.S. non-fatal opioid-involved overdose journeys, 2018–2023}, author = {McCabe, L. H. and Masuda, N. and Casillas, S. and Danneman, N. and Alic, A. and Law, R.}, journal = {Applied Network Science}, month = nov, year = {2024}, doi = {https://doi.org/10.1007/s41109-024-00661-z}, rtheme = {health}, month_numeric = {11} }
EMNLP
Prompts have evil twins

R. Melamed, L. H. McCabe, T. Wakhare, Y. Kim, H. H. Huang, and E. Boix-Adsera

In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , Nov 2024

Abs Bib HTML PDF Code

We discover that many natural-language prompts can be replaced by corresponding prompts that are unintelligible to humans but that provably elicit similar behavior in language models. We call these prompts “evil twins” because they are obfuscated and uninterpretable (evil), but at the same time mimic the functionality of the original natural-language prompts (twins). Remarkably, evil twins transfer between models. We find these prompts by solving a maximum-likelihood problem which has applications of independent interest.
@inproceedings{melamed2023propane, title = {Prompts have evil twins}, author = {Melamed, R. and McCabe, L. H. and Wakhare, T. and Kim, Y. and Huang, H. H. and Boix-Adsera, E.}, editor = {Al-Onaizan, Yaser and Bansal, Mohit and Chen, Yun-Nung}, booktitle = {Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing}, month = nov, year = {2024}, address = {Miami, Florida, USA}, publisher = {Association for Computational Linguistics}, pages = {46--74}, rtheme = {trustworthyml}, month_numeric = {11} }