research themes
data science and natural language processing toward critical informatics applications
trustworthy & interpretable machine learning
Responsible deployment of machine learning systems warrants careful consideration of potential failure modes and other limitations, such as hallucinations, opacity, and misalignment. Toward this end, recent work has examined uncertainty quantification (ICLR ‘26, arXiv ‘26) and obfuscated evil twin prompting (EMNLP ‘25, EMNLP ‘24) in large language models.
- SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass (Preprint, 2026)
- Estimating Semantic Alphabet Size for LLM Uncertainty Quantification (ICLR, 2026)
- Demystifying optimized prompts in language models (EMNLP, 2025)
- Prompts have evil twins (EMNLP, 2024)
public health data science & biomedical informatics
- Network analysis of U.S. non-fatal opioid-involved overdose journeys, 2018–2023 (Appl. Netw. Sci., 2024)
- cosasi: Graph Diffusion Source Inference in Python (JOSS, 2022)
- Surveillance nanotechnology for multi-organ cancer metastases (Nature BME, 2017)