research themes
data science and natural language processing toward critical informatics applications
trustworthy & interpretable machine learning
Responsible deployment of machine learning systems warrants careful consideration of potential failure modes and other limitations, such as hallucinations, opacity, and misalignment. Toward this end, recent work has examined uncertainty quantification (ICLR ‘26, arXiv ‘26) and obfuscated evil twin prompting (EMNLP ‘25, EMNLP ‘24) in large language models.
- SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass (Preprint 2026)
- Estimating Semantic Alphabet Size for LLM Uncertainty Quantification (ICLR 2026)
- Demystifying optimized prompts in language models (EMNLP 2025)
- Prompts have evil twins (EMNLP 2024)
public health data science & biomedical informatics
- Network analysis of U.S. non-fatal opioid-involved overdose journeys, 2018–2023 (Appl. Netw. Sci. 2024)
- cosasi: Graph Diffusion Source Inference in Python (JOSS 2022)
- Surveillance nanotechnology for multi-organ cancer metastases (Nature BME 2017)