External links:
This page contains select publications from my group. See the complete list in my Curriculum Vitae
2024
Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective
Ethan Harvey, Mikhail Petrov, and Michael C. Hughes
Workshop on Fine-Tuning in Modern Machine Learning (FITML), co-located with NeurIPS 2024
Validation of System for Automated Screening for Aortic Stenosis
Samuel Karmiy, Zhe Huang, Eileen Mai, Jing Li, Monica Dehn, Davinder Ramsingh, John Martin, Ayan R. Patel, Michael C. Hughes, and Benjamin S. Wessler
Abstract accepted for poster presentation at American College of Cardiology (ACC)
Spatiotemporal Forecasting of Opioid-related Fatal Overdoses: Towards Best Practices for Modeling and Evaluation
Kyle Heuton, Jyontika Kapoor, Shikhar Shrestha, Thomas J Stopka, Michael C. Hughes
American Journal of Epidemiology
SINCERE: Supervised Information Noise-Contrastive Estimation REvisited
Patrick Feeney and Michael C. Hughes
unpublished preprint, 2024
• [arXiv]
InterLUDE: Interactions between Labeled and Unlabeled Data to Enhance Semi-Supervised Learning
Zhe Huang, Xiaowei Yu, Dajiang Zhu, and Michael C. Hughes
International Conference on Machine Learning (ICML), 2024
Systematic comparison of semi-supervised and self-supervised learning for medical image classification
Zhe Huang, Ruijie Jiang, Shuchin Aeron, and Michael C. Hughes
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Transfer Learning with Informative Priors: Simple Baselines Better than Previously Reported
Ethan Harvey, Mikhail Petrov, and Michael C. Hughes
Transactions on Machine Learning Research (TMLR)
Semi-Supervised Multimodal Multi-Instance Learning for Aortic Stenosis Diagnosis
Zhe Huang, Xiaowei Yu, Benjamin S. Wessler, Michael C. Hughes
unpublished preprint, 2024
• [arXiv]
A Neurosymbolic Cognitive Architecture Framework for Handling Novelties in Open Worlds
Shivam Goel, Panagiotis Lymperopoulos, Ravenna Thielstrom, Evan Krause, Patrick Feeney, Pierrick Lorang, Sarah Schneider, Yichen Wei, Eric Kildebeck, Stephen Goss, Michael C. Hughes, Liping Liu, Jivko Sinapov and Matthias Scheutz
Artificial Intelligence, 2024
2023
A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data
Ethan Harvey, Wansu Chen, David M. Kent, and Michael C. Hughes
Machine Learning for Health (ML4H), 2023
Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning
Zhe Huang, Benjamin S. Wessler, and Michael C. Hughes
Machine Learning for Healthcare Conference (MLHC), 2023
[PDF] • [arXiv] • [PMLR] • [code on GitHub: SAMIL] • [video on SlidesLive]
Non-Parametric and Regularized Dynamical Wasserstein Barycenters for Sequential Observations
Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Eric L. Miller
IEEE Transactions on Signal Processing, 2023
Approximate inference by broadening the support of the likelihood
Michael T. Wojnowicz, Martin D. Buck, Michael C. Hughes
Symposium on Advances in Approximate Bayesian Inference (AABI), 2023
Learning where to intervene with a differentiable top-k operator: Towards data-driven strategies to prevent fatal opioid overdoses
Kyle Heuton, Shikhar Shrestha, Thomas J. Stopka, Michael C Hughes
Workshop on Interpretable Machine Learning in Healthcare (IMLH), 2023
Semi-supervised Ordinal Regression via Cumulative Link Models for Predicting In-Hospital Length-of-Stay
Alexander Arjun Lobo, Preetish Rath, Michael C Hughes
Workshop on Interpretable Machine Learning in Healthcare (IMLH), 2023
NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds
Patrick Feeney, Sarah Schneider, Panagiotis Lymperopoulos, Li-Ping Liu, Matthias Scheutz, and Michael C. Hughes
Transactions on Machine Learning Research (TMLR), 03/2023
[OpenReview] • [arXiv] • [dataset: NovelCraft.cs.tufts.edu] • [code on GitHub] • [video on YouTube]
Fix-A-Step: Semi-supervised Learning from Uncurated Unlabeled Data
Zhe Huang, Mary-Joy Sidhom, Benjamin S. Wessler, and Michael C. Hughes
Artificial Intelligence and Statistics (AISTATS), 2023
[paper PDF] • [arXiv] • [code on GitHub] • [Heart2Heart data on GitHub]
Automated Detection of Aortic Stenosis using Machine Learning
Benjamin S. Wessler, Zhe Huang, Gary Long, Stefano Pacifici, Nishant Prashar, Samuel Karmiy, Roman A. Sandler, Joseph Sokol, Daniel B. Sokol, Monica M. Dehn, Luisa Maslon, Eileen Mai, Ayan R. Patel, and Michael C. Hughes
2022
Predicting Spatiotemporal Counts of Opioid-related Fatal Overdoses via Zero-Inflated Gaussian Processes
Kyle Heuton, Shikhar Shrestha, Thomas J. Stopka, Jennifer Pustz, Li-Ping Liu, and Michael C. Hughes
Semi-supervised Learning from Uncurated Echocardiogram Images with Fix-A-Step
Zhe Huang, Mary-Joy Sidhom, Benjamin S. Wessler, Michael C. Hughes
Prediction-Constrained Markov Models for Medical Time Series with Missing Data and Few Labels
Preetish Rath, Gabriel Hope, Kyle Heuton, Erik B. Sudderth, and Michael C. Hughes
Learning from Time Series For Health (TS4H) Workshop at NeurIPS 2022
Easy Variational Inference for Categorical Models via an Independent Binary Approximation
Michael T. Wojnowicz, Shuchin Aeron, Eric L. Miller, and Michael C. Hughes
International Conference on Machine Learning (ICML), 2022
[paper PDF] • [arXiv] • [ICML proceedings on PMLR] • [code on GitHub] • [video on SlidesLive] • [related workshop paper from UAI TPM 2021]
TMED 2: A Dataset for Semi-Supervised Classification of Echocardiograms
Zhe Huang, Gary Long, Benjamin S. Wessler, and Michael C. Hughes
"DataPerf: Benchmarking Data for Data-Centric AI" workshop, co-located with ICML 2022
[paper PDF] • [dataset: TMED.cs.tufts.edu] • [video on SlidesLive] • [code on GitHub]
Optimizing Early Warning Classifiers to Control False Alarms via a Minimum Precision Constraint
Preetish Rath and Michael C. Hughes
Artificial Intelligence & Statistics (AISTATS), 2022.
[paper PDF] • [code on GitHub] • [Twitter thread] • [slides PDF] • [video on SlidesLive] • [earlier workshop paper from IMLH 2021]
Learning Consistent Deep Generative Models from Sparsely Labeled Data
Gabriel Hope, Madina Abdrakhmanova, Xiaoyin Chen, Michael C. Hughes, and Erik B. Sudderth
4th Symposium on Advances in Approximate Bayesian Inference (AABI 2022)
[PDF on OpenReview] • [older version on arXiv] • [poster PDF on GDrive]
2021
Dynamical Wasserstein Barycenters for Time-series Modeling
Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, and Eric L. Miller
Neural Information Processing Systems (NeurIPS), 2021
• [PDF in NeurIPS proceedings] • [code on GitHub] • [arXiv] • [PDF on OpenReview] • [video on SlidesLive]
The Tufts fNIRS Mental Workload Dataset & Benchmark for Brain-Computer Interfaces that Generalize
Zhe Huang, Liang Wang, Giles Blaney, Christopher Slaughter, Devon McKeon, Ziyu Zhou, Robert Jacob, Michael C. Hughes
Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS D&B), 2021
• [PDF on OpenReview] • [Dataset] • [Code on GitHub] • [PDF in NeurIPS proceedings] • [poster PDF] [Twitter thread]
A New Semi-supervised Learning Benchmark for Classifying View and Diagnosing Aortic Stenosis from Echocardiograms
Zhe Huang, Gary Long, Benjamin Wessler, and Michael C. Hughes
Machine Learning for Healthcare (MLHC)
• [paper PDF] • [dataset: TMED.cs.tufts.edu] • [arXiv] • [PMLR] • [spotlight video on Youtube] • [code on GitHub] • [poster PDF] • [Twitter thread]
Approximate Bayesian Computation for an Explicit-Duration Hidden Markov Model of COVID-19 Hospital Trajectories
Gian Marco Visani, Alexandra Hope Lee, Cuong Nguyen, David M. Kent, John B. Wong, Joshua T. Cohen, and Michael C. Hughes
Machine Learning for Healthcare (MLHC)
• [paper PDF] • [arXiv] • [PMLR] • [spotlight video on Youtube] • [code on GitHub] • [poster PDF] • [Twitter thread]
Forecasting COVID-19 Counts At A Single Hospital: A Hierarchical Bayesian Approach
Alexandra Hope Lee, Panagiotis Lymperopoulos, Joshua T. Cohen, John B. Wong, and Michael C. Hughes
ICLR 2021 Workshop: Machine Learning for Preventing and Combating Pandemics
• [paper PDF] • [PDF on arXiv] • [code on GitHub: single-hospital-count-forecasting]
Enzyme Promiscuity Prediction Using Hierarchy-Informed Multi-Label Classification
Gian Marco Visani, Michael C. Hughes, and Soha Hassoun
Bioinformatics
• [paper]
Modeling Graph Node Correlations with Neighbor Mixture Models
Linfeng Liu, Michael C. Hughes, and Liping Liu
unpublished preprint
2020
Assessment of a Prediction Model for Antidepressant Treatment Stability Using Supervised Topic Models
Michael C. Hughes, Melanie F. Pradier, Andrew Slavin Ross, Thomas H. McCoy Jr, Roy H. Perlis, Finale Doshi-Velez
• [paper PDF] • [supplement PDF] • [listing on jamanetwork.com] • [supplement on jamanetwork.com] • [earlier draft on medRxiv]
MIMIC-Extract: A Data Extraction, Preprocessing, and Representation Pipeline for MIMIC-III
Shirly Wang, Matthew B. A. McDermott, Geeticka Chauhan, Marzyeh Ghassemi, Michael C. Hughes, and Tristan Naumann
CHIL 2020: Proceedings of the ACM Conference on Health, Inference, and Learning
• [paper PDF] • [listing on acm.org] ≺ [code on GitHub: MLforHealth/MIMIC_Extract] ≺ [video on SlidesLive]
POPCORN: Partially Observed Prediction-Constrained Reinforcement Learning
Joseph Futoma, Michael C. Hughes, and Finale Doshi-Velez"
• [paper PDF] • [arXiv] • [video on SlidesLive] • [previous workshop paper PDF]
Optimal Transport Based Change Point Detection and Time Series Clustering
Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Erika Hussey, and Eric L. Miller
ICASSP 2020: IEEE International Conference on Acoustics, Speech and Signal Processing
• [paper PDF] • [IEEE Xplore listing] • [Virtual Presentation]
2019
Rapid Model Comparison by Amortizing Across Models
Lily H. Zhang and Michael C. Hughes
2nd Symposium on Advances in Approximate Bayesian Inference (AABI 2019)
Challenges in Computing and Optimizing Upper Bounds of Marginal Likelihood based on Chi-Square Divergences
Melanie F. Pradier, Michael C. Hughes, and Finale Doshi-Velez
2nd Symposium on Advances in Approximate Bayesian Inference (AABI 2019)
Supervised Machine Learning Algorithms Using Patient Related Factors to Predict in-Hospital Mortality Following Acute Myeloid Leukemia Therapy
Nauman Saleem Siddiqui, Andreas Klein, Amandeep Godara, Cindy Varga, Rachel J. Buchsbaum, and Michael C. Hughes
Proceedings of 61st Annual Meeting of the American Hematology Society
2018
Prediction-Constrained POMDPs
Joseph Futoma, Michael C. Hughes, and Finale Doshi-Velez
Reinforcement Learning under Partial Observability (RLPO) workshop at NeurIPS 2018
Rethinking clinical prediction: Why machine learning must consider year of care and feature aggregation
Bret Nestor, Matthew B. A. McDermott, Geeticka Chauhan, Tristan Naumann, Michael C. Hughes, Anna Goldenberg, Marzyeh Ghassemi
Machine Learning for Healthcare (ML4H) workshop at NeurIPS 2018
• [paper PDF] • [arXiv]
Semi-Supervised Prediction-Constrained Topic Models
Michael C. Hughes, Gabriel Hope, Leah Weiner, Thomas H. McCoy, Jr., Roy H. Perlis, Erik B. Sudderth, and Finale Doshi-Velez
• [paper PDF] • [supplement PDF] • [Github code] • [learned topics HTML] • [arXiv PDF (slightly older, longer version)] •
2017
Prediction-Constrained Topic Models for Antidepressant Prediction
Michael C. Hughes, Gabriel Hope, Leah Weiner, Thomas H. McCoy, MD, Roy H. Perlis, MD, Erik B. Sudderth, and Finale Doshi-Velez
• [short paper PDF] • [poster PDF] • [longer tech report on arXiv]
2016
Supervised topic models for clinical interpretability
Michael C. Hughes, Huseyin Melih Elibol Thomas McCoy, MD Roy Perlis, MD Finale Doshi-Velez
• [paper PDF] • [arXiv]
Fast Learning of Clusters and Topics via Sparse Posteriors
Michael C. Hughes, Erik B. Sudderth
unpublished preprint
• [paper PDF] • [arXiv]
2015
Scalable Adaptation of State Complexity for Nonparametric Hidden Markov Models
Michael C. Hughes, William Stephenson, Erik B. Sudderth
Neural Information Processing Systems, 2015.
• [paper PDF] • [supplement PDF] • [Python code for inference: bnpy] • [Python code for plots: x-hdphmm-nips2015]
Reliable and Scalable Variational Inference for the Hierarchical Dirichlet Process.
Michael C. Hughes, Daeil Kim, Erik B. Sudderth
Artificial Intelligence & Statistics, 2015.
• [paper PDF] • [supplement PDF] • [Python code: bnpy] • [poster PDF] • [poster PPTX]
2014
bnpy: Reliable and scalable variational inference for Bayesian nonparametric models
Michael C. Hughes, Erik B. Sudderth
3rd NIPS Workshop on Probabilistic Programming
Joint Modeling of Multiple Time Series via the Beta Process with Application to Motion Capture Segmentation.
Emily Fox, Michael C. Hughes, Erik B. Sudderth, Michael I. Jordan
Annals of Applied Statistics, Vol. 8(3), 2014.
• [paper PDF] • [supplement PDF] • [Matlab code: NPBayesHMM]
2013
Memoized Online Variational Inference for Dirichlet Process Mixture Models
Michael C. Hughes, Erik B. Sudderth
Neural Information Processing Systems, 2013.
• [paper PDF] • [supplement PDF] • [poster PDF] • [Python code: bnpy]
2012
Effective Split-Merge Monte Carlo Methods for Nonparametric Models of Sequential Data
Michael C. Hughes, Emily Fox, Erik B. Sudderth
Neural Information Processing Systems, 2012.
The Nonparametric Metadata Dependent Relational Model
Dae Il Kim, Michael C. Hughes, Erik B. Sudderth
International Conference on Machine Learning, 2012.
Nonparametric Discovery of Activity Patterns from Video Collections
Michael C. Hughes, Erik B. Sudderth
CVPR Workshop on Perceptual Organization in Computer Vision (POCV), 2012.
• [paper PDF] • [supplement ZIP] • [Matlab code: NPBayesHMM]