Prediction-Constrained Topic Models for Antidepressant Recommendation (1712.00499v1)

Published 1 Dec 2017 in cs.LG and stat.ML

Abstract: Supervisory signals can help topic models discover low-dimensional data representations that are more interpretable for clinical tasks. We propose a framework for training supervised latent Dirichlet allocation that balances two goals: faithful generative explanations of high-dimensional data and accurate prediction of associated class labels. Existing approaches fail to balance these goals by not properly handling a fundamental asymmetry: the intended task is always predicting labels from data, not data from labels. Our new prediction-constrained objective trains models that predict labels from heldout data well while also producing good generative likelihoods and interpretable topic-word parameters. In a case study on predicting depression medications from electronic health records, we demonstrate improved recommendations compared to previous supervised topic models and high- dimensional logistic regression from words alone.

Authors (7)

Michael C. Hughes (39 papers)
Gabriel Hope (4 papers)
Leah Weiner (3 papers)
Thomas H. McCoy (2 papers)
Roy H. Perlis (4 papers)
Erik B. Sudderth (18 papers)
Finale Doshi-Velez (134 papers)

Citations (9)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Prediction-Constrained Topic Models for Antidepressant Recommendation (1712.00499v1)

Summary

Related Papers