Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Nested HDP for Hierarchical Topic Models (1301.3570v1)

Published 16 Jan 2013 in stat.ML

Abstract: We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP is a generalization of the nested Chinese restaurant process (nCRP) that allows each word to follow its own path to a topic node according to a document-specific distribution on a shared tree. This alleviates the rigid, single-path formulation of the nCRP, allowing a document to more easily express thematic borrowings as a random effect. We demonstrate our algorithm on 1.8 million documents from The New York Times.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. John Paisley (60 papers)
  2. Chong Wang (308 papers)
  3. David Blei (40 papers)
  4. Michael I. Jordan (438 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.