Skip to content

Add LightLDA transform #373

Closed
Closed
@ganik

Description

@ganik

LightLDA: Big Topic Models on Modest Compute Clusters

  • Current implementations of LDA ( Latent Dirichlet Allocation ) such as SparseLDA or AliasLDA allow to achieve massive data and model scales, for example models with tens of billions of parameters to be inferred from billions of documents. However this requires using cluster up to thousands of machines with all ensuing costs to setup and maintain.
  • LightLDA solves this problem in a more cost-effective manner by providing an implementation that is efficient enough for modest clusters with at most tens of machines...

For more details please see LightLDA paper:
http://arxiv.org/abs/1412.1576
http://www.www2015.it/documents/proceedings/proceedings/p1351.pdf
and open source implementation:
https://github.com/Microsoft/LightLDA

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions