.. gensim documentation master file, created by
   sphinx-quickstart on Tue Mar 16 19:45:41 2010.
   You can adapt this file completely to your liking, but it should at least
   contain the root `toctree` directive.

Gensim -- Topic Modelling for Humans
=====================================================


.. raw:: html
   :file: _static/tagcloud.html


.. raw:: html

   <table class="links" style="font-size: 200%;">
        <tr>
                <td><a href='intro.html'>What is this?</a></td>
                <td><a href='install.html'>Installation</a></td>
        </tr>
        <tr>
                <td><a href='tutorial.html'>Get started!</a></td>
                <td><a href='about.html'>About</a></td>
        </tr>
   </table>


Quick Reference Example
------------------------

>>> from gensim import corpora, models, similarities
>>>
>>> # Load corpus iterator from a Matrix Market file on disk.
>>> corpus = corpora.MmCorpus('/path/to/corpus.mm')
>>>
>>> # Initialize a transformation (Latent Semantic Indexing with 200 latent dimensions).
>>> lsi = models.LsiModel(corpus, num_topics=200)
>>>
>>> # Convert another corpus to the latent space and index it.
>>> index = similarities.MatrixSimilarity(lsi[another_corpus])
>>>
>>> # determine similarity of a query document against each document in the index
>>> sims = index[query]


.. admonition:: What's new?

   * 15 Sep 2012: release 0.8.6 : added the `hashing trick <http://en.wikipedia.org/wiki/Hashing-Trick>`_ to allow online changes to the vocabulary; fixed parallel lemmatization + `other minor improvements <https://github.com/piskvorky/gensim/blob/develop/CHANGELOG.txt>`_
   * 22 Jul 2012: release 0.8.5 : better Wikipedia parsing, faster similarity queries, maintenance fixes
   * 30 Apr 2012: William Bert's `interview with me <http://williamjohnbert.com/2012/04/interview-with-radim-rehurek-creator-of-gensim/>`_
   * 9 Mar 2012: release 0.8.4: new model `Hierarchical Dirichlet Process <http://radimrehurek.com/gensim/models/hdpmodel.html>`_ (full `CHANGELOG <https://github.com/piskvorky/gensim/blob/develop/CHANGELOG.txt>`_)


.. toctree::
   :hidden:
   :maxdepth: 1

   intro
   install
   tutorial
   distributed
   wiki
   apiref
