Metadata-Version: 1.1
Name: gutenberg-cleaner
Version: 0.0.6
Summary: Module to clean up texts from Project Gutenberg
Home-page: https://github.com/c-w/GutenbergCleaner
Author: Clemens Wolff
Author-email: clemens.wolff+pypi@gmail.com
License: LICENSE.txt
Download-URL: http://pypi.python.org/pypi/GutenbergCleaner
Description: ****************
        GutenbergCleaner
        ****************
        
        
        Overview
        ========
        
        This package contains utility methods to help in cleaning up public domain texts
        from `Project Gutenberg <http://www.gutenberg.org>`_.
        
        Texts on Project Gutenberg come with a number of different disclaimers,
        copyright headers, legal boilerplate, and so forth.  This makes it difficult to
        work with the texts in an automated way - e.g. Natural Language Processing:
        Authorship Attribution, Stylometry, Text Summarization, ...  The utilities in
        this package aim to remove the boilerplate, leaving only the raw works behind.
        
        
        Installation
        ============
        
        This project is on `PyPI <https://pypi.python.org/pypi/gutenberg_cleaner>`_, so
        I'd recommend that you just install everything from there using your favourite
        Python package manager.
        
        .. sourcecode :: sh
        
            pip install gutenberg_cleaner
            easy_install gutenberg_cleaner
        
        
        Usage
        =====
        
        There are some programs demonstrating how to use this library in the *scripts*
        directory.
        
Platform: UNKNOWN
