Metadata-Version: 1.0
Name: textdata
Version: 0.415
Summary: Get clean line or text data from multi-line strings
Home-page: https://bitbucket.org/jeunice/textdata
Author: Jonathan Eunice
Author-email: jonathan.eunice@gmail.com
License: UNKNOWN
Description: 
        It's very common to need to extract text or text lines from within program
        source. The way Python likes to have its text indented, however, means that
        there will often be extra spaces appended to the beginning of each line, as well
        as possibly extra lines at the start and end of the text that are there for
        inclusion in the program source, but not useful in the resulting data.
        
        Python string methods give easy ways to clean this text up, but it's no joy
        reinventing that particular text-cleanup wheel every time you need it.
        
        This module helps clean up included text (or text lines) in a simple, reusuable
        way that won't muck up your programs with extra code, or require constant
        wheel-reinvention.
        
        Usage
        =====
        
        ::
          
            data = lines("""
                There was an old woman who lived in a shoe.
                She had so many children, she didn't know what to do;
                She gave them some broth without any bread;
                Then whipped them all soundly and put them to bed.    
            """)
            
        will result in::
        
            ['There was an old woman who lived in a shoe.',
             "She had so many children, she didn't know what to do;",
             'She gave them some broth without any bread;',
             'Then whipped them all soundly and put them to bed.']
        
        If instead you used ``textlines()``, the result is the same, but joined
        by newlines into
        into a single string::
        
            "There was an old woman who lived in a shoe.\nShe ... to bed."
            # where the ... abbreviates exactly the characters you'd expect
            
        Both routines provide  typically-desired cleanups:
        
          * remove blank lines default), but at least first and last blanks
            (which usually appear due to Python formatting)
          * remove common line prefix (default)
          * strip leading/trailing spaces (leading by request, trailing by default)
          
        The API
        =======
        
        ``lines(text, noblanks=True, dedent=True, lstrip=False, rstrip=True)``
        
            Returns text as a series of cleaned-up lines.
        
            * ``text`` is the text to be processed. 
            * ``noblanks`` => all blank lines are eliminated, not just starting and ending ones. (default ``True``). 
            * ``dedent`` => strip a common prefix (usually whitespace) from each line (default ``True``). 
            * ``lstrip`` => strip all left (leading) space from each line (default ``False``). 
              Note that ``lstrip`` and ``dedent`` are  mutualy exclusive ways of handling leading space. 
            * ``rstrip`` => strip all right (trailing) space from each line (default ``True``)
        
        ``textlines(text, noblanks=True, dedent=True, lstrip=False, rstrip=True)``
        
            Does the same helpful cleanups as ``lines()``, but returns result as a single string, with lines
            separated by newlines (and without a trailing newline).
        
        Design Notes
        ============
        
        It's tempting to define a constant such as ``Dedent``  that might be the default for
        the ``lstrip`` parameter, instead of having separate ``dedent`` and ``lstrip`` booleans.
        The more I use singleton classes in Python as designated special values, the more
        useful they seem.
        
        Recent Changes
        ==============
        
         * Commenced automated mutli-version testing with
           `pytest <http://pypi.python.org/pypi/pytest>`_
           and `tox <http://pypi.python.org/pypi/tox>`_. Versions later than 0.4 
           successfully test against Python 2.5, 2.6, 2.7, 3.2, 3.3 and PyPy 1.9 (based
           on 2.7.2)
           
         * Common line prefix is now computed without considering blank lines, so blank
           lines need not have any indentation on them just to "make things work."
           
         * The tricky case where all lines have a common prefix, but it's not entirely
           composed of whitespace, now properly handled.
        
        Installation
        ============
        
        ::
        
            pip install -U textdata
            
        To ``easy_install`` under a specific Python version (3.3 in this example)::
        
            python3.3 -m easy_install --upgrade textdata
            
        (You may need to prefix these with "sudo " to authorize installation.)
Keywords: text data lines
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Operating System :: OS Independent
Classifier: License :: OSI Approved :: BSD License
Classifier: Intended Audience :: Developers
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2.5
Classifier: Programming Language :: Python :: 2.6
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.2
Classifier: Programming Language :: Python :: 3.3
