Metadata-Version: 1.0
Name: HtmlList
Version: 2.2.1
Summary: Extract information from HTML pages that contain a repetitive pattern
Home-page: http://pyhtmllist.sourceforge.net/
Author: Erez Bibi
Author-email: erezbibi@users.sourceforge.net
License: GPL
Description: This package finds repetitive formatting patterns in an
        HTML page that contains one or more lists and extracts the sub-HTML text that
        creates each pattern. The idea is that a typical HTML data page containing a
        list of items presents a repetitive pattern to the human eye (the page
        format). This pattern can be recognized automatically, and the data in the list
        can then be extracted.
Keywords: HTML list information extraction repetitive pattern
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: GNU General Public License (GPL)
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Classifier: Topic :: Text Processing :: Markup :: HTML
Classifier: Topic :: Utilities
