  - parser dtd is hardcoded to xhtml1-strict
    PyXML validating parser should come bundled with the w3c dtd's instead of
    hitting the w3c site each time for a dtd that never changes.
  - filter is hardcoded


And whatever issues there are in the tracker:
https://bitbucket.org/charstring/crawlidator/issues?status=new&status=open
