Introduction
============

transmogrify.htmlcontentextractor
   This blueprint extracts the title, description and body from HTML,
   using XPath, TAL, or automatic cluster analysis.
