I have received reports that parsing large (~18 MB) HTML files results in excessive memory use (1 GB+), which is more than 50 times the size of the input. This needs to be confirmed and, if it holds up, addressed.
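
One possible way to confirm the report is to measure peak memory against input size on a synthetic document. The sketch below is only a starting point. The report does not name the parser, so Python's standard-library `html.parser` is used purely as a placeholder, and `make_html`, `peak_memory_ratio`, and `parse_with_stdlib` are hypothetical helper names. Note that `tracemalloc` only sees allocations made through Python's allocator, so a parser implemented in C would need a process-level measurement instead.

```python
import tracemalloc
from html.parser import HTMLParser


class CountingParser(HTMLParser):
    """Placeholder parser that counts start tags; swap in the parser under test."""

    def __init__(self):
        super().__init__()
        self.tags = 0

    def handle_starttag(self, tag, attrs):
        self.tags += 1


def make_html(target_size):
    """Build a synthetic ASCII HTML document of roughly target_size characters."""
    row = "<p class='row'>some filler text for the memory test</p>\n"
    count = max(1, target_size // len(row))
    return "<html><body>\n" + row * count + "</body></html>\n"


def peak_memory_ratio(parse, html):
    """Return (peak_bytes, peak_bytes / len(html)) for one call to parse(html)."""
    tracemalloc.start()
    try:
        parse(html)
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return peak, peak / len(html)


def parse_with_stdlib(html):
    """Assumption: stands in for the real parser, which the report does not name."""
    parser = CountingParser()
    parser.feed(html)
    parser.close()
    return parser.tags


if __name__ == "__main__":
    # Use a smaller document first; raise toward ~18 MB to mirror the report.
    html = make_html(1_000_000)
    peak, ratio = peak_memory_ratio(parse_with_stdlib, html)
    print(f"input: {len(html)} chars, peak: {peak} bytes, ratio: {ratio:.1f}x")
```

Running this at several input sizes would show whether the ratio stays roughly constant or grows with the document, which helps separate a large fixed per-node overhead from a genuine scaling problem.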