HtmlCleaner 2.6.1

Free and open source HTML parser

What's new in HtmlCleaner 2.6.1:

  • Fixed Issue 90: reinstated HtmlCleaner's public instance method clean(Reader)
License type: BSD
File size: 124 KB
Developed by: Vladimir Nikic
Category: Home \ Utilities
Screenshot: HtmlCleaner's usage screen when run from a Terminal window.
HtmlCleaner is a free and open-source HTML parser written in Java. HTML found on the Web is usually dirty, ill-formed and unsuitable for further processing.

For any serious consumption of such documents, it is necessary to first clean up the mess and bring order to the tags, attributes and ordinary text.

Given an HTML document, HtmlCleaner reorders individual elements and produces well-formed XML.

By default, HtmlCleaner follows rules similar to those most web browsers use to build the Document Object Model. However, the user may provide a custom tag and rule set for tag filtering and balancing.
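
To make the idea of tag balancing concrete, here is a minimal sketch in plain Java. It is not HtmlCleaner's actual algorithm or API: the class name TagBalancer and the method balance are hypothetical, and the sketch assumes simple tags without attributes and does not escape text. It only closes unclosed tags and drops stray closing tags.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of the HtmlCleaner library.
class TagBalancer {
    // Matches a bare opening or closing tag such as <b> or </b> (no attributes).
    private static final Pattern TAG = Pattern.compile("<(/?)([a-zA-Z][a-zA-Z0-9]*)>");

    static String balance(String html) {
        StringBuilder out = new StringBuilder();
        Deque<String> open = new ArrayDeque<>(); // stack of currently open tags
        Matcher m = TAG.matcher(html);
        int last = 0;
        while (m.find()) {
            out.append(html, last, m.start()); // copy the text between tags
            last = m.end();
            String name = m.group(2).toLowerCase();
            if (m.group(1).isEmpty()) {
                // Opening tag: remember it and emit it.
                open.push(name);
                out.append('<').append(name).append('>');
            } else if (open.contains(name)) {
                // Closing tag with a matching open tag: first close anything nested inside it.
                String top;
                do {
                    top = open.pop();
                    out.append("</").append(top).append('>');
                } while (!top.equals(name));
            }
            // A closing tag with no matching open tag is silently dropped.
        }
        out.append(html.substring(last)); // trailing text after the last tag
        while (!open.isEmpty()) {
            // Close whatever is still open at the end of the input.
            out.append("</").append(open.pop()).append('>');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // prints <p>Hello <b>world</b></p>
        System.out.println(balance("<p>Hello <b>world</i>"));
    }
}
```

A real cleaner also has to handle attributes, entities, comments and per-tag nesting rules, which is where a configurable tag and rule set comes in.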

Last updated on September 9th, 2013

Runs on: Mac OS X (Universal Binary)
