HtmlCleaner iconHtmlCleaner 2.6.1

Free and open source HTML parser
HtmlCleaner is a free and open source open-source HTML parser written in Java. HTML found on Web is usually dirty, ill-formed and unsuitable for further processing.

For any serious consumption of such documents, it is necessary to first clean up the mess and bring the order to tags, attributes and ordinary text.

For the given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML.

By default, HtmlCleaner follows similar rules that the most of web browsers use in order to create Document Object Model. However, user may provide custom tag and rule set for tag filtering and balancing.

last updated on:
September 9th, 2013, 17:23 GMT
file size:
124 KB
price:
FREE!
developed by:
Vladimir Nikic
license type:
BSD 
operating system(s):
Mac OS X
binary format:
Universal Binary
category:
Home \ Utilities

FREE!

In a hurry? Add it to your Download Basket!

user rating

UNRATED
0.0/5
 

0/5

1 Screenshot
HtmlCleaner - Usage screen for the application when running it from a Terminal window.
What's New in This Release:
  • Fixed Issue 90: Re-instating the HtmlCleaner's public instance method clean(Reader)
read full changelog

Add your review!

SUBMIT