LinkChecker is a free, open source command-line utility for checking websites and HTML documents for broken links.
LinkChecker features recursive checking; multithreading; output in colored or normal text, HTML, SQL, CSV, or a sitemap graph in GML or XML; support for HTTP/1.1, HTTPS, FTP, mailto:, news:, nntp:, Telnet and local file links; restriction of link checking with regular expression filters for URLs; proxy support; username/password authorization for HTTP and FTP; robots.txt exclusion protocol support; i18n support; a command line interface; and a (Fast)CGI web interface (requires an HTTP server).
Here are some key features of "LinkChecker":
· recursive and multithreaded checking
· output in colored or normal text, HTML, SQL, CSV, XML or a sitemap graph in different formats
· HTTP/1.1, HTTPS, FTP, mailto:, news:, nntp:, Telnet and local file links support
· restriction of link checking with regular expression filters for URLs
· proxy support
· username/password authorization for HTTP, FTP and Telnet
· honors robots.txt exclusion protocol
· cookie support
· HTML and CSS syntax check
· antivirus check
· a command line interface
· a GUI client interface
· a (Fast)CGI web interface (requires HTTP server)
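To illustrate two of the features listed above (regular expression filters for URLs and the robots.txt exclusion protocol), here is a minimal sketch using only the Python 3 standard library. It is not LinkChecker's own code: the patterns, the function names and the "ExampleChecker" user agent are hypothetical, and the robots.txt rules are supplied inline so that no network access is needed.

```python
import re
from urllib.robotparser import RobotFileParser

# Hypothetical ignore patterns (assumption: chosen only for illustration).
IGNORE_PATTERNS = [re.compile(r"^mailto:"), re.compile(r"/private/")]


def is_ignored(url):
    """Return True if any ignore pattern matches somewhere in the URL."""
    return any(pattern.search(url) for pattern in IGNORE_PATTERNS)


def build_robots(lines):
    """Parse robots.txt rules given as a list of lines (no network access)."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def should_check(url, robots, agent="ExampleChecker"):
    """Check a URL only if no filter matches and robots.txt allows it."""
    return not is_ignored(url) and robots.can_fetch(agent, url)


# Example rules: every user agent is kept out of /cgi-bin/.
ROBOTS = build_robots(["User-agent: *", "Disallow: /cgi-bin/"])

if __name__ == "__main__":
    for candidate in (
        "http://example.com/index.html",
        "http://example.com/cgi-bin/form.cgi",
        "mailto:admin@example.com",
    ):
        print(candidate, should_check(candidate, ROBOTS))
```

A real checker would fetch each site's robots.txt and read its filters from the command line or a configuration file; the sketch only shows how the two checks could be combined before a link is visited.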
Requirements:
· Python 2.5 or later
What's New in This Release:
Fixes:
· checking: Always use GET for Zope servers since their HEAD support is broken.
· installation: Install correct MSVC++ runtime DLL version for Windows.
· installation: Install missing Python modules for twill, cssutils and HTMLTidy.
Changes:
· documentation: Made the --ignore-url documentation clearer. Patch from Charles Jones.
· installation: Report missing py2app instead of generating a Distutils error.
· documentation: Fix typo in linkcheckerrc.5 manual page.
Features:
· installation: Add dependency declaration documentation to setup.py.