SiteSucker is an application that automatically downloads web sites from the Internet. SiteSucker does this by copying the site's HTML documents, images, backgrounds, movies, and other files to your local hard drive.
Just enter a URL and click a button and SiteSucker can download an entire web site.
SiteSucker can be used to make local copies of your web sites for easy maintenance.
SiteSucker can download files unmodified or it can "localize" the files it downloads, allowing you to browse a site off-line. Best of all, SiteSucker is free.
SiteSucker has been completely rewritten as a Universal Cocoa application. It now uses WebKit to asynchronously download files, and it includes integrated online help.
With SiteSucker, you can now save all the information about a download in a document. This allows you to create a document that you can use to perform the same download whenever you want.
If SiteSucker is in the middle of a download when you choose the Save command, SiteSucker will pause the download and save its status with the document. When you open the document later, you can restart the download from where it left off by pressing the Resume button.
Requirements:
· CarbonLib 1.5 or greater
Limitations:
· If a link is specified in a different tag, SiteSucker will not see it.
·
· SiteSucker totally ignores JavaScript. Any link specified within JavaScript will not be seen by SiteSucker and will not be downloaded. (If the Log Warnings option is on in the download settings, SiteSucker will include a warning in the log file for any page that uses JavaScript.)
·
· SiteSucker scans Flash (.swf) files for embedded plain text links, but it can only detect links to files that have one of the following extensions: html, swf, mp3, sit, zip, mov, gif, jpg, png, doc, or txt. SiteSucker also scans QuickTime movies (.mov) for URLs to alternate movies. SiteSucker cannot localize Flash files or QuickTime movies, and SiteSucker does not examine other media files for embedded links.
·
· By default, SiteSucker honors robots.txt exclusions and the Robots META tag. Therefore, any directories or pages disallowed by robot exclusions will not be downloaded by SiteSucker. This behavior, however, can be overridden with the Ignore Robot Exclusions setting under the Advanced tab in the download settings.
What's New in This Release: [ read full changelog ]
· Fixed problems with URL encoding.
· Fixed a problem handling allowed paths in robots.txt.
· Added "Download URLs" Automator action.
· Added "Generate HTML" option to the Download Settings.
· Modified the Path setting under the Parameters tab so that it works correctly.
· Modified the "Include Supporting Files" option to ignore the "Maximum Number of Levels" setting.
· Fixed various bugs.