screen-scraper is a cross-platform web scraper for extracting data from Web sites.
screen-scraper is a cross-platform web scraper for extracting data from Web sites consists of a proxy server that allows the contents of HTTP and HTTPS requests to be viewed, and an engine that can be configured to extract information from Web sites using special patterns and regular expressions.
screen-scraper is a cross-platform web scraper for extracting data from Web sites handles authentication, redirects, and cookies, and contains an embedded scripting engine that allows extracted data to be manipulated, written out to a file, or inserted into a database.
screen-scraper is a cross-platform web scraper for extracting data from Web sites can be used with .NET, ColdFusion, Java, PHP, or any COM-friendly language such as Visual Basic or Active Server Pages.
Here are some key features of "screen-scraper":
· Proxy server for recording web page browsing
· Scraping Engine for extracting data
· Scripting
· Write extracted data to csv, plain text or databases
· Proxy server for recording web page browsing
· Web page data extraction
· File system data extraction
· Invoke sessions from the command line
· Export scraping sessions
· Scripting in Interpreted Java, JScript, JavaScript, Python and VBScript
· Forum access
· Page Scraping and Web Scraping
· Web Site Data Extraction
· Web Content Mining
· Web Fetching
· Web Parsing
What's New in This Release: [ read full changelog ]
· feature: added REST interface
· feature: can now filter out less useful proxy transactions
· feature: added DataManager to facilitate saving data to a database
· feature: generate multiple scrapeable files from proxy session
· feature: made button bar persistent for extractor patterns
· feature: retained number of lines to display for scraping session log between sessions
· feature: updated scrapeable file icons to indicate when they are and are not invoked in sequence
· feature: added a delete option for scraping sessions to web interface
· feature: enhanced data set viewer with list view and colored tokens
· feature: improved script error messages
· feature: added a method to allow HTTP parameters to be removed from scrapeable files
· feature: added logging levels to scraping session
· feature: added ability to compare request in scrapeable file with transaction in proxy session
· feature: enhanced breakpoint window to show more information, such as current script and number of scripts on the stack
· feature:...