    Methabot 1.7.0

    Downloads: 468
    User Rating: NOT RATED (0 users)
    Developer: Emil Romanus
    License / Price: Freeware / FREE
    Size / OS: 479 KB / Mac OS X
    Binary Format: Universal Binary
    Last Updated: November 8th, 2009, 01:44 GMT
    Category: Internet Utilities


    Methabot description

     

    A free web crawler and command line tool optimized for speed

    Methabot supports scripted filetype parsing and a wide variety of customization options, and is easily configured to fit anyone's particular needs.

    Methabot is designed for extensibility and customization. It is being developed with high modularity in mind and comes with JavaScript as its scripting language.

    Using the module system and scripting language, users can take full or partial control of the crawling process and decide how Methabot should store web data, statistics and much more.

    Just by running Methabot from the command line you are able to configure custom filetypes, filtering expressions, behaviour, and much more, so you don't have to be a scripter!

    Methabot is portable and has been tested successfully on Mac OS X, 32-bit/64-bit Linux 2.6, 32-bit/64-bit FreeBSD 6.x/7.0, and Windows XP. It should work on almost any Unix-like OS.

    Here are some key features of "Methabot":

    · It's fast, designed from the ground up with speed optimization in mind.
    · Scriptable through E4X
    · User-defined filetype filtering (according to MIME type, file extension or UMEX expression)
    · Multi-threaded
    · Highly configurable from the command line
    · Extensible module system, supporting custom data parsers and filters.
    · Simple yet powerful filtering of URLs through UMEX.
    · Automated downloading
    · Support for automatic cookie handling when running over HTTP
    · Reliable, fault-tolerant networking
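
    The user-defined filetype filtering listed above (by MIME type, file extension or expression) can be pictured with a small sketch. This is not Methabot's implementation or its configuration syntax: the FileType class, its field names, the classify() helper and the sample rules are hypothetical, and a Python regular expression is used only as a stand-in for a UMEX expression.

```python
import re
from dataclasses import dataclass, field
from typing import Optional
from urllib.parse import urlparse


@dataclass
class FileType:
    """Hypothetical filetype rule; not Methabot's real data model."""
    name: str
    mimetypes: set = field(default_factory=set)
    extensions: set = field(default_factory=set)
    pattern: Optional[str] = None  # regex stand-in for a UMEX expression

    def matches(self, url: str, mimetype: Optional[str] = None) -> bool:
        # A rule matches as soon as any one of its criteria matches.
        if mimetype is not None and mimetype.lower() in self.mimetypes:
            return True
        path = urlparse(url).path.lower()
        if any(path.endswith("." + ext) for ext in self.extensions):
            return True
        return self.pattern is not None and re.search(self.pattern, url) is not None


def classify(url, mimetype, filetypes):
    """Return the name of the first matching rule, or None if nothing matches."""
    for ft in filetypes:
        if ft.matches(url, mimetype):
            return ft.name
    return None


# Sample rules, made up for illustration.
RULES = [
    FileType("html", mimetypes={"text/html"}, extensions={"html", "htm"}),
    FileType("image", extensions={"png", "jpg", "gif"}),
    FileType("feed", pattern=r"/feeds?/"),
]
```

    With these sample rules, a URL ending in .html or served as text/html would be classified as "html", while a URL that matches no rule yields None, which a crawler could treat as "skip this file".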

    What's New in This Release:

    · Support for converting between character encodings through libiconv
    · New parser utf8conv for converting almost any character encoding to UTF-8
    · New parser entityconv, which converts HTML entities such as &auml; to the corresponding UTF-8 character
    · The configuration system has been moved to a separate library, libmethaconfig
    · Various improvements to the configuration loader, such as dynamically adding and changing classes and scopes
    · Lots of memory usage optimizations and cleanup fixes
    · The documentation available in the wiki has been copied to a texinfo file; from now on all documentation will be put in this texinfo file and made available as a manual both online and offline
    · Support for filetype attributes. Parsers can now set custom data that will be associated with a parsed file. The primary use of attributes is storing metadata about a URL when connected to a Methanol system.
    · New JavaScript function set_attribute() for setting attributes for the current URL
    · API support for custom status, error/warning and target reporter functions
    · lmetha_global_setopt() is no longer available, replaced with lmetha_setopt() options
    · SpiderMonkey-1.8.0 support added
    · New global JavaScript function exec()
    · New built-in handler function writefile
    · libmetha no longer depends on libev, but instead uses pipes and epoll() for inter-thread communication and waiting for events on sockets
    · Added internal counters useful for keeping statistics
    · New filetype option 'ignore_host'
    · Setting the --external option to false can no longer be circumvented using an HTTP redirect
    · Support for CURIE (why not?) in the built-in HTML parser added
    · Bugfix: a syntax error would in some rare cases occur when parsing integer values in configuration files
    · Bugfix in the configuration file parser when reading flag values
    · Bugfix: when JavaScript filetype parsers did not return a value, it was treated as the string "undefined" and used as a relative URL
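
    To illustrate what the utf8conv and entityconv parsers in this changelog are described as doing, here is a minimal sketch of the same two steps. It is only an approximation under stated assumptions: Methabot performs the conversion through libiconv and its own parsers, whereas this sketch uses Python's built-in codecs and html.unescape() as stand-ins, and the function name to_utf8 is hypothetical.

```python
import html


def to_utf8(raw: bytes, source_encoding: str) -> bytes:
    """Decode bytes from source_encoding, resolve HTML entities,
    and re-encode the result as UTF-8.

    Hypothetical helper: bytes that cannot be decoded are replaced
    with U+FFFD instead of raising an error.
    """
    text = raw.decode(source_encoding, errors="replace")
    text = html.unescape(text)  # turns named and numeric entities into characters
    return text.encode("utf-8")
```

    For example, calling to_utf8() on Latin-1 encoded page content would yield UTF-8 bytes in which an entity such as &amp;auml; has been replaced by the two-byte UTF-8 sequence for the character it names.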

     


    TAGS:

    web crawler | url filter | website crawler | crawl | filter | crawler


