Duke for Mac

1.2 Apache    
  UNRATED
DOWNLOAD NOW 5.1 MB

  292 downloads

A fast deduplication engine

description

download

specs

changelog

Duke is a small, free, easy to use, fast and flexible deduplication (entity resolution or record linkage) engine written in Java on top of Lucene.

At the moment Duke can process 1,000,000 records in 11 minutes on a standard laptop in a single thread.

Duke can be used to find duplicate records inside a single table/data source, or it can be used to find records in different tables/sources which most likely represent the same real-world entity.

Duke is written in the Java programming language and it can be used on Mac OS X, Windows and Linux.
read more   
Last updated on February 18th, 2014

#deduplication engine #entity resolution #record linkage #deduplication #engine #resolution #develop

Duke - Duke will report its findings to the MatchListener, you can write your own MatchListeners, or use those which come with Duke.

top FREE alternatives

0 User reviews so far.

SUBMIT