Duke for Mac

5.1 MB   292 downloads
1.2 Apache    
  not rated
A fast deduplication engine

description

download

specifications

changelog

Duke is a small, free, easy to use, fast and flexible deduplication (entity resolution or record linkage) engine written in Java on top of Lucene.

At the moment Duke can process 1,000,000 records in 11 minutes on a standard laptop in a single thread.

Duke can be used to find duplicate records inside a single table/data source, or it can be used to find records in different tables/sources which most likely represent the same real-world entity.

Duke is written in the Java programming language and it can be used on Mac OS X, Windows and Linux.
READ MORE   
Last updated on February 18th, 2014
1  
Duke - Duke will report its findings to the MatchListener, you can write your own MatchListeners, or use those which come with Duke.

top FREE alternatives

0 User reviews so far.

SUBMIT