Commit Graph

14 Commits

Author SHA1 Message Date
miconis 72b14ec36b implementation of the decision tree. It takes place of the distance algos, necessaryConditions and sufficientConditions are still there. The model contains only path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions. 2019-08-09 10:08:34 +02:00
Claudio Atzori c7963d5afc optimized classpath resolvers 2019-06-19 10:01:35 +02:00
miconis a5526f6254 implementation of the integration test, addition of document blocks to group entities after clustering 2019-05-21 16:38:26 +02:00
miconis 3018031621 branch cities merged into master 2019-04-03 12:22:33 +02:00
miconis f738c2b641 addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator 2019-04-03 09:40:14 +02:00
Claudio Atzori e5a77f0a53 added new properties to FieldDef (size, length) to limit the information mapped onto each MapDocument 2018-11-19 17:37:57 +01:00
Claudio Atzori 9a14b0ecbc propagate exceptions in case of serialization errors, removed configuration pretty printing, removed unused class ScoredResult 2018-11-12 15:52:18 +01:00
Michele De Bonis 7c59c3ebf0 serialization test added. useless getter methods ignored by json serialization 2018-10-29 16:16:11 +01:00
Michele De Bonis 0d03030694 DedupConf parsed using Jackson library 2018-10-29 11:13:55 +01:00
Michele De Bonis d059bf68b8 modification in the initialization of clustering functions, distance algos and conditions. 2018-10-25 15:15:40 +02:00
Michele De Bonis 1d678ddc9c update in the discovery of clustering, conditions and distance functions (annotated with custom annotations) 2018-10-24 12:09:41 +02:00
Claudio Atzori bc4505e0e6 revised PidMatch implementation, cleanup 2018-10-20 08:38:19 +02:00
Sandro La Bruzzo 67e5f9858b Added FSpark Implementation of dedup 2018-10-11 15:19:20 +02:00
Sandro La Bruzzo a043d0c716 added d-net pace core module and ignored target folder 2018-10-02 10:37:54 +02:00