miconis | 72b14ec36b | implementation of the decision tree. It takes the place of the distance algos; necessaryConditions and sufficientConditions are still there. The model contains only the path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions. | 2019-08-09 10:08:34 +02:00
Claudio Atzori | c7963d5afc | optimized classpath resolvers | 2019-06-19 10:01:35 +02:00
miconis | a5526f6254 | implementation of the integration test, addition of document blocks to group entities after clustering | 2019-05-21 16:38:26 +02:00
miconis | 3018031621 | branch cities merged into master | 2019-04-03 12:22:33 +02:00
miconis | f738c2b641 | addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator | 2019-04-03 09:40:14 +02:00
Claudio Atzori | e5a77f0a53 | added new properties to FieldDef (size, length) to limit the information mapped onto each MapDocument | 2018-11-19 17:37:57 +01:00
Claudio Atzori | 9a14b0ecbc | propagate exceptions in case of serialization errors, removed configuration pretty printing, removed unused class ScoredResult | 2018-11-12 15:52:18 +01:00
Michele De Bonis | 7c59c3ebf0 | serialization test added. useless getter methods ignored by json serialization | 2018-10-29 16:16:11 +01:00
Michele De Bonis | 0d03030694 | DedupConf parsed using Jackson library | 2018-10-29 11:13:55 +01:00
Michele De Bonis | d059bf68b8 | modification in the initialization of clustering functions, distance algos and conditions. | 2018-10-25 15:15:40 +02:00
Michele De Bonis | 1d678ddc9c | update in the discovery of clustering, conditions and distance functions (annotated with custom annotations) | 2018-10-24 12:09:41 +02:00
Claudio Atzori | bc4505e0e6 | revised PidMatch implementation, cleanup | 2018-10-20 08:38:19 +02:00
Sandro La Bruzzo | 67e5f9858b | Added FSpark Implementation of dedup | 2018-10-11 15:19:20 +02:00
Sandro La Bruzzo | a043d0c716 | added d-net pace core module and ignored target folder | 2018-10-02 10:37:54 +02:00