Commit Graph

57 Commits

Author SHA1 Message Date
miconis 9618e889bd test implementation for the new fdup version 2022-04-13 09:48:56 +02:00
miconis 661818da9e bug fix in test 2022-03-21 14:43:55 +01:00
miconis b2cbc09fda bug fix in the normalization of a legalname, city map updated and transliteration support added 2022-03-15 14:59:13 +01:00
miconis cb72ce0a22 bug fix in the AuthorMatch, implementation of the concat function in the model creation with jpath query 2022-03-09 12:53:09 +01:00
miconis 321cba82d0 minor change to remove local machine references 2022-02-02 10:42:11 +01:00
miconis de66199001 minor change 2022-01-13 17:20:20 +01:00
miconis e168d95ec0 bug fix in the authormatch comparator, implementation of tests 2022-01-13 11:58:28 +01:00
miconis 5e8757a457 implementation of new comparators for publication dedup configuration update 2021-12-27 17:35:02 +01:00
miconis 2c1488b91f implementation of the test to see dedup results 2021-10-22 11:21:09 +02:00
miconis fb314e3441 implementation of the graph visualization tool 2021-09-17 10:33:29 +02:00
miconis a9ee4dbe61 bug fix in graph drawer 2021-09-14 10:02:14 +02:00
miconis 5a52aed8e1 dedup test implementation & graph drawing tools 2021-09-13 14:53:19 +02:00
miconis e65526848a implementation of the wf to dedup entities, addition of the module to run the wf on the cluster 2020-12-04 15:41:31 +01:00
miconis 5021e5048f fixed error in the treeprocessor. it used th=-1 as default value, now it use th=1 2020-09-29 12:01:25 +02:00
Claudio Atzori c4890b2e92 ignoring local test 2020-07-15 17:55:44 +02:00
Claudio Atzori 055adbb56d Revert "wordssuffixprefix: adjust the token length according to the number of words; removed maven release temporary files"
This reverts commit ecebdff026.
2020-07-15 17:35:56 +02:00
Claudio Atzori ecebdff026 wordssuffixprefix: adjust the token length according to the number of words; removed maven release temporary files 2020-07-15 17:13:45 +02:00
Claudio Atzori d86dfe4103 Revert "ignoring local test"
This reverts commit 74dece3e3e.
2020-07-15 17:05:55 +02:00
Claudio Atzori 74dece3e3e ignoring local test 2020-07-15 17:03:48 +02:00
miconis b7a27ace62 clusteringtester removed in order to move it to dnet-dedup-openaire 2020-07-13 11:15:09 +02:00
miconis 12621b1c45 implementation of a class to test the clustering functions 2020-07-12 10:13:54 +02:00
miconis 33eadb7c9c implemented new function for clustering 2020-07-02 17:04:17 +02:00
miconis b3ec4194da implementation of the test for the dedup and addition of new support classes 2020-06-11 10:46:46 +02:00
miconis eeeb374480 minor changes in comparators 2020-01-24 10:01:11 +01:00
miconis cc86591fad minor changes 2020-01-20 16:45:16 +01:00
miconis 6a27fb14a8 update in the implementation of the tree: addition of new logic aggregations and statistics 2020-01-14 11:42:43 +02:00
miconis 72ca3bb9ba implementation of new aggregation in the tree node processing 2019-12-18 16:19:36 +01:00
miconis 4af490221b implementation of new aggregation in the tree node processing 2019-12-18 16:19:26 +01:00
miconis 159cb2a493 implementation of new json comparator and update of the publication configuration 2019-12-17 09:16:26 +01:00
Sandro La Bruzzo d09193a094 merged JqMapping branch into tree2 2019-12-13 11:30:02 +01:00
miconis 7cd63a37cf update of the strict configuration with new terms 2019-11-29 14:13:46 +01:00
miconis 5676e625bd implementation of romansmatch and re-implementation of the getNumber function. New terms in the translation map and update of the configuration 2019-11-28 16:54:44 +01:00
miconis 493b385b5b addition of one term to the translation maps in the configurations 2019-11-27 15:48:37 +01:00
miconis 40808200f0 the param map has been updated: now it accepts string parameters 2019-11-21 09:37:56 +01:00
miconis 79e62787cf jarowinklernormalizedname splitted in 3 different comparators: citymatch, keywordmatch and jarowinkler. Implementation of the TreeStatistic support functions 2019-11-20 10:45:00 +01:00
miconis 676e9c8e37 code cleaning and implementation of the TreeDedup + minor changes 2019-11-14 10:01:21 +01:00
miconis 5b3adb3e65 code cleaning, distribution of the classes in packages and implementation of the new configuration 2019-11-07 12:47:12 +01:00
miconis 3ff5be675b put the last modification of the master branch into the tree2. Addition of the configuration as parameter of the comparator. This is to allow the comparator to access it 2019-10-29 16:38:42 +01:00
miconis 2ffaa235a2 minor changes and configuration updates (synonym field added) 2019-10-23 16:31:45 +02:00
miconis 4bcf353a72 implementation of the conditions in tree nodes. get rid of the conditions part of the configuration 2019-08-09 15:41:49 +02:00
miconis 72b14ec36b implementation of the decision tree. It takes place of the distance algos, necessaryConditions and sufficientConditions are still there. The model contains only path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions. 2019-08-09 10:08:34 +02:00
miconis 85070ce3fe addition of the BlockUtils class for meta-blocking, implementation of a new local test with edge filtering example 2019-08-06 12:09:34 +02:00
miconis a5526f6254 implementation of the integration test, addition of document blocks to group entities after clustering 2019-05-21 16:38:26 +02:00
miconis 3018031621 branch cities merged into master 2019-04-03 12:22:33 +02:00
miconis f738c2b641 addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator 2019-04-03 09:40:14 +02:00
miconis e9894ed089 minor changes 2019-03-26 15:48:21 +01:00
miconis 1dbb765343 minor changes 2019-03-26 15:40:40 +01:00
Michele De Bonis f87790f701 update of the comparator for legalnames of organizations 2019-03-21 14:27:27 +01:00
Michele De Bonis b02aa08833 implementation of the test classes and minor changes 2019-02-08 12:56:47 +01:00
Claudio Atzori e5a77f0a53 added new properties to FieldDef (size, length) to limit the information mapped onto each MapDocument 2018-11-19 17:37:57 +01:00