Commit Graph

26 Commits

Author SHA1 Message Date
miconis b2cbc09fda bug fix in the normalization of a legalname, city map updated and transliteration support added 2022-03-15 14:59:13 +01:00
miconis fad803bd46 implementation of cross comparison for different fields, addition of clustering mechanism to collapse keys from different clustering functions on the same cluster 2021-05-03 15:37:41 +02:00
miconis 7188648bdc implementation of the clustering function for the suffixprefix chain 2020-07-16 18:57:55 +02:00
Claudio Atzori 20848c1c6e reverted to 4.0.3-SNAPSHOT 2020-07-15 17:37:36 +02:00
Claudio Atzori 055adbb56d Revert "wordssuffixprefix: adjust the token length according to the number of words; removed maven release temporary files"
This reverts commit ecebdff026.
2020-07-15 17:35:56 +02:00
Claudio Atzori ecebdff026 wordssuffixprefix: adjust the token length according to the number of words; removed maven release temporary files 2020-07-15 17:13:45 +02:00
Claudio Atzori 1262f3dd8e Revert "wordssuffixprefix: adjust the token length according to the number of words; removed maven release temporary files"
This reverts commit b46be9c8ae.
2020-07-15 17:11:46 +02:00
Claudio Atzori b46be9c8ae wordssuffixprefix: adjust the token length according to the number of words; removed maven release temporary files 2020-07-15 16:49:47 +02:00
miconis 33eadb7c9c implemented new function for clustering 2020-07-02 17:04:17 +02:00
Sandro La Bruzzo b6c4f4acf3 upgraded maven version of commons-lang 2020-02-10 12:38:40 +01:00
miconis 79e62787cf jarowinklernormalizedname splitted in 3 different comparators: citymatch, keywordmatch and jarowinkler. Implementation of the TreeStatistic support functions 2019-11-20 10:45:00 +01:00
miconis 1cbb48f77b minor changes 2019-10-08 16:49:07 +02:00
miconis 03c1b334d5 translation map moved in json configuration, support for synonyms added in the configuration, now the configuration is argument of conditions, distancealgos and clusteringfunctions 2019-10-08 14:53:52 +02:00
miconis f0b4c4cbd4 addition of a fixSpecial function to address the problem with special character in organization names, addition of new terms in translation maps 2019-08-06 17:06:05 +02:00
miconis 84974dcdfa restyling of the JaroWinklerNormalizedName comparator, now it is optimized. Addition of some translations in the translation maps, addition of a clustering based on keywords in organizations legalnames 2019-07-19 17:10:29 +02:00
miconis 0509ea8d1e bug fixing in the keywordsclustering class 2019-07-08 11:01:49 +02:00
miconis 2b866cfbeb addition of doi normalization in PidMatch comparator, addition of keywordsclustering (clustering based on terms in the translation maps for the organizations), minor changes 2019-07-08 09:44:02 +02:00
miconis a5526f6254 implementation of the integration test, addition of document blocks to group entities after clustering 2019-05-21 16:38:26 +02:00
Michele De Bonis f87790f701 update of the comparator for legalnames of organizations 2019-03-21 14:27:27 +01:00
Michele De Bonis 0bd20c565a implementation of the decisional tree, addition of the dnet-openaire-data-protos module, definition of the person proto, blockprocessor and paceconfig modified with addition of support for the tree processing 2018-12-12 16:30:03 +01:00
Michele De Bonis 7c59c3ebf0 serialization test added. useless getter methods ignored by json serialization 2018-10-29 16:16:11 +01:00
Michele De Bonis 0d03030694 DedupConf parsed using Jackson library 2018-10-29 11:13:55 +01:00
Michele De Bonis d059bf68b8 modification in the initialization of clustering functions, distance algos and conditions. 2018-10-25 15:15:40 +02:00
Michele De Bonis 1d678ddc9c update in the discovery of clustering, conditions and distance functions (annotated with custom annotations) 2018-10-24 12:09:41 +02:00
Sandro La Bruzzo d0edb7b773 Added First Implementation of Spark Test 2018-10-02 17:07:17 +02:00
Sandro La Bruzzo a043d0c716 added d-net pace core module and ignored target folder 2018-10-02 10:37:54 +02:00