Commit Graph

245 Commits

Author SHA1 Message Date
miconis 5676e625bd implementation of romansmatch and re-implementation of the getNumber function. New terms in the translation map and update of the configuration 2019-11-28 16:54:44 +01:00
miconis 493b385b5b addition of one term to the translation maps in the configurations 2019-11-27 15:48:37 +01:00
miconis c72f48fb33 minor change in the citymatch 2019-11-21 10:54:02 +01:00
miconis 40808200f0 the param map has been updated: now it accepts string parameters 2019-11-21 09:37:56 +01:00
miconis 79e62787cf jarowinklernormalizedname splitted in 3 different comparators: citymatch, keywordmatch and jarowinkler. Implementation of the TreeStatistic support functions 2019-11-20 10:45:00 +01:00
miconis 676e9c8e37 code cleaning and implementation of the TreeDedup + minor changes 2019-11-14 10:01:21 +01:00
miconis 5b3adb3e65 code cleaning, distribution of the classes in packages and implementation of the new configuration 2019-11-07 12:47:12 +01:00
miconis 3ff5be675b put the last modification of the master branch into the tree2. Addition of the configuration as parameter of the comparator. This is to allow the comparator to access it 2019-10-29 16:38:42 +01:00
miconis 8564fdd19c minor changes 2019-10-29 15:58:21 +01:00
miconis bc7dd4bfa2 [maven-release-plugin] prepare for next development iteration 2019-10-24 11:34:19 +02:00
miconis 098c5e2f64 [maven-release-plugin] prepare release dnet-dedup-3.0.15 2019-10-24 11:34:12 +02:00
miconis 8dba7a04f8 dependency-reduced-pom deleted 2019-10-24 11:28:20 +02:00
miconis 58f128d861 Revert "[maven-release-plugin] prepare release dnet-dedup-3.0.15". This reverts commit 452ab7892d. 2019-10-24 11:23:01 +02:00
miconis 452ab7892d [maven-release-plugin] prepare release dnet-dedup-3.0.15 2019-10-24 11:17:07 +02:00
miconis 4712fef82f release rollback 2019-10-24 11:11:07 +02:00
miconis 4874038f8e minor changes 2019-10-23 16:37:20 +02:00
miconis 2ffaa235a2 minor changes and configuration updates (synonym field added) 2019-10-23 16:31:45 +02:00
miconis 1cbb48f77b minor changes 2019-10-08 16:49:07 +02:00
miconis 7998f37ce1 normalization of the term in the translation map added 2019-10-08 15:13:45 +02:00
miconis 03c1b334d5 translation map moved in json configuration, support for synonyms added in the configuration, now the configuration is argument of conditions, distancealgos and clusteringfunctions 2019-10-08 14:53:52 +02:00
Claudio Atzori 42e3bff05f [maven-release-plugin] prepare for next development iteration 2019-09-25 10:39:46 +02:00
Claudio Atzori 259d502d70 [maven-release-plugin] prepare release dnet-dedup-3.0.14 2019-09-25 10:39:39 +02:00
Claudio Atzori fda7f1ce93 updated translation map and some tests 2019-09-25 10:15:13 +02:00
miconis 93b332cbe5 translation map updated 2019-09-25 09:53:06 +02:00
miconis 3a92456fd0 optimize imports 2019-08-09 15:42:41 +02:00
miconis 4bcf353a72 implementation of the conditions in tree nodes. get rid of the conditions part of the configuration 2019-08-09 15:41:49 +02:00
miconis 72b14ec36b implementation of the decision tree. It takes place of the distance algos, necessaryConditions and sufficientConditions are still there. The model contains only path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions. 2019-08-09 10:08:34 +02:00
miconis cb51e017aa code refactoring: useless module removed 2019-08-07 15:16:59 +02:00
miconis f0b4c4cbd4 addition of a fixSpecial function to address the problem with special character in organization names, addition of new terms in translation maps 2019-08-06 17:06:05 +02:00
miconis 85070ce3fe addition of the BlockUtils class for meta-blocking, implementation of a new local test with edge filtering example 2019-08-06 12:09:34 +02:00
miconis 2472f2b1e8 Merge branch 'master' of https://github.com/dnet-team/dnet-dedup 2019-07-19 17:10:53 +02:00
miconis 84974dcdfa restyling of the JaroWinklerNormalizedName comparator, now it is optimized. Addition of some translations in the translation maps, addition of a clustering based on keywords in organizations legalnames 2019-07-19 17:10:29 +02:00
Claudio Atzori 19468fa864 [maven-release-plugin] prepare for next development iteration 2019-07-08 11:12:52 +02:00
Claudio Atzori 953b78ab9b [maven-release-plugin] prepare release dnet-dedup-3.0.13 2019-07-08 11:12:45 +02:00
miconis d5d228aef3 Merge branch 'master' of https://github.com/dnet-team/dnet-dedup 2019-07-08 11:02:29 +02:00
miconis 0509ea8d1e bug fixing in the keywordsclustering class 2019-07-08 11:01:49 +02:00
Claudio Atzori ceaf19c83c [maven-release-plugin] prepare for next development iteration 2019-07-08 10:11:24 +02:00
Claudio Atzori 6314f896d1 [maven-release-plugin] prepare release dnet-dedup-3.0.12 2019-07-08 10:11:17 +02:00
miconis 8f5bc52ab2 [maven-release-plugin] rollback the release of dnet-dedup-3.0.12 2019-07-08 10:00:48 +02:00
miconis 813778d647 [maven-release-plugin] prepare for next development iteration 2019-07-08 09:48:10 +02:00
miconis b8fb3e46aa [maven-release-plugin] prepare release dnet-dedup-3.0.12 2019-07-08 09:48:03 +02:00
miconis 2b866cfbeb addition of doi normalization in PidMatch comparator, addition of keywordsclustering (clustering based on terms in the translation maps for the organizations), minor changes 2019-07-08 09:44:02 +02:00
Claudio Atzori 9f6fb0e030 [maven-release-plugin] prepare for next development iteration 2019-06-19 10:02:39 +02:00
Claudio Atzori 07d1b7df15 [maven-release-plugin] prepare release dnet-dedup-3.0.11 2019-06-19 10:02:32 +02:00
Claudio Atzori c7963d5afc optimized classpath resolvers 2019-06-19 10:01:35 +02:00
Claudio Atzori c9fc377712 [maven-release-plugin] prepare for next development iteration 2019-06-18 14:46:34 +02:00
Claudio Atzori e1ee2d40b3 [maven-release-plugin] prepare release dnet-dedup-3.0.10 2019-06-18 14:46:27 +02:00
Claudio Atzori cbec51e922 avoid to divide by zero: in case of missing values, return undefined response 2019-06-18 14:45:15 +02:00
Claudio Atzori 7063d286e0 cleanup 2019-06-18 14:44:42 +02:00
Claudio Atzori e6944249ca Merge branch 'master' of https://github.com/dnet-team/dnet-dedup 2019-06-18 14:06:41 +02:00