Sandro La Bruzzo
|
35008fdbf9
|
fix stuff
|
2019-12-06 15:28:30 +01:00 |
Sandro La Bruzzo
|
16c670a5d5
|
Improved deduplication
|
2019-12-05 14:14:25 +01:00 |
miconis
|
49f9beb4a8
|
implementation of romansmatch and re-implementation of the getNumber function. New terms in the translation map and update of the configuration
|
2019-11-28 16:54:44 +01:00 |
miconis
|
f791730330
|
addition of one term to the translation maps in the configurations
|
2019-11-27 15:48:37 +01:00 |
miconis
|
d2278fe358
|
minor change in the citymatch
|
2019-11-21 10:54:02 +01:00 |
miconis
|
8c0d346005
|
the param map has been updated: now it accepts string parameters
|
2019-11-21 09:37:56 +01:00 |
miconis
|
ddd40540aa
|
jarowinklernormalizedname split into 3 different comparators: citymatch, keywordmatch and jarowinkler. Implementation of the TreeStatistic support functions
|
2019-11-20 10:45:00 +01:00 |
miconis
|
c687956371
|
code cleaning and implementation of the TreeDedup + minor changes
|
2019-11-14 10:01:21 +01:00 |
miconis
|
0973899865
|
code cleaning, distribution of the classes in packages and implementation of the new configuration
|
2019-11-07 12:47:12 +01:00 |
miconis
|
30a873265f
|
put the last modification of the master branch into the tree2. Addition of the configuration as parameter of the comparator. This is to allow the comparator to access it
|
2019-10-29 16:38:42 +01:00 |
miconis
|
1beb776691
|
minor changes
|
2019-10-29 15:58:21 +01:00 |
miconis
|
075f741d28
|
[maven-release-plugin] prepare for next development iteration
|
2019-10-24 11:34:19 +02:00 |
miconis
|
ced4bcdd59
|
[maven-release-plugin] prepare release dnet-dedup-3.0.15
|
2019-10-24 11:34:12 +02:00 |
miconis
|
13f93e6055
|
Revert "[maven-release-plugin] prepare release dnet-dedup-3.0.15"
This reverts commit cf93515d94.
|
2019-10-24 11:23:01 +02:00 |
miconis
|
cf93515d94
|
[maven-release-plugin] prepare release dnet-dedup-3.0.15
|
2019-10-24 11:17:07 +02:00 |
miconis
|
285ec3ca17
|
release rollback
|
2019-10-24 11:11:07 +02:00 |
miconis
|
5f249fd56c
|
minor changes
|
2019-10-23 16:37:20 +02:00 |
miconis
|
c9863debfa
|
minor changes and configuration updates (synonym field added)
|
2019-10-23 16:31:45 +02:00 |
miconis
|
5499ca17c3
|
minor changes
|
2019-10-08 16:49:07 +02:00 |
miconis
|
50b7a12b3f
|
normalization of the term in the translation map added
|
2019-10-08 15:13:45 +02:00 |
miconis
|
26b383fea2
|
translation map moved in json configuration, support for synonyms added in the configuration, now the configuration is argument of conditions, distancealgos and clusteringfunctions
|
2019-10-08 14:53:52 +02:00 |
Claudio Atzori
|
07355d2811
|
[maven-release-plugin] prepare for next development iteration
|
2019-09-25 10:39:46 +02:00 |
Claudio Atzori
|
254eb46809
|
[maven-release-plugin] prepare release dnet-dedup-3.0.14
|
2019-09-25 10:39:39 +02:00 |
Claudio Atzori
|
74c6462b49
|
updated translation map and some tests
|
2019-09-25 10:15:13 +02:00 |
miconis
|
aed81e4cfa
|
translation map updated
|
2019-09-25 09:53:06 +02:00 |
miconis
|
afd2b398d5
|
optimize imports
|
2019-08-09 15:42:41 +02:00 |
miconis
|
d71dae5fd2
|
implementation of the conditions in tree nodes. get rid of the conditions part of the configuration
|
2019-08-09 15:41:49 +02:00 |
miconis
|
a5c5d2f01b
|
implementation of the decision tree. It takes the place of the distance algos; necessaryConditions and sufficientConditions are still there. The model contains only path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions.
|
2019-08-09 10:08:34 +02:00 |
miconis
|
f2136e1024
|
code refactoring: useless module removed
|
2019-08-07 15:16:59 +02:00 |
miconis
|
8c867101ef
|
addition of a fixSpecial function to address the problem with special characters in organization names, addition of new terms in translation maps
|
2019-08-06 17:06:05 +02:00 |
miconis
|
4502b44337
|
addition of the BlockUtils class for meta-blocking, implementation of a new local test with edge filtering example
|
2019-08-06 12:09:34 +02:00 |
miconis
|
cffb712a99
|
Merge branch 'master' of https://github.com/dnet-team/dnet-dedup
|
2019-07-19 17:10:53 +02:00 |
miconis
|
a85576c27e
|
restyling of the JaroWinklerNormalizedName comparator, now it is optimized. Addition of some translations in the translation maps, addition of a clustering based on keywords in organizations legalnames
|
2019-07-19 17:10:29 +02:00 |
Claudio Atzori
|
6cb846331a
|
[maven-release-plugin] prepare for next development iteration
|
2019-07-08 11:12:52 +02:00 |
Claudio Atzori
|
c04d2232c2
|
[maven-release-plugin] prepare release dnet-dedup-3.0.13
|
2019-07-08 11:12:45 +02:00 |
miconis
|
fb5e38db26
|
Merge branch 'master' of https://github.com/dnet-team/dnet-dedup
|
2019-07-08 11:02:29 +02:00 |
miconis
|
3c6f8d1e44
|
bug fixing in the keywordsclustering class
|
2019-07-08 11:01:49 +02:00 |
Claudio Atzori
|
a69022617d
|
[maven-release-plugin] prepare for next development iteration
|
2019-07-08 10:11:24 +02:00 |
Claudio Atzori
|
c6baeb93d4
|
[maven-release-plugin] prepare release dnet-dedup-3.0.12
|
2019-07-08 10:11:17 +02:00 |
miconis
|
f5de20a508
|
[maven-release-plugin] rollback the release of dnet-dedup-3.0.12
|
2019-07-08 10:00:48 +02:00 |
miconis
|
ba50aa8654
|
[maven-release-plugin] prepare for next development iteration
|
2019-07-08 09:48:10 +02:00 |
miconis
|
7065110a21
|
[maven-release-plugin] prepare release dnet-dedup-3.0.12
|
2019-07-08 09:48:03 +02:00 |
miconis
|
15bec5e876
|
addition of doi normalization in PidMatch comparator, addition of keywordsclustering (clustering based on terms in the translation maps for the organizations), minor changes
|
2019-07-08 09:44:02 +02:00 |
Claudio Atzori
|
2dcffb965f
|
[maven-release-plugin] prepare for next development iteration
|
2019-06-19 10:02:39 +02:00 |
Claudio Atzori
|
85126c59f7
|
[maven-release-plugin] prepare release dnet-dedup-3.0.11
|
2019-06-19 10:02:32 +02:00 |
Claudio Atzori
|
15d7b584f3
|
optimized classpath resolvers
|
2019-06-19 10:01:35 +02:00 |
Claudio Atzori
|
ff4956def9
|
[maven-release-plugin] prepare for next development iteration
|
2019-06-18 14:46:34 +02:00 |
Claudio Atzori
|
eb5ce312a3
|
[maven-release-plugin] prepare release dnet-dedup-3.0.10
|
2019-06-18 14:46:27 +02:00 |
Claudio Atzori
|
f2bc665403
|
avoid dividing by zero: in case of missing values, return an undefined response
|
2019-06-18 14:45:15 +02:00 |
Claudio Atzori
|
e3f86b92c8
|
cleanup
|
2019-06-18 14:44:42 +02:00 |