Author | Commit | Message | Date
Claudio Atzori | 74c6462b49 | updated translation map and some tests | 2019-09-25 10:15:13 +02:00
miconis | aed81e4cfa | translation map updated | 2019-09-25 09:53:06 +02:00
miconis | afd2b398d5 | optimize imports | 2019-08-09 15:42:41 +02:00
miconis | d71dae5fd2 | implementation of the conditions in tree nodes; got rid of the conditions part of the configuration | 2019-08-09 15:41:49 +02:00
miconis | a5c5d2f01b | implementation of the decision tree. It takes the place of the distance algos; necessaryConditions and sufficientConditions are still there. The model contains only path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions. | 2019-08-09 10:08:34 +02:00
miconis | f2136e1024 | code refactoring: useless module removed | 2019-08-07 15:16:59 +02:00
miconis | 8c867101ef | addition of a fixSpecial function to address the problem with special characters in organization names, addition of new terms in translation maps | 2019-08-06 17:06:05 +02:00
miconis | 4502b44337 | addition of the BlockUtils class for meta-blocking, implementation of a new local test with edge filtering example | 2019-08-06 12:09:34 +02:00
miconis | cffb712a99 | Merge branch 'master' of https://github.com/dnet-team/dnet-dedup | 2019-07-19 17:10:53 +02:00
miconis | a85576c27e | restyling of the JaroWinklerNormalizedName comparator, which is now optimized. Addition of some translations in the translation maps, addition of a clustering based on keywords in organizations' legalnames | 2019-07-19 17:10:29 +02:00
Claudio Atzori | 6cb846331a | [maven-release-plugin] prepare for next development iteration | 2019-07-08 11:12:52 +02:00
Claudio Atzori | c04d2232c2 | [maven-release-plugin] prepare release dnet-dedup-3.0.13 | 2019-07-08 11:12:45 +02:00
miconis | fb5e38db26 | Merge branch 'master' of https://github.com/dnet-team/dnet-dedup | 2019-07-08 11:02:29 +02:00
miconis | 3c6f8d1e44 | bug fixing in the keywordsclustering class | 2019-07-08 11:01:49 +02:00
Claudio Atzori | a69022617d | [maven-release-plugin] prepare for next development iteration | 2019-07-08 10:11:24 +02:00
Claudio Atzori | c6baeb93d4 | [maven-release-plugin] prepare release dnet-dedup-3.0.12 | 2019-07-08 10:11:17 +02:00
miconis | f5de20a508 | [maven-release-plugin] rollback the release of dnet-dedup-3.0.12 | 2019-07-08 10:00:48 +02:00
miconis | ba50aa8654 | [maven-release-plugin] prepare for next development iteration | 2019-07-08 09:48:10 +02:00
miconis | 7065110a21 | [maven-release-plugin] prepare release dnet-dedup-3.0.12 | 2019-07-08 09:48:03 +02:00
miconis | 15bec5e876 | addition of doi normalization in PidMatch comparator, addition of keywordsclustering (clustering based on terms in the translation maps for the organizations), minor changes | 2019-07-08 09:44:02 +02:00
Claudio Atzori | 2dcffb965f | [maven-release-plugin] prepare for next development iteration | 2019-06-19 10:02:39 +02:00
Claudio Atzori | 85126c59f7 | [maven-release-plugin] prepare release dnet-dedup-3.0.11 | 2019-06-19 10:02:32 +02:00
Claudio Atzori | 15d7b584f3 | optimized classpath resolvers | 2019-06-19 10:01:35 +02:00
Claudio Atzori | ff4956def9 | [maven-release-plugin] prepare for next development iteration | 2019-06-18 14:46:34 +02:00
Claudio Atzori | eb5ce312a3 | [maven-release-plugin] prepare release dnet-dedup-3.0.10 | 2019-06-18 14:46:27 +02:00
Claudio Atzori | f2bc665403 | avoid dividing by zero: in case of missing values, return undefined response | 2019-06-18 14:45:15 +02:00
Claudio Atzori | e3f86b92c8 | cleanup | 2019-06-18 14:44:42 +02:00
miconis | 54e4d0af04 | exact match condition gives undefined if a field is missing; ignoremissing semantics changed: if true, the comparison is now performed in any case; if false, it gives -1 when a field is missing | 2019-06-18 14:05:31 +02:00
miconis | e8db8f2abb | implementation of the integration test, addition of document blocks to group entities after clustering | 2019-05-21 16:38:26 +02:00
Sandro La Bruzzo | 53ec9bccca | changed the implementation of RabbitMQ communication | 2019-04-16 12:28:01 +02:00
Sandro La Bruzzo | 403c13eebf | Implemented message manager, fixed bug on collection worker, implemented Collection and Transform spark job | 2019-04-11 15:39:29 +02:00
enricoottonello | 58c4e1f725 | cleaned properties | 2019-04-09 12:17:03 +02:00
Sandro La Bruzzo | 9294851a6c | implemented communication layer using RabbitMQ between oozie node and Dnet | 2019-04-05 12:19:25 +02:00
Sandro La Bruzzo | 3f4ba71bbd | resolved conflicts | 2019-04-03 16:12:57 +02:00
Sandro La Bruzzo | ded6aef5e1 | moved collector worker | 2019-04-03 16:05:16 +02:00
enricoottonello | 2f79eb930a | added apidescriptor | 2019-04-03 16:03:44 +02:00
Sandro La Bruzzo | c2ecbf5572 | moved collector worker | 2019-04-03 16:03:36 +02:00
Claudio Atzori | f7a3bdf3f8 | [maven-release-plugin] prepare for next development iteration | 2019-04-03 12:35:00 +02:00
Claudio Atzori | 98c179c8fb | [maven-release-plugin] prepare release dnet-dedup-3.0.9 | 2019-04-03 12:34:52 +02:00
miconis | 3e61a90c8f | [maven-release-plugin] rollback the release of dnet-dedup-3.0.9 | 2019-04-03 12:27:28 +02:00
miconis | 15fb9eb883 | [maven-release-plugin] prepare for next development iteration | 2019-04-03 12:26:05 +02:00
miconis | a1ff4daa7f | [maven-release-plugin] prepare release dnet-dedup-3.0.9 | 2019-04-03 12:25:56 +02:00
miconis | 1d29bae47c | branch cities merged into master | 2019-04-03 12:22:33 +02:00
enricoottonello | b316467608 | added common module | 2019-04-03 10:53:54 +02:00
miconis | 7e7018c51f | addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator | 2019-04-03 09:40:14 +02:00
Michele Artini | 1a1ec7da8e | Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop.git | 2019-04-01 10:40:59 +02:00
Michele Artini | f8c8b6669f | ui | 2019-04-01 10:40:09 +02:00
miconis | 4bd5a9beee | minor changes | 2019-03-26 15:48:21 +01:00
Sandro La Bruzzo | 0b503949de | Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop | 2019-03-25 15:19:08 +01:00
Sandro La Bruzzo | b2cd76831e | added common cli | 2019-03-25 15:18:48 +01:00