Claudio Atzori
|
aa7459973e
|
[maven-release-plugin] rollback the release of dnet-dedup-4.0.3
|
2020-07-15 17:24:43 +02:00 |
Claudio Atzori
|
af32d604da
|
[maven-release-plugin] prepare release dnet-dedup-4.0.3
|
2020-07-15 17:24:25 +02:00 |
Claudio Atzori
|
3de4f571f4
|
reverted to 4.0.3-SNAPSHOT
|
2020-07-15 17:20:12 +02:00 |
Claudio Atzori
|
ecebdff026
|
wordssuffixprefix: adjust the token length according to the number of words; removed maven release temporary files
|
2020-07-15 17:13:45 +02:00 |
Claudio Atzori
|
d86dfe4103
|
Revert "ignoring local test"
This reverts commit 74dece3e3e .
|
2020-07-15 17:05:55 +02:00 |
Claudio Atzori
|
70b53dcd1d
|
[maven-release-plugin] prepare release dnet-dedup-4.0.3
|
2020-07-15 17:04:20 +02:00 |
Claudio Atzori
|
74dece3e3e
|
ignoring local test
|
2020-07-15 17:03:48 +02:00 |
miconis
|
b7a27ace62
|
clusteringtester removed in order to move it to dnet-dedup-openaire
|
2020-07-13 11:15:09 +02:00 |
miconis
|
12621b1c45
|
implementation of a class to test the clustering functions
|
2020-07-12 10:13:54 +02:00 |
miconis
|
3d047d300d
|
[maven-release-plugin] prepare for next development iteration
|
2020-07-02 17:06:38 +02:00 |
miconis
|
73cdf7e5c2
|
[maven-release-plugin] prepare release dnet-dedup-4.0.2
|
2020-07-02 17:06:36 +02:00 |
miconis
|
33eadb7c9c
|
implemented new function for clustering
|
2020-07-02 17:04:17 +02:00 |
miconis
|
b3ec4194da
|
implementation of the test for the dedup and addition of new support classes
|
2020-06-11 10:46:46 +02:00 |
miconis
|
aa4d03cfa3
|
[maven-release-plugin] prepare for next development iteration
|
2020-04-24 14:39:01 +02:00 |
miconis
|
96cb51e7e1
|
[maven-release-plugin] prepare release dnet-dedup-4.0.1
|
2020-04-24 14:38:58 +02:00 |
miconis
|
a73bc6cddc
|
minor changes
|
2020-03-20 18:02:52 +01:00 |
Sandro La Bruzzo
|
0ba1f6f96a
|
[maven-release-plugin] prepare for next development iteration
|
2020-02-10 12:51:04 +01:00 |
Sandro La Bruzzo
|
a24fcdecbd
|
[maven-release-plugin] prepare release dnet-dedup-4.0.0
|
2020-02-10 12:50:45 +01:00 |
miconis
|
eeeb374480
|
minor changes in comparators
|
2020-01-24 10:01:11 +01:00 |
miconis
|
cc86591fad
|
minor changes
|
2020-01-20 16:45:16 +01:00 |
miconis
|
6a27fb14a8
|
update in the implementation of the tree: addition of new logic aggregations and statistics
|
2020-01-14 11:42:43 +02:00 |
miconis
|
72ca3bb9ba
|
implementation of new aggregation in the tree node processing
|
2019-12-18 16:19:36 +01:00 |
miconis
|
4af490221b
|
implementation of new aggregation in the tree node processing
|
2019-12-18 16:19:26 +01:00 |
miconis
|
159cb2a493
|
implementation of new json comparator and update of the publication configuration
|
2019-12-17 09:16:26 +01:00 |
Sandro La Bruzzo
|
d09193a094
|
merged JqMapping branch into tree2
|
2019-12-13 11:30:02 +01:00 |
Sandro La Bruzzo
|
bd79999fb8
|
Improved deduplication
|
2019-12-05 14:14:25 +01:00 |
miconis
|
7cd63a37cf
|
update of the strict configuration with new terms
|
2019-11-29 14:13:46 +01:00 |
miconis
|
5676e625bd
|
implementation of romansmatch and re-implementation of the getNumber function. New terms in the translation map and update of the configuration
|
2019-11-28 16:54:44 +01:00 |
miconis
|
493b385b5b
|
addition of one term to the translation maps in the configurations
|
2019-11-27 15:48:37 +01:00 |
miconis
|
40808200f0
|
the param map has been updated: now it accepts string parameters
|
2019-11-21 09:37:56 +01:00 |
miconis
|
79e62787cf
|
jarowinklernormalizedname splitted in 3 different comparators: citymatch, keywordmatch and jarowinkler. Implementation of the TreeStatistic support functions
|
2019-11-20 10:45:00 +01:00 |
miconis
|
676e9c8e37
|
code cleaning and implementation of the TreeDedup + minor changes
|
2019-11-14 10:01:21 +01:00 |
miconis
|
5b3adb3e65
|
code cleaning, distribution of the classes in packages and implementation of the new configuration
|
2019-11-07 12:47:12 +01:00 |
miconis
|
3ff5be675b
|
put the last modification of the master branch into the tree2. Addition of the configuration as parameter of the comparator. This is to allow the comparator to access it
|
2019-10-29 16:38:42 +01:00 |
miconis
|
bc7dd4bfa2
|
[maven-release-plugin] prepare for next development iteration
|
2019-10-24 11:34:19 +02:00 |
miconis
|
098c5e2f64
|
[maven-release-plugin] prepare release dnet-dedup-3.0.15
|
2019-10-24 11:34:12 +02:00 |
miconis
|
8dba7a04f8
|
dependency-reduced-pom deleted
|
2019-10-24 11:28:20 +02:00 |
miconis
|
58f128d861
|
Revert "[maven-release-plugin] prepare release dnet-dedup-3.0.15"
This reverts commit 452ab7892d .
|
2019-10-24 11:23:01 +02:00 |
miconis
|
452ab7892d
|
[maven-release-plugin] prepare release dnet-dedup-3.0.15
|
2019-10-24 11:17:07 +02:00 |
miconis
|
4712fef82f
|
release rollback
|
2019-10-24 11:11:07 +02:00 |
miconis
|
4874038f8e
|
minor changes
|
2019-10-23 16:37:20 +02:00 |
miconis
|
2ffaa235a2
|
minor changes and configuration updates (synonym field added)
|
2019-10-23 16:31:45 +02:00 |
miconis
|
03c1b334d5
|
translation map moved in json configuration, support for synonyms added in the configuration, now the configuration is argument of conditions, distancealgos and clusteringfunctions
|
2019-10-08 14:53:52 +02:00 |
Claudio Atzori
|
42e3bff05f
|
[maven-release-plugin] prepare for next development iteration
|
2019-09-25 10:39:46 +02:00 |
Claudio Atzori
|
259d502d70
|
[maven-release-plugin] prepare release dnet-dedup-3.0.14
|
2019-09-25 10:39:39 +02:00 |
miconis
|
93b332cbe5
|
translation map updated
|
2019-09-25 09:53:06 +02:00 |
miconis
|
4bcf353a72
|
implementation of the conditions in tree nodes. get rid of the conditions part of the configuration
|
2019-08-09 15:41:49 +02:00 |
miconis
|
72b14ec36b
|
implementation of the decision tree. It takes place of the distance algos, necessaryConditions and sufficientConditions are still there. The model contains only path, type and name of the field. ignoreMissing is still in the model because it is used by the conditions.
|
2019-08-09 10:08:34 +02:00 |
miconis
|
f0b4c4cbd4
|
addition of a fixSpecial function to address the problem with special character in organization names, addition of new terms in translation maps
|
2019-08-06 17:06:05 +02:00 |
miconis
|
85070ce3fe
|
addition of the BlockUtils class for meta-blocking, implementation of a new local test with edge filtering example
|
2019-08-06 12:09:34 +02:00 |
miconis
|
2472f2b1e8
|
Merge branch 'master' of https://github.com/dnet-team/dnet-dedup
|
2019-07-19 17:10:53 +02:00 |
miconis
|
84974dcdfa
|
restyling of the JaroWinklerNormalizedName comparator, now it is optimized. Addition of some translations in the translation maps, addition of a clustering based on keywords in organizations legalnames
|
2019-07-19 17:10:29 +02:00 |
Claudio Atzori
|
19468fa864
|
[maven-release-plugin] prepare for next development iteration
|
2019-07-08 11:12:52 +02:00 |
Claudio Atzori
|
953b78ab9b
|
[maven-release-plugin] prepare release dnet-dedup-3.0.13
|
2019-07-08 11:12:45 +02:00 |
Claudio Atzori
|
ceaf19c83c
|
[maven-release-plugin] prepare for next development iteration
|
2019-07-08 10:11:24 +02:00 |
Claudio Atzori
|
6314f896d1
|
[maven-release-plugin] prepare release dnet-dedup-3.0.12
|
2019-07-08 10:11:17 +02:00 |
miconis
|
8f5bc52ab2
|
[maven-release-plugin] rollback the release of dnet-dedup-3.0.12
|
2019-07-08 10:00:48 +02:00 |
miconis
|
813778d647
|
[maven-release-plugin] prepare for next development iteration
|
2019-07-08 09:48:10 +02:00 |
miconis
|
b8fb3e46aa
|
[maven-release-plugin] prepare release dnet-dedup-3.0.12
|
2019-07-08 09:48:03 +02:00 |
miconis
|
2b866cfbeb
|
addition of doi normalization in PidMatch comparator, addition of keywordsclustering (clustering based on terms in the translation maps for the organizations), minor changes
|
2019-07-08 09:44:02 +02:00 |
Claudio Atzori
|
9f6fb0e030
|
[maven-release-plugin] prepare for next development iteration
|
2019-06-19 10:02:39 +02:00 |
Claudio Atzori
|
07d1b7df15
|
[maven-release-plugin] prepare release dnet-dedup-3.0.11
|
2019-06-19 10:02:32 +02:00 |
Claudio Atzori
|
c9fc377712
|
[maven-release-plugin] prepare for next development iteration
|
2019-06-18 14:46:34 +02:00 |
Claudio Atzori
|
e1ee2d40b3
|
[maven-release-plugin] prepare release dnet-dedup-3.0.10
|
2019-06-18 14:46:27 +02:00 |
miconis
|
e7d170d0eb
|
exact match condition gives undefined if a field is missing, ignoremissing semantics changed: now performs the comparison in any case if =true, if false gives -1 in case of missing
|
2019-06-18 14:05:31 +02:00 |
miconis
|
a5526f6254
|
implementation of the integration test, addition of document blocks to group entities after clustering
|
2019-05-21 16:38:26 +02:00 |
Claudio Atzori
|
3dfbf5fab7
|
[maven-release-plugin] prepare for next development iteration
|
2019-04-03 12:35:00 +02:00 |
Claudio Atzori
|
6837b59c6e
|
[maven-release-plugin] prepare release dnet-dedup-3.0.9
|
2019-04-03 12:34:52 +02:00 |
miconis
|
d4c5e293a6
|
[maven-release-plugin] rollback the release of dnet-dedup-3.0.9
|
2019-04-03 12:27:28 +02:00 |
miconis
|
4f4713c6aa
|
[maven-release-plugin] prepare for next development iteration
|
2019-04-03 12:26:05 +02:00 |
miconis
|
bb072cec20
|
[maven-release-plugin] prepare release dnet-dedup-3.0.9
|
2019-04-03 12:25:56 +02:00 |
miconis
|
3018031621
|
branch cities merged into master
|
2019-04-03 12:22:33 +02:00 |
miconis
|
14c3afba23
|
clean up
|
2019-04-03 11:35:25 +02:00 |
miconis
|
f738c2b641
|
addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator
|
2019-04-03 09:40:14 +02:00 |
miconis
|
e9894ed089
|
minor changes
|
2019-03-26 15:48:21 +01:00 |
miconis
|
1dbb765343
|
minor changes
|
2019-03-26 15:40:40 +01:00 |
Michele De Bonis
|
f87790f701
|
update of the comparator for legalnames of organizations
|
2019-03-21 14:27:27 +01:00 |
Claudio Atzori
|
14a07ff400
|
[maven-release-plugin] prepare for next development iteration
|
2019-02-18 09:09:14 +01:00 |
Claudio Atzori
|
d722368780
|
[maven-release-plugin] prepare release dnet-dedup-3.0.8
|
2019-02-18 09:09:07 +01:00 |
Claudio Atzori
|
63e1607d5c
|
[maven-release-plugin] prepare for next development iteration
|
2019-02-17 12:56:19 +01:00 |
Claudio Atzori
|
1b8d257036
|
[maven-release-plugin] prepare release dnet-dedup-3.0.7
|
2019-02-17 12:56:11 +01:00 |
Michele De Bonis
|
b02aa08833
|
implementation of the test classes and minor changes
|
2019-02-08 12:56:47 +01:00 |
Michele De Bonis
|
9ff83d6567
|
implementation of the decision tree for the deduplication of the authors, implementation of multiple comparators to be used in a tree node and definition of the proto for person entity
|
2018-12-20 09:54:41 +01:00 |
Michele De Bonis
|
0bd20c565a
|
implementation of the decisional tree, addition of the dnet-openaire-data-protos module, definition of the person proto, blockprocessor and paceconfig modified with addition of support for the tree processing
|
2018-12-12 16:30:03 +01:00 |
Claudio Atzori
|
d72960f8b9
|
apply limits (length, size) to pace Fields
|
2018-11-20 10:51:38 +01:00 |
Claudio Atzori
|
1ff5be3f04
|
[maven-release-plugin] prepare for next development iteration
|
2018-11-19 17:41:45 +01:00 |
Claudio Atzori
|
31b228d38b
|
[maven-release-plugin] prepare release dnet-dedup-3.0.6
|
2018-11-19 17:41:37 +01:00 |
Claudio Atzori
|
e5a77f0a53
|
added new properties to FieldDef (size, length) to limit the information mapped onto each MapDocument
|
2018-11-19 17:37:57 +01:00 |
Claudio Atzori
|
db37cce4a4
|
[maven-release-plugin] prepare for next development iteration
|
2018-11-17 09:13:16 +01:00 |
Claudio Atzori
|
4deac3f1f3
|
[maven-release-plugin] prepare release dnet-dedup-3.0.5
|
2018-11-17 09:13:09 +01:00 |
Michele De Bonis
|
23c5a16525
|
addition of cities check
|
2018-11-16 16:11:03 +01:00 |
Claudio Atzori
|
caf5ead565
|
[maven-release-plugin] prepare for next development iteration
|
2018-11-16 09:18:00 +01:00 |
Claudio Atzori
|
4d139bbc18
|
[maven-release-plugin] prepare release dnet-dedup-3.0.4
|
2018-11-16 09:17:53 +01:00 |
Claudio Atzori
|
71fe456a62
|
[maven-release-plugin] prepare for next development iteration
|
2018-11-12 14:23:36 +01:00 |
Claudio Atzori
|
690bfcef1e
|
[maven-release-plugin] prepare release dnet-dedup-3.0.3
|
2018-11-12 14:23:29 +01:00 |
Michele De Bonis
|
3a517a6551
|
Merge branch 'master' of https://github.com/dnet-team/dnet-dedup
|
2018-11-12 14:11:26 +01:00 |
Michele De Bonis
|
33387a3532
|
configuration file updated, addition of condition on domain
|
2018-11-12 14:11:15 +01:00 |
Claudio Atzori
|
1f9b908d6c
|
[maven-release-plugin] prepare for next development iteration
|
2018-11-12 12:46:50 +01:00 |
Claudio Atzori
|
99379e2505
|
[maven-release-plugin] prepare release dnet-dedup-3.0.2
|
2018-11-12 12:46:42 +01:00 |
Claudio Atzori
|
c7d6b1a41a
|
[maven-release-plugin] prepare for next development iteration
|
2018-11-12 11:40:42 +01:00 |