Commit Graph

245 Commits

Author SHA1 Message Date
miconis e7d170d0eb exact match condition gives undefined if a field is missing, ignoremissing semantics changed: now performs the comparison in any case if =true, if false gives -1 in case of missing 2019-06-18 14:05:31 +02:00
miconis a5526f6254 implementation of the integration test, addition of document blocks to group entities after clustering 2019-05-21 16:38:26 +02:00
Claudio Atzori 6dcbfd9755 added more ignores 2019-04-03 17:43:55 +02:00
Claudio Atzori 3dfbf5fab7 [maven-release-plugin] prepare for next development iteration 2019-04-03 12:35:00 +02:00
Claudio Atzori 6837b59c6e [maven-release-plugin] prepare release dnet-dedup-3.0.9 2019-04-03 12:34:52 +02:00
miconis d4c5e293a6 [maven-release-plugin] rollback the release of dnet-dedup-3.0.9 2019-04-03 12:27:28 +02:00
miconis 4f4713c6aa [maven-release-plugin] prepare for next development iteration 2019-04-03 12:26:05 +02:00
miconis bb072cec20 [maven-release-plugin] prepare release dnet-dedup-3.0.9 2019-04-03 12:25:56 +02:00
miconis 3018031621 branch cities merged into master 2019-04-03 12:22:33 +02:00
miconis 14c3afba23 clean up 2019-04-03 11:35:25 +02:00
miconis f738c2b641 addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator 2019-04-03 09:40:14 +02:00
miconis e9894ed089 minor changes 2019-03-26 15:48:21 +01:00
miconis 1dbb765343 minor changes 2019-03-26 15:40:40 +01:00
Michele De Bonis f87790f701 update of the comparator for legalnames of organizations 2019-03-21 14:27:27 +01:00
Claudio Atzori 14a07ff400 [maven-release-plugin] prepare for next development iteration 2019-02-18 09:09:14 +01:00
Claudio Atzori d722368780 [maven-release-plugin] prepare release dnet-dedup-3.0.8 2019-02-18 09:09:07 +01:00
Claudio Atzori 27eeeec1f3 default configuration includes configurationId 2019-02-18 09:07:23 +01:00
Claudio Atzori 63e1607d5c [maven-release-plugin] prepare for next development iteration 2019-02-17 12:56:19 +01:00
Claudio Atzori 1b8d257036 [maven-release-plugin] prepare release dnet-dedup-3.0.7 2019-02-17 12:56:11 +01:00
Claudio Atzori cabc2d21c2 replace existing attributes when loading default configuration 2019-02-17 12:48:25 +01:00
Michele De Bonis b02aa08833 implementation of the test classes and minor changes 2019-02-08 12:56:47 +01:00
Michele De Bonis babf67663b modification of the README 2018-12-20 11:05:08 +01:00
Michele De Bonis d9372745f2 modification of the README 2018-12-20 10:59:22 +01:00
Michele De Bonis f91220980a modification of the README 2018-12-20 10:57:17 +01:00
Michele De Bonis 0e3ce0100c modification of the README 2018-12-20 10:53:32 +01:00
Michele De Bonis 07315ed492 modification of the README 2018-12-20 10:51:53 +01:00
Michele De Bonis 2be03ecce9 modification of the README 2018-12-20 10:50:05 +01:00
Michele De Bonis 6e9bf11e2d modification of the README 2018-12-20 10:47:56 +01:00
Michele De Bonis 9ff83d6567 implementation of the decision tree for the deduplication of the authors, implementation of multiple comparators to be used in a tree node and definition of the proto for person entity 2018-12-20 09:54:41 +01:00
Michele De Bonis 0bd20c565a implementation of the decisional tree, addition of the dnet-openaire-data-protos module, definition of the person proto, blockprocessor and paceconfig modified with addition of support for the tree processing 2018-12-12 16:30:03 +01:00
Claudio Atzori d72960f8b9 apply limits (length, size) to pace Fields 2018-11-20 10:51:38 +01:00
Claudio Atzori 1ff5be3f04 [maven-release-plugin] prepare for next development iteration 2018-11-19 17:41:45 +01:00
Claudio Atzori 31b228d38b [maven-release-plugin] prepare release dnet-dedup-3.0.6 2018-11-19 17:41:37 +01:00
Claudio Atzori 75c3daf38c using released mapping-utils module 2018-11-19 17:39:28 +01:00
Claudio Atzori e5a77f0a53 added new properties to FieldDef (size, length) to limit the information mapped onto each MapDocument 2018-11-19 17:37:57 +01:00
Claudio Atzori db37cce4a4 [maven-release-plugin] prepare for next development iteration 2018-11-17 09:13:16 +01:00
Claudio Atzori 4deac3f1f3 [maven-release-plugin] prepare release dnet-dedup-3.0.5 2018-11-17 09:13:09 +01:00
Claudio Atzori a0e0df1cfd added distance function fot software titles 2018-11-17 09:11:38 +01:00
Michele De Bonis 23c5a16525 addition of cities check 2018-11-16 16:11:03 +01:00
Claudio Atzori caf5ead565 [maven-release-plugin] prepare for next development iteration 2018-11-16 09:18:00 +01:00
Claudio Atzori 4d139bbc18 [maven-release-plugin] prepare release dnet-dedup-3.0.4 2018-11-16 09:17:53 +01:00
Claudio Atzori fa657a05e6 default (empty) configuration should be aligned with the updated model 2018-11-15 16:52:56 +01:00
Claudio Atzori e4ae7d426a less verbose logging 2018-11-13 09:07:45 +01:00
Claudio Atzori 9a14b0ecbc propagate exceptions in case of serialization errors, removed configuration pretty printing, removed unused class ScoredResult 2018-11-12 15:52:18 +01:00
Claudio Atzori 71fe456a62 [maven-release-plugin] prepare for next development iteration 2018-11-12 14:23:36 +01:00
Claudio Atzori 690bfcef1e [maven-release-plugin] prepare release dnet-dedup-3.0.3 2018-11-12 14:23:29 +01:00
Claudio Atzori 4a5f13c8f5 added more ignores 2018-11-12 14:22:19 +01:00
Michele De Bonis 3a517a6551 Merge branch 'master' of https://github.com/dnet-team/dnet-dedup 2018-11-12 14:11:26 +01:00
Michele De Bonis 33387a3532 configuration file updated, addition of condition on domain 2018-11-12 14:11:15 +01:00
Claudio Atzori 1f9b908d6c [maven-release-plugin] prepare for next development iteration 2018-11-12 12:46:50 +01:00