Commit Graph

5693 Commits

Author SHA1 Message Date
miconis 7e7018c51f addition of a sparktester test, implementation of 2 different classes for testing in dnet-dedup-test module, addition of new terms in the vocabulary and change in the implementation of the JaroWinklerNormalizedName comparator 2019-04-03 09:40:14 +02:00
Michele Artini 1a1ec7da8e Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop.git 2019-04-01 10:40:59 +02:00
Michele Artini f8c8b6669f ui 2019-04-01 10:40:09 +02:00
miconis 4bd5a9beee minor changes 2019-03-26 15:48:21 +01:00
Sandro La Bruzzo 0b503949de Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop 2019-03-25 15:19:08 +01:00
Sandro La Bruzzo b2cd76831e added common cli 2019-03-25 15:18:48 +01:00
Sandro La Bruzzo 12c65eab4c implemented command line 2019-03-25 15:18:31 +01:00
Michele Artini b1200b6f46 ui 2019-03-25 11:54:18 +01:00
Michele Artini 6f813fbc8e ui 2019-03-22 15:15:57 +01:00
Michele Artini e2f1013b3d logs and ui 2019-03-22 11:41:00 +01:00
Michele Artini 405f418dfc fix a conflict
Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop
2019-03-22 10:24:42 +01:00
Michele Artini 55aa7e1f26 new info 2019-03-22 10:16:47 +01:00
Enrico Ottonello 3b5f291df2 added angular client 2019-03-22 10:13:56 +01:00
Enrico Ottonello 639483090a added swagger comments to input parameters 2019-03-22 10:11:46 +01:00
Michele De Bonis 662448e584 update of the comparator for legalnames of organizations 2019-03-21 14:27:27 +01:00
Enrico Ottonello 042d477878 added versions support 2019-03-20 15:36:23 +01:00
Sandro La Bruzzo 859957d0fd Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop 2019-03-20 09:49:53 +01:00
Enrico Ottonello 46bc3e1f53 added view on mdstore and transaction tables 2019-03-19 13:34:40 +01:00
Enrico Ottonello cefcb38e62 added module dhp-applications 2019-03-18 16:02:23 +01:00
enrico ed484bf24e Merge branch 'master' of https://github.com/dnet-team/dnet-hadoop.git 2019-03-18 15:38:04 +01:00
Enrico Ottonello 126d89ed38 first implementation of dhp-mdstore-manager-app 2019-03-18 15:30:07 +01:00
Michele De Bonis 2af1d61d43 test 2019-03-18 11:36:10 +01:00
Sandro La Bruzzo 6156562893 Added test 2019-03-18 10:47:28 +01:00
Sandro La Bruzzo 49d8cc716e added oozie wf 2019-03-18 10:46:07 +01:00
Sandro La Bruzzo c0da3da4c4 addedd common modules for CollectionBuilder in spark 2019-03-18 10:45:30 +01:00
Sandro La Bruzzo e67d9ee1a9 added first implementation of dnet-workflows 2019-03-18 10:44:35 +01:00
Sandro La Bruzzo 37b84b6afa Added description of the module 2019-03-13 14:53:35 +01:00
luosolo 1eb0281b38 refactored structure of the project 2019-03-13 14:43:20 +01:00
luosolo c10770cd3e added module that allows to collect data into HDFS 2019-03-12 15:40:55 +01:00
Claudio Atzori f2394fcd9f [maven-release-plugin] prepare for next development iteration 2019-02-18 09:09:14 +01:00
Claudio Atzori 722431dde1 [maven-release-plugin] prepare release dnet-dedup-3.0.8 2019-02-18 09:09:07 +01:00
Claudio Atzori 470c4b0f20 default configuration includes configurationId 2019-02-18 09:07:23 +01:00
Claudio Atzori ccb7e83196 [maven-release-plugin] prepare for next development iteration 2019-02-17 12:56:19 +01:00
Claudio Atzori 7d8e62d4cc [maven-release-plugin] prepare release dnet-dedup-3.0.7 2019-02-17 12:56:11 +01:00
Claudio Atzori 968cd47436 replace existing attributes when loading default configuration 2019-02-17 12:48:25 +01:00
Michele De Bonis 0735f3a822 implementation of the test classes and minor changes 2019-02-08 12:56:47 +01:00
Michele De Bonis 7a8d28991f implementation of the decision tree for the deduplication of the authors, implementation of multiple comparators to be used in a tree node and definition of the proto for person entity 2018-12-20 09:54:41 +01:00
Michele De Bonis 39613dbbd6 implementation of the decisional tree, addition of the dnet-openaire-data-protos module, definition of the person proto, blockprocessor and paceconfig modified with addition of support for the tree processing 2018-12-12 16:30:03 +01:00
Claudio Atzori f1c68d8ba3 apply limits (length, size) to pace Fields 2018-11-20 10:51:38 +01:00
Claudio Atzori c5979ffe18 [maven-release-plugin] prepare for next development iteration 2018-11-19 17:41:45 +01:00
Claudio Atzori 9869dff1d2 [maven-release-plugin] prepare release dnet-dedup-3.0.6 2018-11-19 17:41:37 +01:00
Claudio Atzori c2d4cb3ba6 added new properties to FieldDef (size, length) to limit the information mapped onto each MapDocument 2018-11-19 17:37:57 +01:00
Claudio Atzori 394fcafd41 [maven-release-plugin] prepare for next development iteration 2018-11-17 09:13:16 +01:00
Claudio Atzori 397554130c [maven-release-plugin] prepare release dnet-dedup-3.0.5 2018-11-17 09:13:09 +01:00
Claudio Atzori 0dfb2ea600 added distance function fot software titles 2018-11-17 09:11:38 +01:00
Michele De Bonis 3d4372ced9 addition of cities check 2018-11-16 16:11:03 +01:00
Claudio Atzori 55a9b4f501 [maven-release-plugin] prepare for next development iteration 2018-11-16 09:18:00 +01:00
Claudio Atzori 35ab630493 [maven-release-plugin] prepare release dnet-dedup-3.0.4 2018-11-16 09:17:53 +01:00
Claudio Atzori 399e4bc80f default (empty) configuration should be aligned with the updated model 2018-11-15 16:52:56 +01:00
Claudio Atzori 59bab8dba4 less verbose logging 2018-11-13 09:07:45 +01:00