Commit Graph

1089 Commits

Author SHA1 Message Date
Miriam Baglioni 0d10e3bd22 modified the mapping to include the groups. Added step to workflow to send directly to the catalogue 2020-07-02 14:22:20 +02:00
Miriam Baglioni 2d380aea1d added logic to directly send records to catalogue 2020-07-02 11:12:14 +02:00
Miriam Baglioni 566a763175 - 2020-07-01 18:13:48 +02:00
Miriam Baglioni e71e857e48 removed test 2020-07-01 17:42:32 +02:00
Miriam Baglioni 9864bff488 mapping adaptations 2020-07-01 17:41:58 +02:00
Miriam Baglioni 42ee1ef284 Merge branch 'd4science' of code-repo.d4science.org:miriam.baglioni/dnet-hadoop into d4science 2020-06-30 14:09:04 +02:00
Alessia Bardi 474ae69df8 use the same name generation procedure of the mapping 2020-06-24 12:59:03 +02:00
Miriam Baglioni 952a4a4482 - 2020-06-24 10:59:58 +02:00
Miriam Baglioni 563378ce3f changed the mapping and added new resources for testing 2020-06-23 15:30:34 +02:00
Miriam Baglioni d6838e18e6 Merge branch 'd4science' of code-repo.d4science.org:miriam.baglioni/dnet-hadoop into d4science 2020-06-23 11:57:30 +02:00
Miriam Baglioni de62582c28 new test resource 2020-06-23 11:57:25 +02:00
Alessia Bardi 743d948d1c print logs 2020-06-23 11:52:19 +02:00
Miriam Baglioni a2aa3c5b67 Merge branch 'd4science' of code-repo.d4science.org:miriam.baglioni/dnet-hadoop into d4science 2020-06-23 11:36:39 +02:00
Alessia Bardi fcabee9242 last wrong assert fixed 2020-06-23 11:36:04 +02:00
Miriam Baglioni 2d9811ac4c Merge branch 'd4science' of code-repo.d4science.org:miriam.baglioni/dnet-hadoop into d4science 2020-06-23 11:31:16 +02:00
Alessia Bardi 71ef7d9e66 using proper assertions 2020-06-23 11:30:41 +02:00
Miriam Baglioni 60a3206de5 fixed a typo in the name of a filed 2020-06-23 11:19:23 +02:00
Miriam Baglioni f12b1ede24 Merge branch 'd4science' of code-repo.d4science.org:miriam.baglioni/dnet-hadoop into d4science 2020-06-23 11:16:25 +02:00
Alessia Bardi b762c28cb6 moved test to proper package 2020-06-23 11:15:02 +02:00
Miriam Baglioni 844948f3e0 real output from the cluster 2020-06-23 11:08:43 +02:00
Miriam Baglioni 33e2ebeaaa fix to the mapper, and changed of the json for testing 2020-06-23 11:07:42 +02:00
Alessia Bardi a27b93859e method to purge all items in the d4science catalog 2020-06-22 19:25:25 +02:00
Miriam Baglioni 3da12be81f - 2020-06-22 19:14:06 +02:00
Alessia Bardi d9c07eb800 GCat API and test - disabled 2020-06-22 18:49:04 +02:00
Miriam Baglioni 1566fd590e added set of same type of entries -url cf hb- before creating extras to have them distinct 2020-06-22 17:45:38 +02:00
Miriam Baglioni 004bf225cb added repartition to one before writing so as to have just one file for each community product 2020-06-22 17:38:02 +02:00
Miriam Baglioni e983d02c1c added check to fix issue when entry is present but value it is not 2020-06-22 17:37:30 +02:00
Miriam Baglioni b570f011d1 changed the workflow name 2020-06-22 16:53:32 +02:00
Miriam Baglioni d133368d2d merge branch with fork master 2020-06-22 16:25:56 +02:00
Miriam Baglioni 25a7205549 merge branch with fork master 2020-06-22 16:23:23 +02:00
Miriam Baglioni 06b03840bd new classes for Gcat catalogue, Mapping to the catalogue, spark code and workflow definition 2020-06-22 16:23:00 +02:00
Claudio Atzori 8a3bc7c183 Merge branch 'master' of https://code-repo.d4science.org/D-Net/dnet-hadoop 2020-06-22 14:12:33 +02:00
Claudio Atzori e162ba5075 added dnet workflows to orchestrate the execution of graph2hive, updateSolr and updateStats oozie wfs 2020-06-22 14:12:28 +02:00
Michele Artini 3ce20c198e reformatting 2020-06-22 12:14:25 +02:00
Michele Artini ed787398b3 refactoring wf 2020-06-22 11:45:14 +02:00
Claudio Atzori 9cd27183b6 [maven-release-plugin] prepare for next development iteration 2020-06-22 11:27:44 +02:00
Claudio Atzori 1e3dab0631 [maven-release-plugin] prepare release dhp-1.2.3 2020-06-22 11:27:39 +02:00
Claudio Atzori 961a0d0b49 [actionset promotion] log debugging info in case of error in the action payload extraction or parsing the data 2020-06-22 10:20:45 +02:00
Claudio Atzori 5e8b922962 Merge branch 'master' of https://code-repo.d4science.org/D-Net/dnet-hadoop 2020-06-22 09:50:47 +02:00
Claudio Atzori 7d416f08d8 graph cleaning workflow: set hostedby to unknown repository when defined as NULL 2020-06-22 09:50:43 +02:00
Michele Artini 16c7a18435 refactoring 2020-06-22 08:51:31 +02:00
Alessia Bardi ec19fcace0 API for D4science GCat 2020-06-19 17:37:22 +02:00
Michele Artini f9fc64ffaf Merge branch 'master' of code-repo.d4science.org:D-Net/dnet-hadoop 2020-06-19 15:24:43 +02:00
Michele Artini d88fe0ac84 join methods 2020-06-19 15:24:30 +02:00
Sandro La Bruzzo 464eeeec87 Merge branch 'master' of code-repo.d4science.org:D-Net/dnet-hadoop 2020-06-19 15:11:53 +02:00
Sandro La Bruzzo 1681de672d updated mapping scholexplorer to OAF 2020-06-19 15:11:46 +02:00
Michele Artini 4822747313 some fixes 2020-06-19 13:53:56 +02:00
Michele Artini 834f139e6e fixed some NPE 2020-06-19 12:33:29 +02:00
Claudio Atzori d0ac7514b2 cleaning workflow to include cleaning of default values 2020-06-18 19:37:25 +02:00
Michele Artini 52f62d5d8c events 2020-06-18 14:49:13 +02:00