Collection of OOZIE workflows for the OpenAIRE Graph construction, processing, provisioning.
Go to file
Claudio Atzori 06c1913062 added different limits for grouping by source and by target, incremented spark.sql.shuffle.partitions for the join operations 2020-07-10 19:03:33 +02:00
dhp-build [maven-release-plugin] prepare for next development iteration 2020-06-22 11:27:44 +02:00
dhp-common [maven-release-plugin] prepare for next development iteration 2020-06-22 11:27:44 +02:00
dhp-doc-resources/img updated image 2020-03-05 15:11:42 +01:00
dhp-schemas [maven-release-plugin] prepare for next development iteration 2020-06-22 11:27:44 +02:00
dhp-workflows added different limits for grouping by source and by target, incremented spark.sql.shuffle.partitions for the join operations 2020-07-10 19:03:33 +02:00
.gitignore ignore *.log files 2020-03-27 13:45:25 +01:00
LICENSE added LICENSE file - AGPL-3.0 2020-04-29 16:11:17 +02:00
README.md test commit 2019-10-02 15:12:09 +02:00
pom.xml adjusted dedup configuration for result entities: using new wordssuffixprefix clustering function, removed ngrampairs, adjusted queueMaxSize (800) and slidingWindowSize (80) 2020-07-02 17:35:22 +02:00

README.md

dnet-hadoop

Dnet-hadoop is a tool for