dnet-hadoop/dhp-workflows/dhp-dedup-openaire/src/main/resources/eu/dnetlib/dhp/oa/dedup
miconis 0e54803177 bug fix in the id generator and implementation of jobs for organization dedup 2020-10-20 12:19:46 +02:00
..
consistency/oozie_app various refactorings on the dnet-dedup-openaire workflow 2020-04-18 12:06:23 +02:00
neworgs/oozie_app implementation of the workflow for new organizations in openorgs 2020-10-06 13:58:09 +02:00
orgsdedup/oozie_app bug fix in the id generator and implementation of jobs for organization dedup 2020-10-20 12:19:46 +02:00
scan/oozie_app configurable number of partitions used in the SparkCreateSimRels phase 2020-07-13 16:07:07 +02:00
statistics/oozie_app SparkBlockStats allows to repartition the input rdd via the numPartitions workflow parameter 2020-07-13 20:09:06 +02:00
collectSimRels_parameters.json implementation of the job to collect simrels from postgres db 2020-09-22 09:43:27 +02:00
createBlockStats_parameters.json SparkBlockStats allows to repartition the input rdd via the numPartitions workflow parameter 2020-07-13 20:09:06 +02:00
createCC_parameters.json implemented test for cut of connected component 2020-07-13 15:28:17 +02:00
createDedupRecord_parameters.json various refactorings on the dnet-dedup-openaire workflow 2020-04-18 12:06:23 +02:00
createSimRels_parameters.json configurable number of partitions used in the SparkCreateSimRels phase 2020-07-13 16:07:07 +02:00
prepareNewOrgs_parameters.json bug fix in the id generator and implementation of jobs for organization dedup 2020-10-20 12:19:46 +02:00
prepareOrgRels_parameters.json bug fix in the id generator and implementation of jobs for organization dedup 2020-10-20 12:19:46 +02:00
propagateRelation_parameters.json various refactorings on the dnet-dedup-openaire workflow 2020-04-18 12:06:23 +02:00
updateEntity_parameters.json configurable number of partitions used in the SparkCreateSimRels phase 2020-07-13 16:07:07 +02:00