dnet-hadoop/dhp-workflows/dhp-graph-mapper/src/main/resources/eu/dnetlib/dhp/oa/graph
Michele Artini 3268570b2c mapping of project PIDs 2024-02-22 14:47:21 +01:00
..
clean/oozie_app fixed input path supplemented to GetDatasourceFromCountry; adjusted the various spark.sql.shuffle.partitions 2023-03-24 13:09:12 +01:00
group/oozie_app [graph grouping] added isLookupUrl to the workflow definition, passed to the grouping spark aciton 2023-12-03 13:32:52 +01:00
hive/oozie_app added metaresourcetype to the result hive DB view 2023-12-21 12:27:10 +01:00
hostedbymap - 2023-12-20 14:26:54 +01:00
merge/oozie_app pre-group the records in each table before joning the contents from BETA and PROD together 2023-03-02 14:49:19 +01:00
raw_all/oozie_app [aggregator graph] using dedicated path to sync claims, adjusted paths with wildcards 2023-03-08 21:16:52 +01:00
raw_organizations/oozie_app [DoiBoost Organizations] added parameter to specify the action in the wf raw_organizations to be able to load the openorgs organization as in the loading step for the construction of the graph 2022-01-13 13:52:00 +01:00
resolution added defaults to the graph resolution workflow config-default.xml 2023-10-20 22:28:12 +02:00
sql mapping of project PIDs 2024-02-22 14:47:21 +01:00
xquery WIP: graph cleaner implementation 2020-06-09 17:20:40 +02:00
copy_hdfs_oaf_parameters.json conflict resolved on merge 2021-10-26 09:40:47 +02:00
datasourcemaster_parameters.json [graph cleaning] patch the result's collectedfrom and hostedby identifiers according to the datasource master-duplicate mapping 2022-11-28 09:54:18 +01:00
dispatch_entities_parameters.json raw graph creation workflow moved under dhp-graph-mapper, claims integration is included 2020-04-10 17:53:07 +02:00
generate_entities_parameters.json [aggregator graph] save invalid records aside for further inspection 2022-09-16 14:06:28 +02:00
hive_db_importer_parameters.json parallel implementation for graph Hive importer 2020-05-15 09:05:26 +02:00
hive_table_importer_parameters.json introduced parameter 'numParitions', driving the hive DB table data partitioning. Currently specified only for table 'project' 2020-07-23 08:54:10 +02:00
input_clean_cfhb_parameters.json [Cleaning] fixed parameter name in property file 2022-12-08 16:59:34 +01:00
input_clean_context_parameters.json [graph cleaning] WIP: testing the collectedfron and hostedby patch procedure 2022-11-29 11:21:51 +01:00
input_clean_country_parameters.json [graph cleaning] WIP: testing the collectedfron and hostedby patch procedure 2022-11-29 11:21:51 +01:00
input_clean_graph_parameters.json [graph cleaning] WIP: refactoring of the cleaning stages, unit tests 2023-03-21 14:41:20 +01:00
input_datasource_country_parameters.json [graph cleaning] WIP: testing the collectedfron and hostedby patch procedure 2022-11-29 11:21:51 +01:00
input_graph_hive_parameters.json unit test for GraphHiveImporterJob 2020-04-08 13:24:43 +02:00
merge_claims_parameters.json raw graph creation workflow moved under dhp-graph-mapper, claims integration is included 2020-04-10 17:53:07 +02:00
merge_graphs_parameters.json added parameter to drive the graph merge strategy: priority (BETA|PROD) 2020-07-20 10:48:01 +02:00
migrate_db_entities_parameters.json blacklist of nsprefix 2020-07-30 16:13:38 +02:00
migrate_hdfs_mstores_parameters.json first implementation of Hdfs Mdstores Importer 2021-05-27 16:22:07 +02:00
migrate_mongo_mstores_parameters.json implemented synch for single mdstore 2022-12-01 11:34:43 +01:00
patch_relations_parameters.json [raw_all] added extra workflow step for patching the identifiers in the relations, given an id mapping dataset 2021-07-29 12:13:06 +02:00
verify_records_parameters.json [aggregator graph] save invalid records aside for further inspection 2022-09-16 14:06:28 +02:00