.. |
clean/oozie_app
|
fixed input path supplemented to GetDatasourceFromCountry; adjusted the various spark.sql.shuffle.partitions
|
2023-03-24 13:09:12 +01:00 |
group/oozie_app
|
[graph grouping] added isLookupUrl to the workflow definition, passed to the grouping spark aciton
|
2023-12-03 13:32:52 +01:00 |
hive/oozie_app
|
added metaresourcetype to the result hive DB view
|
2023-12-21 12:26:19 +01:00 |
hostedbymap
|
[HostedByMap] added left over from PR and fixed issue on workflow
|
2022-03-21 09:54:45 +01:00 |
merge/oozie_app
|
pre-group the records in each table before joning the contents from BETA and PROD together
|
2023-03-02 14:49:19 +01:00 |
raw_all/oozie_app
|
[aggregator graph] using dedicated path to sync claims, adjusted paths with wildcards
|
2023-03-08 21:16:52 +01:00 |
raw_organizations/oozie_app
|
[DoiBoost Organizations] added parameter to specify the action in the wf raw_organizations to be able to load the openorgs organization as in the loading step for the construction of the graph
|
2022-01-13 13:52:00 +01:00 |
resolution
|
added defaults to the graph resolution workflow config-default.xml
|
2023-10-20 22:28:12 +02:00 |
sql
|
[aggregator graph] added column alias when mapping organization PIDs from the OpenOrgs database
|
2023-06-13 11:38:10 +02:00 |
xquery
|
WIP: graph cleaner implementation
|
2020-06-09 17:20:40 +02:00 |
copy_hdfs_oaf_parameters.json
|
conflict resolved on merge
|
2021-10-26 09:40:47 +02:00 |
datasourcemaster_parameters.json
|
[graph cleaning] patch the result's collectedfrom and hostedby identifiers according to the datasource master-duplicate mapping
|
2022-11-28 09:54:18 +01:00 |
dispatch_entities_parameters.json
|
raw graph creation workflow moved under dhp-graph-mapper, claims integration is included
|
2020-04-10 17:53:07 +02:00 |
generate_entities_parameters.json
|
[aggregator graph] save invalid records aside for further inspection
|
2022-09-16 14:06:28 +02:00 |
hive_db_importer_parameters.json
|
parallel implementation for graph Hive importer
|
2020-05-15 09:05:26 +02:00 |
hive_table_importer_parameters.json
|
introduced parameter 'numParitions', driving the hive DB table data partitioning. Currently specified only for table 'project'
|
2020-07-23 08:54:10 +02:00 |
input_clean_cfhb_parameters.json
|
[Cleaning] fixed parameter name in property file
|
2022-12-08 16:59:34 +01:00 |
input_clean_context_parameters.json
|
[graph cleaning] WIP: testing the collectedfron and hostedby patch procedure
|
2022-11-29 11:21:51 +01:00 |
input_clean_country_parameters.json
|
[graph cleaning] WIP: testing the collectedfron and hostedby patch procedure
|
2022-11-29 11:21:51 +01:00 |
input_clean_graph_parameters.json
|
[graph cleaning] WIP: refactoring of the cleaning stages, unit tests
|
2023-03-21 14:41:20 +01:00 |
input_datasource_country_parameters.json
|
[graph cleaning] WIP: testing the collectedfron and hostedby patch procedure
|
2022-11-29 11:21:51 +01:00 |
input_graph_hive_parameters.json
|
unit test for GraphHiveImporterJob
|
2020-04-08 13:24:43 +02:00 |
merge_claims_parameters.json
|
raw graph creation workflow moved under dhp-graph-mapper, claims integration is included
|
2020-04-10 17:53:07 +02:00 |
merge_graphs_parameters.json
|
added parameter to drive the graph merge strategy: priority (BETA|PROD)
|
2020-07-20 10:48:01 +02:00 |
migrate_db_entities_parameters.json
|
blacklist of nsprefix
|
2020-07-30 16:13:38 +02:00 |
migrate_hdfs_mstores_parameters.json
|
first implementation of Hdfs Mdstores Importer
|
2021-05-27 16:22:07 +02:00 |
migrate_mongo_mstores_parameters.json
|
implemented synch for single mdstore
|
2022-12-01 11:34:43 +01:00 |
patch_relations_parameters.json
|
[raw_all] added extra workflow step for patching the identifiers in the relations, given an id mapping dataset
|
2021-07-29 12:13:06 +02:00 |
verify_records_parameters.json
|
[aggregator graph] save invalid records aside for further inspection
|
2022-09-16 14:06:28 +02:00 |