Fixes in Graph Provision
giambattista.bloisi
created branch beta_provision_relation in D-Net/dnet-hadoop
2024-05-07 15:44:42 +02:00
giambattista.bloisi
pushed to beta_provision_relation at D-Net/dnet-hadoop
2024-05-07 15:44:42 +02:00
711048ceed
PrepareRelationsJob rewritten to use the Spark DataFrame API and window functions
69c5efbd8b
Fix: when applying enrichments that carry no instance information, the resulting merged entity was generated with no instances instead of keeping the original instance information
Miscellaneous fixes related to changes in MergeUtils
giambattista.bloisi
created branch misc_fixes_merge_entities in D-Net/dnet-hadoop
2024-04-24 08:12:53 +02:00
giambattista.bloisi
pushed to misc_fixes_merge_entities at D-Net/dnet-hadoop
2024-04-24 08:12:53 +02:00
1878199dae
Miscellaneous fixes:
Describe OpenAIRE ID stability and usage of the pivot table
giambattista.bloisi
created branch pid_stability in D-Net/openaire-graph-docs
2024-04-22 22:04:28 +02:00
6bb810a606
Describe how the pivot table is used to improve the stability of “representative records” and how “non-authoritative” PIDs are used to generate “representative records”
9222fe3456
Format md
giambattista.bloisi
pushed to airflow at giambattista.bloisi/lot1-kickoff
2024-04-18 13:13:19 +02:00
09b603925d
initial stage
giambattista.bloisi
pushed to airflow at giambattista.bloisi/lot1-kickoff
2024-04-18 12:27:12 +02:00
f89898e99b
initial stage
giambattista.bloisi
pushed to airflow at giambattista.bloisi/lot1-kickoff
2024-04-18 12:01:47 +02:00
a1b43d02eb
initial stage
giambattista.bloisi
deleted branch revised_merge_logic from D-Net/dnet-hadoop
2024-04-18 10:43:09 +02:00
Refinements to PR #404: refactoring the Oaf records merge utilities into dhp-common
giambattista.bloisi
created branch revised_merge_logic in D-Net/dnet-hadoop
2024-04-16 17:19:50 +02:00
giambattista.bloisi
deleted branch dedup_authorsmatch_bytoken from D-Net/dnet-hadoop
2024-04-16 10:24:25 +02:00
Enhance Dedup author matching with the algorithms used for ORCID enhancements (task 9690)