From 7c8afb7797a78bd7be8db30c3dcb3636906625aa Mon Sep 17 00:00:00 2001 From: Giambattista Bloisi Date: Wed, 29 May 2024 23:18:42 +0200 Subject: [PATCH] Change the selection criteria for the pivot record of a group so that by best pid type becomes the first criteria. This will have the effect to slowly converge to records having DOI pid --- .../deduplication/research-products.md | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/docs/graph-production-workflow/deduplication/research-products.md b/docs/graph-production-workflow/deduplication/research-products.md index 9287c2b..52e5d2f 100644 --- a/docs/graph-production-workflow/deduplication/research-products.md +++ b/docs/graph-production-workflow/deduplication/research-products.md @@ -167,9 +167,8 @@ an OpenAIRE identifier, derived from a record of the group known as the _pivot record_. It is determined after sorting the group of duplicate records by the following criteria: -1. Records previously chosen as pivot records in the graph's previous - generations. -2. Records with identifiers from a [PID authority](/data-model/pids-and-identifiers#pid-authorities). +1. Records with identifiers from a [PID authority](/data-model/pids-and-identifiers#pid-authorities). +2. Records chosen as pivots in the graph's previous generations. 3. Publications from CrossRef or datasets from DataCite. 4. Records with an earlier date of acceptance. 5. Records with smaller IDs in lexicographical order.