Enrich authors with ORCID info using new matching algorithm #398

Merged
giambattista.bloisi merged 2 commits from new_orcid_enhancement into beta 2024-03-22 17:29:20 +01:00

The new author enriching strategy for ORCID adopts a multi-pass strategy where different matching algorithms algorithms are applied in order of their expected reliability (from highest to lowest), so that next pass will work on remaining unmatched authors of its previous passes:

  1. exact fullname match, reconstruct ORCID fullname as givenName + familyName
  2. reversed fullname match, reconstruct ORCID fullname as familyName + givenName
  3. split author names in tokens, order the tokens, then check for matches of ordered full tokens or abbreviations
  4. exact matches of ORCID creditName
  5. exact matches in ORCID otherNames
The new author enriching strategy for ORCID adopts a multi-pass strategy where different matching algorithms algorithms are applied in order of their expected reliability (from highest to lowest), so that next pass will work on remaining unmatched authors of its previous passes: 1. exact fullname match, reconstruct ORCID fullname as givenName + familyName 2. reversed fullname match, reconstruct ORCID fullname as familyName + givenName 3. split author names in tokens, order the tokens, then check for matches of ordered full tokens or abbreviations 4. exact matches of ORCID creditName 5. exact matches in ORCID otherNames
giambattista.bloisi added 1 commit 2024-03-11 13:33:20 +01:00
giambattista.bloisi requested review from sandro.labruzzo 2024-03-11 13:33:39 +01:00
giambattista.bloisi requested review from claudio.atzori 2024-03-11 13:33:39 +01:00
giambattista.bloisi requested review from miriam.baglioni 2024-03-11 13:33:39 +01:00

To me it is OK

To me it is OK
giambattista.bloisi added 1 commit 2024-03-22 16:24:54 +01:00
giambattista.bloisi merged commit 3f22c101d9 into beta 2024-03-22 17:29:20 +01:00
Sign in to join this conversation.
No description provided.