graph cleaning, suggestions from ticket 8898 #325

Merged
miriam.baglioni merged 2 commits from cleaning_8898 into beta 2023-08-08 11:14:20 +02:00

This PR proposes a partial implementation of the suggestions from https://support.openaire.eu/issues/8898.

In particular

  • for Author names and the publisher name: get rid of tab, CR characters, \n(s), escape double quotes;
  • The fulltext field should be present only for publications and ORPs, when available for Dataset and Software it gets removed;
  • Rule out projects without a grant code.
This PR proposes a partial implementation of the suggestions from https://support.openaire.eu/issues/8898. In particular * for Author names and the publisher name: get rid of tab, CR characters, \n(s), escape double quotes; * The fulltext field should be present only for publications and ORPs, when available for Dataset and Software it gets removed; * Rule out projects without a grant code.
miriam.baglioni was assigned by claudio.atzori 2023-07-25 17:33:56 +02:00
claudio.atzori added 1 commit 2023-07-25 17:33:57 +02:00
claudio.atzori added 1 commit 2023-07-25 17:41:05 +02:00
miriam.baglioni merged commit c25ac21e5e into beta 2023-08-08 11:14:20 +02:00
claudio.atzori requested review from giambattista.bloisi 2023-10-31 14:58:23 +01:00
claudio.atzori removed review request for giambattista.bloisi 2023-10-31 14:58:36 +01:00
Sign in to join this conversation.
No reviewers
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: D-Net/dnet-hadoop#325
No description provided.