Cleaning of PIDs #490
No reviewers
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
RDGraph
RSAC
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: D-Net/dnet-hadoop#490
Loading…
Reference in New Issue
No description provided.
Delete Branch "pid_cleaning"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
This PR makes the utility class
eu.dnetlib.dhp.schema.oaf.utils.PidCleaner
used across the pipeline. Replacing the usages of the utility methods that used to be defined withineu.dnetlib.dhp.schema.oaf.utils.CleaningFunctions
.It also adjusts the tests in
dhp-common
and indhp-graph-mapper
.Furthermore, it restores the removal of duplicated PIDs, already implemented which silently stopped working when the internal model classes did not expose the
hashCode
and theequals
methods.Merged also in the
beta
branch withd5867a1992
PidCleaner used pervasivelyto Cleaning of PIDs