Cleaning of PIDs #490

Merged
claudio.atzori merged 1 commit from pid_cleaning into main 2024-10-08 15:08:13 +02:00

This PR makes the utility class `eu.dnetlib.dhp.schema.oaf.utils.PidCleaner` used across the pipeline, replacing the usages of the utility methods that used to be defined within `eu.dnetlib.dhp.schema.oaf.utils.CleaningFunctions`.
It also adjusts the tests in `dhp-common` and in `dhp-graph-mapper`.

Furthermore, it restores the removal of duplicated PIDs, which was already implemented but silently stopped working when the internal model classes did not expose the `hashCode` and `equals` methods.
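
For readers unfamiliar with why missing `hashCode` and `equals` methods break deduplication, here is a minimal sketch. It does not use the actual dnet-hadoop model classes: `PidDedupSketch`, `PidNoEquals`, `Pid`, and `dedup` are hypothetical names introduced only for illustration, and the real removal of duplicated PIDs may be implemented differently.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Objects;
import java.util.stream.Collectors;

// Illustrative only: these classes are NOT the project's model classes.
public class PidDedupSketch {

    // Hypothetical PID holder without equals/hashCode:
    // two instances with the same content are still "different" objects.
    static class PidNoEquals {
        final String type;
        final String value;

        PidNoEquals(String type, String value) {
            this.type = type;
            this.value = value;
        }
    }

    // Same fields, but with value-based equals/hashCode.
    static class Pid {
        final String type;
        final String value;

        Pid(String type, String value) {
            this.type = type;
            this.value = value;
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) return true;
            if (!(o instanceof Pid)) return false;
            Pid other = (Pid) o;
            return Objects.equals(type, other.type) && Objects.equals(value, other.value);
        }

        @Override
        public int hashCode() {
            return Objects.hash(type, value);
        }
    }

    // Stream.distinct() relies on equals(), so it only removes
    // duplicates when the element type defines value equality.
    static <T> List<T> dedup(List<T> pids) {
        return pids.stream().distinct().collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<PidNoEquals> broken = dedup(Arrays.asList(
                new PidNoEquals("doi", "10.1234/abc"),
                new PidNoEquals("doi", "10.1234/abc")));
        List<Pid> fixed = dedup(Arrays.asList(
                new Pid("doi", "10.1234/abc"),
                new Pid("doi", "10.1234/abc")));

        System.out.println(broken.size()); // 2: duplicates silently survive
        System.out.println(fixed.size());  // 1: duplicate removed
    }
}
```

The failure is silent because the deduplication code itself still runs without errors. It simply never finds two elements it considers equal.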

claudio.atzori added 1 commit 2024-10-08 15:06:21 +02:00
claudio.atzori merged commit a50e04154e into main 2024-10-08 15:08:13 +02:00
claudio.atzori referenced this issue from a commit 2024-10-08 15:40:04 +02:00

Merged also in the beta branch with d5867a1992
claudio.atzori changed title from PidCleaner used pervasively to Cleaning of PIDs 2024-10-08 15:41:35 +02:00
Reference: D-Net/dnet-hadoop#490