Blacklist filtering moved before the cleanup phase to allow case-sensitive regexes #485

Merged
claudio.atzori merged 1 commit from dedup_blacklist_fix into beta 2024-10-28 09:42:52 +01:00

This PR fixes an error in the blacklist application stage of deduplication. The blacklist regexes are now applied before the fields are cleaned (lower-casing, normalization, etc.), so case-sensitive patterns are matched against the original, uncleaned values.
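
The effect of the ordering can be illustrated with a minimal sketch. It is written in Python for brevity and is not the project's actual code: the `BLACKLIST` patterns, the `clean` function, and the helper names are all hypothetical stand-ins chosen for illustration.

```python
import re

# Hypothetical case-sensitive blacklist patterns (not taken from the project).
BLACKLIST = [
    re.compile(r"^Editorial$"),
    re.compile(r"^[A-Z ]+ REPORT$"),
]


def clean(value: str) -> str:
    """Stand-in for the cleanup phase: lower-case and collapse whitespace."""
    return " ".join(value.lower().split())


def is_blacklisted(value: str) -> bool:
    """True if any blacklist pattern matches the value as given."""
    return any(pattern.search(value) for pattern in BLACKLIST)


def filter_then_clean(values):
    """Order described in this PR: apply the blacklist to raw values, then clean."""
    return [clean(v) for v in values if not is_blacklisted(v)]


def clean_then_filter(values):
    """Previous order: clean first, so case-sensitive patterns no longer match."""
    cleaned = [clean(v) for v in values]
    return [v for v in cleaned if not is_blacklisted(v)]


titles = ["Editorial", "ANNUAL REPORT", "A study of editorial practices"]
print(filter_then_clean(titles))
print(clean_then_filter(titles))
```

In this sketch, filtering first drops the two blacklisted titles, while cleaning first lower-cases them so that neither pattern matches and all three values pass through.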

michele.debonis added 1 commit 2024-09-25 08:59:01 +02:00
claudio.atzori merged commit 67e37f41fb into beta 2024-10-28 09:42:52 +01:00
Reference: D-Net/dnet-hadoop#485