[graph cleaning] consider terms as synonyms in the vocabulary lookup #170
No reviewers
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
RDGraph
RSAC
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: D-Net/dnet-hadoop#170
Loading…
Reference in New Issue
No description provided.
Delete Branch "graph_cleaning"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The cleaning workflow currently distinguishes terms and synonyms in a neat manner. This results that values already indicating the term are not found as the lookup operation works on the set of synonyms, causing the output to be still uncleaned.
This PR extends the synonym set construction phase, includind also the term labels among them, so that the lookup operation can resolve them.