implementation of the whitelist for similarity relations
#144
Merged
claudio.atzori
merged 3 commits from dedup_whitelist
into beta
3 years ago
Loading…
Reference in New Issue
There is no content yet.
Delete Branch 'dedup_whitelist'
Deleting a branch is permanent. It CANNOT be undone. Continue?
Implementation of a new Job for the Scan WF (de-duplication).
The job takes the whitelist file path to add whitelisted similarity relations to the relations calculated by the dedup algorithm.
File format: source_id####target_id (1 per line)
35619b93ee
into beta 3 years agoNote for updating the dnet workflow: the only parameter we need to introduce is the
whiteListPath
pointing to the HDFS location of the whitelist file.35619b93ee
.Step 1:
From your project repository, check out a new branch and test the changes.Step 2:
Merge the changes and update on Gitea.