Patch the identifiers (source/target) in the relations
#125
Merged
claudio.atzori
merged 3 commits from fct_project_id_replacement
into master
3 years ago
Loading…
Reference in New Issue
There is no content yet.
Delete Branch 'fct_project_id_replacement'
Deleting a branch is permanent. It CANNOT be undone. Continue?
This PR includes an extension of the oozie workflow responsible for the creation of the
raw
graph that optionally allows to run a phase where the identifiers included in the graph relations (source
andtarget
) can be updated with the values provided by a correspondence map (oldId -> newId
).In order to activate this phase, when running the
raw_all
workflow ensure toshouldPatchRelations = true
idMappingPath
). Note that the oozie workflow also checkes that the path exists@ -0,0 +81,4 @@
rels
.joinWith(idMapping, rels.col("source").equalTo(idMapping.col("oldId")), "full")
.filter((FilterFunction<Tuple2<Relation, RelationIdMapping>>) t -> Objects.nonNull(t._1()))
I think you could replace the filter step with a left join: you get all the original relations that is what you do with the full and the subsequent filter
f83dd70e1c
into master 3 years agoPR integrated also in the
beta
branch withe87e1805c4
One further fix added
patching relation identifier phase to be run at the end, i.e. includes also claimed relations
beta
commit5d08ad86ae
master
commite725c88ebb
Reviewers
f83dd70e1c
.Step 1:
From your project repository, check out a new branch and test the changes.Step 2:
Merge the changes and update on Gitea.