parameter naming conversions across the provision workflows #29
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
RDGraph
RSAC
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: D-Net/dnet-hadoop#29
Loading…
Reference in New Issue
No description provided.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Currently the different oozie workflows assembled as the OpenAIRE data provision pipeline doesn't follow precise conventions for naming the input/output parameters.
An overview of the parameters available through the D-Net provisionw workflow definition:
shows that some simple naming conventions could allveviate the burden when it comes to maintain the parameter coherency across the oozie workflows as well as to introduce new graph processing steps, when needed.
From the descriptions below, there are many parameters describing the same concepts across different workflow steps.
aggregatorGraph
promoteActions
duplicateScan
dedupConsistency
graphCleaning
orcidPropagation
bulkTagging
affiliationPropagation
communityOrganizationPropagation
resultProjectPropagation
communitySemrelPropagation
countryPropagation
blacklistRelations
In particular, the most recurring parameters describe
isLookUpUrl
vsisLookupUrl
.So, I propose to adopt the same naming conventions for the three parameters: