master #11

Manually merged
claudio.atzori merged 275 commits from :master into enrichment_wfs 2020-05-11 15:14:56 +02:00

This PR includes:

1 - the propagation workflow
it is composed by
a. countryPropagation it propagates to results the country of the organization that provides the repository from which the result is collected. The propagation is done in the repository is of type "pubsrepo::institutional" or if the id of the repository is in the blacklist set. Changes the result content by adding information for the country
b. Author PID propagation: it passes the author pids (ORCID) to results linked by strong semantic relationships (isSupplementedBy, isSupplementTo) The orcid is passed if the result having the orcid and the one which receives it have the same surname for the author and the same name or the same initial for the name. The check is done on the lowercase of the name, surname. The pis is passed only if it was not already present. It changes the result content by adding information for the author PID
c. resul to project. Propagates the link to project to result linked with strong semantic relationships to other result already linked to the same project. For example: r1 -> isSupplementedBy -> r2 and r1 -> isProducedBy -> p than r2 -> isProducedBy ->p. The new relation is added only if not already present
d. result to community from organization. Propagates the context of communities to result coming from community related organizations. It changes the result by adding the community context
e. result to community from semantic relations. Propagates the context of communities to result linked with strong semantic relations with other results already linked to the community. It changed the result by adding the context for the community
f. result to organization from institutional repositories. It creates new relation linking the result collected from institutional repository to the organization providing the repository. It add new relations of type resultOrganization_affiliation_isAuthorInstitutionOf and the inverse
2 - the bulkTagging workflow
it changes the result by adding context related to communities. The context for the community can be added by subject match (if the result has at least one the subjects among thos selected by the community the community will be added to the context of the result), by zenodo communities (it the result belongs to at least one of the zenodo communities listed by the community the community will be added to the context of the result, by datasource (if the result is collected from one of the datasource listed by the community - and verifies the optional constraints expressed by the community- then the community will be added to the context of the result)
3 - the blacklist workflow
It removes all the relations that have been blacklisted

This PR includes: 1 - the propagation workflow it is composed by a. countryPropagation it propagates to results the country of the organization that provides the repository from which the result is collected. The propagation is done in the repository is of type "pubsrepo::institutional" or if the id of the repository is in the blacklist set. Changes the result content by adding information for the country b. Author PID propagation: it passes the author pids (ORCID) to results linked by strong semantic relationships (isSupplementedBy, isSupplementTo) The orcid is passed if the result having the orcid and the one which receives it have the same surname for the author and the same name or the same initial for the name. The check is done on the lowercase of the name, surname. The pis is passed only if it was not already present. It changes the result content by adding information for the author PID c. resul to project. Propagates the link to project to result linked with strong semantic relationships to other result already linked to the same project. For example: r1 -> isSupplementedBy -> r2 and r1 -> isProducedBy -> p than r2 -> isProducedBy ->p. The new relation is added only if not already present d. result to community from organization. Propagates the context of communities to result coming from community related organizations. It changes the result by adding the community context e. result to community from semantic relations. Propagates the context of communities to result linked with strong semantic relations with other results already linked to the community. It changed the result by adding the context for the community f. result to organization from institutional repositories. It creates new relation linking the result collected from institutional repository to the organization providing the repository. It add new relations of type resultOrganization_affiliation_isAuthorInstitutionOf and the inverse 2 - the bulkTagging workflow it changes the result by adding context related to communities. The context for the community can be added by subject match (if the result has at least one the subjects among thos selected by the community the community will be added to the context of the result), by zenodo communities (it the result belongs to at least one of the zenodo communities listed by the community the community will be added to the context of the result, by datasource (if the result is collected from one of the datasource listed by the community - and verifies the optional constraints expressed by the community- then the community will be added to the context of the result) 3 - the blacklist workflow It removes all the relations that have been blacklisted

Thank you Miriam, I'm going to integrate the PR in the enrichment_wfs branch and massage it a bit before integrating it in the master branch.

Thank you Miriam, I'm going to integrate the PR in the `enrichment_wfs` branch and massage it a bit before integrating it in the master branch.
claudio.atzori closed this pull request 2020-05-11 15:14:56 +02:00
Sign in to join this conversation.
No reviewers
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: D-Net/dnet-hadoop#11
No description provided.