[Affiliation Propagation] #162

Merged
claudio.atzori merged 12 commits from affiliationPropagation into beta 2021-12-14 15:28:24 +01:00

This PR extends the set of enrichment steps by exploiting the relation of type IsParentOf/IsChildOf between organization to create new affiliation relationships between results and organizations.

It works as follows:
Phase 1. Preparation step.
- It stores the couple organization, set of all its parents.
- It stores the couples result, set of known organization it is affiliated to
- It stores the set of leaves: organizations with no children
- It stoires the already existing set of relations between results and organizations

Phase 2. Execution Steps. The process is iterative. Each time it considers a given set of organizations from which to start to create new relationships. The first time it is executed, the starting set is the one of the leaves. It takes all the results associated to the leaves and tries to create affiliation relation with the leaves direct parents. If no relation already exists in the graph connecting the result and the organization a new relation is made. The new relations are used to extend the set of associations between results and organization. In this way a result connected in step i at the organization o can be propagated also to all the parents of o in subsequent steps.
All the direct parent of the set of leaves are selected and considered as the new set of leaves from which to start the propagation again. The process will stop once there is no more organization through which propagate or we have reached a maximum number of iteration (set to 5)

This PR extends the set of enrichment steps by exploiting the relation of type IsParentOf/IsChildOf between organization to create new affiliation relationships between results and organizations. It works as follows: Phase 1. Preparation step. - It stores the couple organization, set of all its parents. - It stores the couples result, set of known organization it is affiliated to - It stores the set of leaves: organizations with no children - It stoires the already existing set of relations between results and organizations Phase 2. Execution Steps. The process is iterative. Each time it considers a given set of organizations from which to start to create new relationships. The first time it is executed, the starting set is the one of the leaves. It takes all the results associated to the leaves and tries to create affiliation relation with the leaves direct parents. If no relation already exists in the graph connecting the result and the organization a new relation is made. The new relations are used to extend the set of associations between results and organization. In this way a result connected in step i at the organization o can be propagated also to all the parents of o in subsequent steps. All the direct parent of the set of leaves are selected and considered as the new set of leaves from which to start the propagation again. The process will stop once there is no more organization through which propagate or we have reached a maximum number of iteration (set to 5)
miriam.baglioni added the
enhancement
label 2021-11-16 15:23:41 +01:00
claudio.atzori was assigned by miriam.baglioni 2021-11-16 15:23:41 +01:00
miriam.baglioni added 8 commits 2021-11-16 15:23:41 +01:00
miriam.baglioni added 1 commit 2021-11-16 15:24:26 +01:00
miriam.baglioni added 1 commit 2021-11-16 16:36:06 +01:00
claudio.atzori added 1 commit 2021-11-18 18:06:28 +01:00
claudio.atzori added 1 commit 2021-12-14 15:26:58 +01:00
claudio.atzori merged commit e30e5ac8a8 into beta 2021-12-14 15:28:24 +01:00
Sign in to join this conversation.
No reviewers
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: D-Net/dnet-hadoop#162
No description provided.