H2020Classification fix and possibility to add datasources in blacklist for propagation of result to organization #108

Merged
claudio.atzori merged 12 commits from miriam.baglioni/dnet-hadoop:master into master 2021-05-20 15:25:05 +02:00

This PR fixes an issue while building the h2020classification action set. The name of two of the files changed as well as the name of the sheet to be red while getting topic information. THe sheet name is now a parameter and when missing an exception is fired.

This PR fixes an issue while building the h2020classification action set. The name of two of the files changed as well as the name of the sheet to be red while getting topic information. THe sheet name is now a parameter and when missing an exception is fired.
miriam.baglioni added 11 commits 2021-05-20 15:01:01 +02:00
claudio.atzori was assigned by miriam.baglioni 2021-05-20 15:01:25 +02:00
miriam.baglioni added the
enhancement
label 2021-05-20 15:01:35 +02:00
claudio.atzori added 1 commit 2021-05-20 15:22:37 +02:00
claudio.atzori merged commit 2e70aa43f0 into master 2021-05-20 15:25:05 +02:00

The changes look ok, but please drop a comment to describe how the parameters should be changed from the workflow caller for both workflows

  • H2020Classification
  • propagation of result to organization
The changes look ok, but please drop a comment to describe how the parameters should be changed from the workflow caller for both workflows * H2020Classification * propagation of result to organization
Author
Member

for the h2020classification the parameters are four:

The URL to download the project file
param name : projectFileURL
param value : https://cordis.europa.eu/data/cordis-h2020projects.csv

The URL to download the programme file
param name : programmeFileURL
param value : https://cordis.europa.eu/data/reference/cordisref-h2020programmes.csvt

The URL to download the topics file
param name : topicFileURL
param value : https://cordis.europa.eu/data/reference/cordisref-h2020topics.xlsx

The name of the sheet in the topic file to be read
param name. : sheetName
param value : Topics

for the propagation workflow, the added parameter is optional.
It allows to specify a set of institutional repository for which not to apply the propagation strategy

param name : blacklist
param value : the list of datasource id we want to add to the blacklist split by ;

for the h2020classification the parameters are four: The URL to download the project file param name : projectFileURL param value : https://cordis.europa.eu/data/cordis-h2020projects.csv The URL to download the programme file param name : programmeFileURL param value : https://cordis.europa.eu/data/reference/cordisref-h2020programmes.csvt The URL to download the topics file param name : topicFileURL param value : https://cordis.europa.eu/data/reference/cordisref-h2020topics.xlsx The name of the sheet in the topic file to be read param name. : sheetName param value : Topics for the propagation workflow, the added parameter is optional. It allows to specify a set of institutional repository for which not to apply the propagation strategy param name : blacklist param value : the list of datasource id we want to add to the blacklist split by ;
Sign in to join this conversation.
No reviewers
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: D-Net/dnet-hadoop#108
No description provided.