Master branch updates from beta December 2023 #369

Merged
claudio.atzori merged 102 commits from beta_to_master_dicember2023 into master 2023-12-15 11:18:31 +01:00

This PR brings to the master branch the changes available in the beta branch as of December 2023.

  • #350 COAR based resource types & Irish tender fields
  • #352 URL Validator to accept double slashes
  • #353 Add Pubmed affiliations (inferred by BIP) as actionsets
  • #355 StatsDB workflow to export actionsets about OA routes, diamond, and publicly-funded
  • #356 graph cleaning, suggestions from ticket 8898 - round 2
  • #359 [ENRICHMENT][BETA] Use of community API in enrichment process AND addition to tagging result for communities through projects
  • #363 Changes for tables and creation of the new indicator indi_is_result_accessible
  • #364 ORCID Enrichment and Download
  • #365, #368 [graph provision] added serialization for the new fields imported from the stats DB
  • #366 [graph cleaning] added cleaning for result.publisher and result.instance.license
claudio.atzori added 101 commits 2023-12-15 10:15:26 +01:00
89c4dfbaf4 StatsDB workflow to export actionsets about OA routes, diamond, and publicly-funded
A new Oozie workflow that reads from the stats DB and produces a new actionSet for updating results with:
- green_oa ={true, false}
- openAccesColor = {gold, hybrid, bronze}
- in_diamond_journal={true, false}
- publicly_funded={true, false}

Inputs:

- outputPath
- statsDB
d524e30866 Changes to actionsets
Resolve comments from #355
a94a54a2d0 Changes for tables and creation of the new indicator indi_is_result_accessible
- Drop table statements for all tables to avoid duplicates in case of a workflow rerun
- Add pdfsaggregated step to create the indi_is_result_accessible table. This step runs only on the new Impala cluster, since pdfaggregation_i is updated on that cluster.
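The drop-before-create pattern from the first bullet can be sketched as follows. This is a minimal illustration, not the workflow's actual script: the helper name `rebuild_statements`, the database name `stats_db`, and the placeholder query are hypothetical. Only the table name `indi_is_result_accessible` comes from the commit.

```python
from typing import List


def rebuild_statements(db: str, table: str, select_sql: str) -> List[str]:
    """Return SQL statements that recreate a table from scratch.

    Dropping first makes a rerun idempotent: a second execution replaces the
    table instead of failing or duplicating rows.
    """
    qualified = f"{db}.{table}"
    return [
        f"DROP TABLE IF EXISTS {qualified}",
        f"CREATE TABLE {qualified} AS {select_sql}",
    ]


# Hypothetical usage: the database name and the query are placeholders.
for statement in rebuild_statements(
    "stats_db", "indi_is_result_accessible", "SELECT 1 AS placeholder"
):
    print(statement + ";")
```

Running the same pair of statements twice leaves a single copy of the table, which is the property the commit is after.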
359e81b7a6 Update StatsAtomicActionsJob.java
Bug fix for duplicate bronze checks
af1c2634b3 added instanceTypeMapping original field in the mapping of
- DOIBoost
- Datacite
- PubMed
- Scholexplorer Datasource
86b5775e08 added vocabulary in instanceTypeMapping for
- DOIBoost
- Datacite
- PubMed
- Scholexplorer Datasource
48430a32a6 Update StatsAtomicActionsJob.java
Added indi_funded_result_with_fundref indicator
bf0fd27c36 Removed unused function
Applied Giambattista's review comment from the PR
76594ded23 Changes to indicators
Fixes on open access colours indicators
- indi_pub_green_oa
- indi_pub_gold_oa
- indi_pub_hybrid
- indi_pub_bronze_oa
- indi_pub_diamond
a397112cb8 Add new indicator
Add indi_pub_publicly_funded
claudio.atzori added 1 commit 2023-12-15 11:17:27 +01:00
claudio.atzori merged commit c4ec35b6cd into master 2023-12-15 11:18:31 +01:00
Reference: D-Net/dnet-hadoop#369