WIP: dump of the OpenAIRE graph - Changes #103
Closed
miriam.baglioni
wants to merge 77 commits from
miriam.baglioni/dnet-hadoop:dump
into master
pull from: miriam.baglioni/dnet-hadoop:dump
merge into: D-Net:master
D-Net:main
D-Net:beta
D-Net:main_bulktag
D-Net:Communities_patents
D-Net:actionset_promote_sequential
D-Net:propagateorcid
D-Net:COnnectSubCommunities
D-Net:datacite_dump_import
D-Net:person_through_the_graph_relazioniattributate
D-Net:crossref_mapping_improvement
D-Net:pubmed_fix
D-Net:dedup_new_comparators
D-Net:abtracts_guidelines4
D-Net:propagateorcid_publisher
D-Net:incremental_graph
D-Net:merge_resulttypes
D-Net:betaFixPerson
D-Net:k8s_orcidpropagation
D-Net:raw_affiliation_mapping
D-Net:kubernetes
D-Net:mergedids
D-Net:beta_provision_alignment_9.0.0
D-Net:bestinstancetype
D-Net:person_through_the_graph
D-Net:person_through_the_graph_newDevelopments
D-Net:affroNewModelonBeta
D-Net:affiliation-matching
D-Net:beta_fixes_oct
D-Net:dedup_local_test_implementation
D-Net:fix_decision_tree
D-Net:pid_cleaning
D-Net:affRoModelModificationOnmain
D-Net:scholix_small_index
D-Net:beta2main_sept2024
D-Net:osfPreprints_plugin
D-Net:fix_missing_project_rels
D-Net:enrich_more_subject_bug
D-Net:gtr2Publications_plugin
D-Net:merge_by_id_fix
D-Net:dedup_blacklist_fix
D-Net:claim-orgs
D-Net:9126-impact-indicators-wf-optimisation
D-Net:AffiliationFromPublisher
D-Net:pubmed_aggregation_improvements
D-Net:betaAffiliationFromPublisherPages
D-Net:oozie_spark_params
D-Net:GraphAnnotation
D-Net:rest-collector-request-header-map2
D-Net:dedup_countryInference_NPE
D-Net:webCrawlLessBlackList
D-Net:fos_l1l2
D-Net:sdgnodoi
D-Net:person
D-Net:tagOrganization
D-Net:entity_contexts
D-Net:broker_orcid
D-Net:metadata_collection_java_upgrade
D-Net:research_fi_collector_plugin
D-Net:openorgs_optimization
D-Net:checkContextOnDatasourceAndProject
D-Net:affRoFromRawStringmain
D-Net:openorgs_fixes
D-Net:master
D-Net:9872-create-solr-collection-aliases
D-Net:json_payload
D-Net:pivotselectionbypid
D-Net:import_openorg_type
D-Net:irish-oaipmh-exporter
D-Net:fix_mergedcliquesort
D-Net:spark34-integration
D-Net:beta_indexing_May2024
D-Net:mag_only_doi
D-Net:rest-collector-plugin-with-retry
D-Net:dependency-revision
D-Net:beta-release-1.2.5
D-Net:misc_fixes_merge_entities
D-Net:WebCrowlBeta
D-Net:WebCrowl
D-Net:doidoost_dismiss
D-Net:CrossrefFundersMap
D-Net:taggingProjects
D-Net:9647_datacite_affiliations
D-Net:UsageStatsRecordDS
D-Net:mergeutils
D-Net:oaf_country_beta
D-Net:index_records
D-Net:9559_DBLP_data
D-Net:base_stats_job_deprecated
D-Net:SWH_issue_377
D-Net:spark_join_param_tuning
D-Net:crossref_mapping_vocabularies
D-Net:oldPropagationOrganizationCommunity
D-Net:orcid_import
D-Net:dedup-with-dataframe-spark34
D-Net:ticket_8369
D-Net:tweaking_spark_params
D-Net:fc4e-rsac
D-Net:doiboost_authormerger
D-Net:beta_dedup_configuration
D-Net:scholix_flat_indexing
D-Net:scholix_data_type_openaire
D-Net:subjectPropagation
D-Net:enrichment
D-Net:validation
No reviewers
Labels
Clear labels
Something is not working
This issue or pull request already exists
New feature / refactoring
Need some help
Something is wrong
More information is needed
EOSC Research Discovery Graph
EOSC Research Software APIs and Connectors
This won't be fixed
bug
Something is not working
duplicate
This issue or pull request already exists
enhancement
New feature / refactoring
help wanted
Need some help
invalid
Something is wrong
question
More information is needed
RDGraph
EOSC Research Discovery Graph
RSAC
EOSC Research Software APIs and Connectors
wontfix
This won't be fixed
No Label
bug
duplicate
enhancement
help wanted
invalid
question
RDGraph
RSAC
wontfix
Milestone
Clear milestone
No items
No Milestone
Projects
Clear projects
No project
Assignees
Clear assignees
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.
No due date set.
Dependencies
No dependencies set.
Reference: D-Net/dnet-hadoop#103
Reference in New Issue
No description provided.
Delete Branch "miriam.baglioni/dnet-hadoop:dump"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
This PR adds:
fixes:
The PR also extends the dump with the possibility of dumping a single community content and upload it on Zenodo by its own.
It also adds in the dump only the communities with status "all". Those ith status "manager" are not dumped anymore.
Added also specification of dump WFs as subworkflows and the possibility to dump community results indipendently from the status
We shall re-issue the PR for updating the dump procedures as soon as the
dhp-schemas:2.7.17
will be released and made available on thebeta
branch.Pull request closed