forked from D-Net/openaire-graph-docs
download section: zenodo depositions referred as 'dataset' instead of 'dump'
parent
b1344520fa
commit
e81f9e1b11
|
@ -4,10 +4,6 @@ sidebar_position: 2
|
||||||
|
|
||||||
# Beginner's kit
|
# Beginner's kit
|
||||||
|
|
||||||
<!-- :::caution
|
|
||||||
This version is not accompanied with public dump files, hence the files in this section are based on [v5.0.0](/docs/5.0.0/) of the Graph. The data of v.5.1.0 are only exposed via the [OpenAIRE Graph API](https://graph.openaire.eu/develop/) and added-value services that are built on top of this version of the Graph (e.g., the [OpenAIRE Explore](https://explore.openaire.eu/)). If you would be interested to get bulk access to Graph v5.1.0 data, please contact us via our [helpdesk](https://graph.openaire.eu/support).
|
|
||||||
::: -->
|
|
||||||
|
|
||||||
The large size of the OpenAIRE Graph is a major impediment for beginners to familiarise with the underlying data model and explore its contents.
|
The large size of the OpenAIRE Graph is a major impediment for beginners to familiarise with the underlying data model and explore its contents.
|
||||||
Working with the Graph in its full size typically requires access to a huge distributed computing infrastructure which cannot be easily accessible to everyone.
|
Working with the Graph in its full size typically requires access to a huge distributed computing infrastructure which cannot be easily accessible to everyone.
|
||||||
[The OpenAIRE Beginner’s Kit]( https://doi.org/10.5281/zenodo.7490192) aims to address this issue. It consists of two components:
|
[The OpenAIRE Beginner’s Kit]( https://doi.org/10.5281/zenodo.7490192) aims to address this issue. It consists of two components:
|
||||||
|
|
|
@ -2,30 +2,26 @@
|
||||||
sidebar_position: 1
|
sidebar_position: 1
|
||||||
---
|
---
|
||||||
|
|
||||||
# Full graph dump
|
# Full graph dataset
|
||||||
|
|
||||||
<!-- :::caution
|
You can download the full OpenAIRE Graph Dataset as well as its schema from the following links:
|
||||||
This version is not accompanied with public dump files, hence the files in this section are based on [v5.0.0](/docs/5.0.0/) of the Graph. The data of v.5.1.0 are only exposed via the [OpenAIRE Graph API](https://graph.openaire.eu/develop/) and added-value services that are built on top of this version of the Graph (e.g., the [OpenAIRE Explore](https://explore.openaire.eu/)). If you would be interested to get bulk access to Graph v5.1.0 data, please contact us via our [helpdesk](https://graph.openaire.eu/support).
|
|
||||||
::: -->
|
|
||||||
|
|
||||||
You can download the full OpenAIRE Graph Dump as well as its schema from the following links:
|
|
||||||
|
|
||||||
Dataset: https://doi.org/10.5281/zenodo.3516917
|
Dataset: https://doi.org/10.5281/zenodo.3516917
|
||||||
|
|
||||||
Schema: https://doi.org/10.5281/zenodo.4238938
|
Schema: https://doi.org/10.5281/zenodo.4238938
|
||||||
|
|
||||||
The schema used to dump this dataset mirrors the one described in the [Data Model](/data-model).
|
The schema used to create this dataset mirrors the one described in the [Data Model](/data-model).
|
||||||
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
||||||
It is composed of several files so that you can download the parts you are interested into. The files are named after the entity they store (i.e. publication, dataset). Each file is at most 10GB and it is
|
It is composed of several files so that you can download the parts you are interested into. The files are named after the entity they store (i.e. publication, dataset). Each file is at most 10GB and it is
|
||||||
a tar archive containing gz files, each with one json per line.
|
a tar archive containing gz files, each with one json per line.
|
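The archive layout described above (a tar archive containing gz files, each with one JSON record per line) can be read without unpacking everything to disk. The following is a minimal sketch in Python using only the standard library. The function name `iter_records` and the archive file name in the usage note are hypothetical, and the sketch assumes the gzip members carry a `.gz` suffix and are UTF-8 encoded; check the actual file names on the Zenodo page.

```python
import gzip
import json
import tarfile
from typing import Iterator


def iter_records(tar_path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per line from every .gz member of a tar archive.

    Assumption: members are gzip files named *.gz that hold UTF-8 text,
    one JSON record per line.
    """
    with tarfile.open(tar_path, mode="r") as archive:
        for member in archive:
            # Skip directories and anything that is not a gzip file.
            if not member.isfile() or not member.name.endswith(".gz"):
                continue
            raw = archive.extractfile(member)
            if raw is None:
                continue
            # Decompress the member as a text stream and parse it line by line.
            with gzip.open(raw, mode="rt", encoding="utf-8") as lines:
                for line in lines:
                    line = line.strip()
                    if line:
                        yield json.loads(line)
```

For example, `for record in iter_records("publication_1.tar"): ...` would stream the records of one downloaded part (the file name here is illustrative only). Because records are yielded one at a time, memory use stays roughly constant regardless of the archive size.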
||||||
|
|
||||||
## How to acknowledge this work
|
## How to acknowledge this work
|
||||||
|
|
||||||
Open Science services are open and transparent and survive thanks to your active support and to the visibility and reward they gather. If you use one of the [OpenAIRE Graph dumps](https://doi.org/10.5281/zenodo.3516917) for your research, please provide a proper citation following the recommendation that you find on the dump's Zenodo page or as provided below.
|
Open Science services are open and transparent and survive thanks to your active support and to the visibility and reward they gather. If you use one of the [OpenAIRE Graph datasets](https://doi.org/10.5281/zenodo.3516917) for your research, please provide a proper citation following the recommendation that you find on the dataset's Zenodo page or as provided below.
|
||||||
|
|
||||||
:::note How to cite
|
:::note How to cite
|
||||||
|
|
||||||
Manghi P., Atzori C., Bardi A., Baglioni M., Schirrwagen J., Dimitropoulos H., La Bruzzo S., Foufoulas I., Mannocci A., Horst M., Czerniak A., Iatropoulou K., Kokogiannaki A., De Bonis M., Artini M., Lempesis A., Ioannidis A., Manola N., Principe P., Vergoulis T., Chatzopoulos S., Pierrakos D. (2022). "OpenAIRE Research Graph Dump", *Dataset*, Zenodo. [doi:10.5281/zenodo.3516917](https://doi.org/10.5281/zenodo.3516917) ([BibTex](/bibtex/OpenAIRE_Research_Graph_dump.bib))
|
Manghi P., Atzori C., Bardi A., Baglioni M., Schirrwagen J., Dimitropoulos H., La Bruzzo S., Foufoulas I., Mannocci A., Horst M., Czerniak A., Iatropoulou K., Kokogiannaki A., De Bonis M., Artini M., Lempesis A., Ioannidis A., Manola N., Principe P., Vergoulis T., Chatzopoulos S., Pierrakos D. (2022). "OpenAIRE Research Graph Dataset", *Dataset*, Zenodo. [doi:10.5281/zenodo.3516917](https://doi.org/10.5281/zenodo.3516917) ([BibTex](/bibtex/OpenAIRE_Research_Graph_dataset.bib))
|
||||||
:::
|
:::
|
||||||
|
|
||||||
Please also consider citing [other relevant research products](/publications#relevant-research-products) that can be of interest.
|
Please also consider citing [other relevant research products](/publications#relevant-research-products) that can be of interest.
|
||||||
|
|
|
@ -6,7 +6,7 @@ sidebar_position: 4
|
||||||
|
|
||||||
In this page, we list other related datasets; please refer to their respective schema definitions for the data model they follow.
|
In this page, we list other related datasets; please refer to their respective schema definitions for the data model they follow.
|
||||||
|
|
||||||
## The dump of ScholeXplorer
|
## The dataset of ScholeXplorer
|
||||||
|
|
||||||
Dataset: https://doi.org/10.5281/zenodo.6338616
|
Dataset: https://doi.org/10.5281/zenodo.6338616
|
||||||
|
|
||||||
|
@ -15,11 +15,11 @@ In this page, we list other related datasets; please refer to their respective s
|
||||||
Schema (Scholix version 4): https://doi.org/10.5281/zenodo.6351557
|
Schema (Scholix version 4): https://doi.org/10.5281/zenodo.6351557
|
||||||
|
|
||||||
This dataset is licensed under a CC0 1.0 Universal (CC0 1.0) Public Domain Dedication.
|
This dataset is licensed under a CC0 1.0 Universal (CC0 1.0) Public Domain Dedication.
|
||||||
The dataset contains the GZ-compressed dump of the Scholix links exposed by the OpenAIRE ScholeXplorer service.
|
The dataset contains the GZ-compressed dataset of the Scholix links exposed by the OpenAIRE ScholeXplorer service.
|
||||||
|
|
||||||
## The OpenAIRE LOD dump
|
## The OpenAIRE LOD dataset
|
||||||
|
|
||||||
Dataset (RDF dump): https://doi.org/10.5281/zenodo.609943
|
Dataset (RDF): https://doi.org/10.5281/zenodo.609943
|
||||||
|
|
||||||
LOD Ontology: http://lod.openaire.eu/vocab
|
LOD Ontology: http://lod.openaire.eu/vocab
|
||||||
|
|
||||||
|
|
|
@ -2,71 +2,67 @@
|
||||||
sidebar_position: 3
|
sidebar_position: 3
|
||||||
---
|
---
|
||||||
|
|
||||||
# Sub-graph dumps
|
# Sub-graph datasets
|
||||||
|
|
||||||
<!-- :::caution
|
In order to facilitate users, different datasets are available under the Zenodo community called [OpenAIRE Graph](https://zenodo.org/communities/openaire-research-graph).
|
||||||
This version is not accompanied with public dump files, hence the files in this section are based on [v5.0.0](/docs/5.0.0/) of the Graph. More recent versions of the data are only exposed via the [OpenAIRE Graph API](https://graph.openaire.eu/develop/) and added-value services that are built on top of this version of the Graph (e.g., the [OpenAIRE Explore](https://explore.openaire.eu/)). If you would be interested to get bulk access to more recent versions of the Graph data, please contact us via our [helpdesk](https://graph.openaire.eu/support).
|
This page lists all alternative datasets currently available.
|
||||||
::: -->
|
|
||||||
|
|
||||||
In order to facilitate users, different dumps are available under the Zenodo community called [OpenAIRE Graph](https://zenodo.org/communities/openaire-research-graph).
|
|
||||||
This page lists all alternative dumps currently available.
|
|
||||||
|
|
||||||
|
|
||||||
## The OpenAIRE COVID-19 dump
|
## The OpenAIRE COVID-19 dataset
|
||||||
|
|
||||||
Dataset: https://doi.org/10.5281/zenodo.3980490
|
Dataset: https://doi.org/10.5281/zenodo.3980490
|
||||||
|
|
||||||
Schema: https://doi.org/10.5281/zenodo.3974225
|
Schema: https://doi.org/10.5281/zenodo.3974225
|
||||||
|
|
||||||
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
||||||
It contains metadata records of publications, research data, software and projects on the topic of Corona Virus and COVID-19.
|
It contains metadata records of publications, research data, software and projects on the topic of Corona Virus and COVID-19.
|
||||||
This dump is part of the activities of OpenAIRE to support the fight against COVID-19 together with the OpenAIRE COVID-19 Gateway.
|
This dataset is part of the activities of OpenAIRE to support the fight against COVID-19 together with the OpenAIRE COVID-19 Gateway.
|
||||||
The dump consists of a tar archive containing gzip files with one json per line. Please refer [here](#alternative-sub-graph-data-model) for details on the data model of this dump.
|
The dataset consists of a tar archive containing gzip files with one json per line. Please refer [here](#alternative-sub-graph-data-model) for details on the data model of this dataset.
|
||||||
|
|
||||||
## The dump of funded products
|
## The dataset of funded products
|
||||||
|
|
||||||
Dataset: https://doi.org/10.5281/zenodo.4559725
|
Dataset: https://doi.org/10.5281/zenodo.4559725
|
||||||
|
|
||||||
Schema: https://doi.org/10.5281/zenodo.3974225
|
Schema: https://doi.org/10.5281/zenodo.3974225
|
||||||
|
|
||||||
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
||||||
It contains metadata records of research products (research literature, data, software, other types of research products) with funding
|
It contains metadata records of research products (research literature, data, software, other types of research products) with funding
|
||||||
information available in the OpenAIRE Graph. Records are grouped by funder in a dedicated archive file. Each tar archive contains
|
information available in the OpenAIRE Graph. Records are grouped by funder in a dedicated archive file. Each tar archive contains
|
||||||
gzip files, each with one json record per line. The model of this dump differs from the one of the whole graph.
|
gzip files, each with one json record per line. The model of this dataset differs from the one of the whole graph.
|
||||||
Please refer [here](#alternative-sub-graph-data-model) for details on the data model of this dump.
|
Please refer [here](#alternative-sub-graph-data-model) for details on the data model of this dataset.
|
||||||
|
|
||||||
## The dump of delta projects
|
## The dataset of delta projects
|
||||||
|
|
||||||
Dataset: https://doi.org/10.5281/zenodo.6419021
|
Dataset: https://doi.org/10.5281/zenodo.6419021
|
||||||
|
|
||||||
Schema: https://doi.org/10.5281/zenodo.4238938
|
Schema: https://doi.org/10.5281/zenodo.4238938
|
||||||
|
|
||||||
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
||||||
It contains the metadata records of projects collected by OpenAIRE in a given time frame. Usually one deposition of collected projects is done for each release of the OpenAIRE Graph
|
It contains the metadata records of projects collected by OpenAIRE in a given time frame. Usually one deposition of collected projects is done for each release of the OpenAIRE Graph
|
||||||
The deposition is one tar archive containing gzip files, each with one json record per line.
|
The deposition is one tar archive containing gzip files, each with one json record per line.
|
||||||
|
|
||||||
## The dumps about research communities, initiatives and infrastructures
|
## The datasets about research communities, initiatives and infrastructures
|
||||||
|
|
||||||
Dataset: https://doi.org/10.5281/zenodo.3974604
|
Dataset: https://doi.org/10.5281/zenodo.3974604
|
||||||
|
|
||||||
Schema: https://doi.org/10.5281/zenodo.3974225
|
Schema: https://doi.org/10.5281/zenodo.3974225
|
||||||
|
|
||||||
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
|
||||||
The dataset contains one file per community/initiative/infrastructure collaborating with OpenAIRE. Check out also their community gateways on
|
The dataset contains one file per community/initiative/infrastructure collaborating with OpenAIRE. Check out also their community gateways on
|
||||||
CONNECT. Each file is a tar archive containing gzip files with one json per line. The only communities/research initiative/infrastructure we dump are those visible to everyone.
|
CONNECT. Each file is a tar archive containing gzip files with one json per line. The only communities/research initiative/infrastructure included are publicly visible ones.
|
||||||
The model of this dump differs from the one of the whole graph.
|
The model of this dataset differs from the one of the whole graph.
|
||||||
Please refer [here](#alternative-sub-graph-data-model) for details on the data model of this dump.
|
Please refer [here](#alternative-sub-graph-data-model) for details on the data model of this dataset.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
## Alternative sub-graph data model
|
## Alternative sub-graph data model
|
||||||
|
|
||||||
It should be noted that the dumps for research communities, infrastructures, and products related to projects do not strictly follow the main data model of the OpenAIRE Graph. In particular, they differ in the following:
|
It should be noted that the datasets for research communities, infrastructures, and products related to projects do not strictly follow the main data model of the OpenAIRE Graph. In particular, they differ in the following:
|
||||||
|
|
||||||
* only research products are dumped (no relations, and entities different from results)
|
* only research products are included (no relations, and entities different from results)
|
||||||
* the dumped results are extended with information that can be inferred in the whole dump namely:
|
* the results are extended with information that can be inferred in the whole dataset namely:
|
||||||
* funding information if present
|
* funding information if present
|
||||||
* associated research community/infrastructure
|
* associated research community/infrastructure
|
||||||
* associated data sources
|
* associated data sources
|
||||||
|
|
||||||
So they have just one entity type, that is the [Extended Result](alternative-model/extendedresult.md).
|
So they have just one entity type, that is the [Extended Result](alternative-model/extendedresult.md).
|
||||||
|
|
|
@ -0,0 +1,35 @@
|
||||||
|
@dataset{manghi_paolo_2022_7488618,
|
||||||
|
author = {Manghi, Paolo and
|
||||||
|
Atzori, Claudio and
|
||||||
|
Bardi, Alessia and
|
||||||
|
Baglioni, Miriam and
|
||||||
|
Schirrwagen, Jochen and
|
||||||
|
Dimitropoulos, Harry and
|
||||||
|
La Bruzzo, Sandro and
|
||||||
|
Foufoulas, Ioannis and
|
||||||
|
Mannocci, Andrea and
|
||||||
|
Horst, Marek and
|
||||||
|
Czerniak, Andreas and
|
||||||
|
Iatropoulou, Katerina and
|
||||||
|
Kokogiannaki, Argiro and
|
||||||
|
De Bonis, Michele and
|
||||||
|
Artini, Michele and
|
||||||
|
Lempesis, Antonis and
|
||||||
|
Ioannidis, Alexandros and
|
||||||
|
Manola, Natalia and
|
||||||
|
Principe, Pedro and
|
||||||
|
Vergoulis, Thanasis and
|
||||||
|
Chatzopoulos, Serafeim and
|
||||||
|
Pierrakos, Dimitris},
|
||||||
|
title = {OpenAIRE Research Graph Dataset},
|
||||||
|
month = dec,
|
||||||
|
year = 2022,
|
||||||
|
note = {{A new version of this dataset is published every 6
|
||||||
|
months. The content available on the OpenAIRE
|
||||||
|
EXPLORE and CONNECT portals might be more up-to-
|
||||||
|
date with respect to the data you find here.}},
|
||||||
|
publisher = {Zenodo},
|
||||||
|
version = {5.0.0},
|
||||||
|
doi = {10.5281/zenodo.7488618},
|
||||||
|
url = {https://doi.org/10.5281/zenodo.7488618}
|
||||||
|
}
|
|