Split bulk downloads page in sub-pages

This commit is contained in:
Serafeim Chatzopoulos 2022-11-28 14:19:40 +02:00
parent 2f3e832d4d
commit 989d9ea34c
4 changed files with 63 additions and 27 deletions

View File

@ -0,0 +1,41 @@
---
sidebar_position: 4
---
# Bulk downloads
You can download the full OpenAIRE Research Dump as well as its schema from the following links:
Dataset: https://doi.org/10.5281/zenodo.3516917
Schema: https://doi.org/10.5281/zenodo.4238938
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
It is composed of several files so that you can download the parts you are interested into. The files are named after the entity they store (i.e. publication, dataset). Each file is at most 10GB and it is
a tar archive containing gz files, each with one json per line.
:::note Tip!
For a visual and interactive overview of the JSON schema, we suggest to use a JSON schema viewer like [jsonschemaviewer](https://navneethg.github.io/jsonschemaviewer/) (you just need to copy the schema and then you can easily navigate through the nodes).
:::
<article class="margin-top--lg">
<section class="row list_node_modules-@docusaurus-theme-classic-lib-theme-DocCategoryGeneratedIndexPage-styles-module">
<article class="col col--6 margin-bottom--lg">
<a class="card padding--lg cardContainer_node_modules-@docusaurus-theme-classic-lib-theme-DocCard-styles-module" href="/downloads/starter-kit">
<h2 class="text--truncate cardTitle_node_modules-@docusaurus-theme-classic-lib-theme-DocCard-styles-module" title="Starter-kit">📄️ Starter-kit</h2>
<p class="text--truncate cardDescription_node_modules-@docusaurus-theme-classic-lib-theme-DocCard-styles-module" title=""></p>
</a>
</article>
<article class="col col--6 margin-bottom--lg">
<a class="card padding--lg cardContainer_node_modules-@docusaurus-theme-classic-lib-theme-DocCard-styles-module" href="/downloads/subgraphs">
<h2 class="text--truncate cardTitle_node_modules-@docusaurus-theme-classic-lib-theme-DocCard-styles-module" title="Sub-graphs and other formats">📄️ Sub-graphs and other formats</h2>
<p class="text--truncate cardDescription_node_modules-@docusaurus-theme-classic-lib-theme-DocCard-styles-module" title="In order to facilitate users, different dumps and formats are available under the Zenodo community called OpenAIRE Research Graph. In the following, you can find the list of alternative Dumps currently available:">In order to facilitate users, different dumps and formats are available under the Zenodo community called OpenAIRE Research Graph. In the following, you can find the list of alternative Dumps currently available:
</p>
</a>
</article>
</section>
</article>

View File

@ -0,0 +1,6 @@
---
sidebar_position: 1
---
# Starter-kit

View File

@ -2,22 +2,12 @@
sidebar_position: 4
---
# Bulk downloads
# Sub-graphs and other formats
In order to facilitate users, different dumps are available under the Zenodo community called [OpenAIRE Research Graph](https://zenodo.org/communities/openaire-research-graph).
In the following the list of Dumps available:
In order to facilitate users, different dumps and formats are available under the Zenodo community called [OpenAIRE Research Graph](https://zenodo.org/communities/openaire-research-graph).
In the following, you can find the list of alternative Dumps currently available:
* <strong>The whole OpenAIRE Research Graph Dump</strong>
Dataset: https://doi.org/10.5281/zenodo.3516917
Schema: https://doi.org/10.5281/zenodo.4238938
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
It is composed of several files so that you can download the parts you are interested into. The files are named after the entity they store (i.e. publication, dataset). Each file is at most 10GB and it is
a tar archive containing gz files, each with one json per line.
* <strong>The OpenAIRE COVID-19 dump</strong>
## The OpenAIRE COVID-19 dump
Dataset: https://doi.org/10.5281/zenodo.6638745
@ -28,7 +18,7 @@ a tar archive containing gz files, each with one json per line.
This dump is part of the activities of OpenAIRE to support the fight against COVID-19 together with the OpenAIRE COVID-19 Gateway.
The dump consists of a tar archive containing gzip files with one json per line.
* <strong>The dump of funded products</strong>
## The dump of funded products
Dataset: https://doi.org/10.5281/zenodo.6634431
@ -39,7 +29,7 @@ It contains metadata records of research products (research literature, data, so
information available in the OpenAIRE Graph. Records are grouped by funder in a dedicated archive file. Each tar archive contains
gzip files, each with one json record per line.
* <strong>The dump of delta projects</strong>
## The dump of delta projects
Dataset: https://doi.org/10.5281/zenodo.7119633
@ -49,7 +39,7 @@ gzip files, each with one json record per line.
It contains the metadata records of projects collected by OpenAIRE in a given time frame. Usually one deposition of collected projects is done for each release of the OpenAIRE Graph
The deposition is one tar archive containing gzip files, each with one json record per line.
* <strong>The dumps about research communities, initiatives and infrastructures</strong>
## The dumps about research communities, initiatives and infrastructures
Dataset: https://doi.org/10.5281/zenodo.6638478
@ -59,7 +49,7 @@ gzip files, each with one json record per line.
The dataset contains one file per community/initiative/infrastructure collaborating with OpenAIRE. Check out also their community gateways on
CONNECT. Each file is a tar archive containing gzip files with one json per line. The only communities/research initiative/infrastructure we dump are those visible to everyone.
* <strong>The dump of ScholeXplorer</strong>
## The dump of ScholeXplorer
Dataset: https://doi.org/10.5281/zenodo.6338616
@ -70,9 +60,3 @@ The dataset contains one file per community/initiative/infrastructure collaborat
This dataset is licensed under a CC0 1.0 Universal (CC0 1.0) Public Domain Dedication.
The dataset contains the GZ-compressed dump of the Scholix links exposed by the OpenAIRE ScholeXplorer service.
:::note Tip!
For a visual and interactive overview of the JSON schema, we suggest to use a JSON schema viewer like [jsonschemaviewer](https://navneethg.github.io/jsonschemaviewer/) (you just need to copy the schema and then you can easily navigate through the nodes).
:::

View File

@ -51,9 +51,14 @@ const sidebars = {
href: "https://graph.openaire.eu/develop/overview.html"
},
{
type: 'doc',
id: 'download'
},
type: 'category',
label: 'Bulk downloads',
link: {type: 'doc', id: 'downloads/downloads'},
items: [
{ type: 'doc', id: 'downloads/starter-kit' },
{ type: 'doc', id: 'downloads/subgraphs' },
]
},
{
type: 'category',
label: "Data provision",