OpenAIRE Research Graph Dumps
Several dumps are available to suit different needs. All of them are published in the Zenodo community called OpenAIRE Research Graph.
- The whole OpenAIRE Research Graph Dump
Dataset:
Schema:
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
It is composed of several files, so you can download only the parts you are interested in. Each file is a tar archive containing gzip files, each with one JSON record per line.
- The OpenAIRE COVID-19 dump
Dataset:
Schema:
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
It contains metadata records of publications, research data, software and projects on the topic of coronavirus and COVID-19. This dump is part of OpenAIRE's activities to support the fight against COVID-19, together with the OpenAIRE COVID-19 Gateway. The dump consists of a tar archive containing gzip files, each with one JSON record per line.
- The dump of funded products
Dataset:
Schema:
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
It contains metadata records of research products (research literature, data, software, other types of research products) with funding information available in the OpenAIRE Research Graph. Records are grouped by funder, each funder in a dedicated archive file. Each tar archive contains gzip files, each with one JSON record per line.
- The dumps about research communities, initiatives and infrastructures
Dataset:
Schema:
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
The dataset contains one file per community, initiative or infrastructure collaborating with OpenAIRE. See also their community gateways on CONNECT. Each file is a tar archive containing gzip files, each with one JSON record per line.
- The dump of ScholeXplorer
Dataset:
Schema (Scholix version 3):
This dataset is licensed under a CC0 1.0 Universal (CC0 1.0) Public Domain Dedication.
The dataset contains the GZ-compressed dump of the Scholix links exposed by the OpenAIRE ScholeXplorer service.
- The dump of DOIBoost
Dataset:
Publication:
Software:
This dataset is licensed under a Creative Commons Attribution 4.0 International License.
DOIBoost is a metadata collection that enriches CrossRef with inputs from Microsoft Academic Graph, ORCID, and Unpaywall.
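The tar-of-gzip layout described for several of the dumps above (a tar archive containing gzip files, each with one JSON record per line) can be read without unpacking everything to disk. The following is a minimal sketch using only the Python standard library. The function name `iter_records` is hypothetical, it assumes the gzip members have names ending in `.gz`, and the fields of each record depend on the schema linked for the dump you downloaded.

```python
import gzip
import json
import tarfile
from typing import Any, Dict, Iterator


def iter_records(tar_path: str) -> Iterator[Dict[str, Any]]:
    """Yield one parsed JSON record per line from every gzip member of a tar archive.

    Assumption: gzip members are recognised by a ".gz" file name suffix.
    """
    with tarfile.open(tar_path, mode="r") as archive:
        for member in archive:
            # Skip directories and anything that is not a gzip file.
            if not member.isfile() or not member.name.endswith(".gz"):
                continue
            raw = archive.extractfile(member)
            if raw is None:
                continue
            # Decompress on the fly and decode each line as UTF-8 text.
            with gzip.open(raw, mode="rt", encoding="utf-8") as lines:
                for line in lines:
                    line = line.strip()
                    if line:  # tolerate blank lines
                        yield json.loads(line)
```

For example, `for record in iter_records("publication.tar"): ...` would stream records one at a time, where `publication.tar` is a placeholder for whichever archive you downloaded from Zenodo. Streaming keeps memory use low, which matters because the archives can be large.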
Cite us
If you use any of the dumps above for research purposes, please cite it following the recommendation that you find on its Zenodo page.
The OpenAIRE Research Graph and DOIBoost include data from Microsoft Academic Graph (MAG): please also acknowledge MAG following this guideline.
Still using the old XML dumps?
Please migrate to the new JSON dumps. In the meantime, you can still access the old documentation here.