openaire-graph-docs/docs/downloads/beginners-kit.md

16 lines
1.5 KiB
Markdown
Raw Permalink Normal View History

2022-12-06 17:43:54 +01:00
---
sidebar_position: 2
---
2022-12-30 15:40:16 +01:00
# Beginner's kit
2022-12-06 17:43:54 +01:00
2023-02-23 12:50:36 +01:00
The large size of the OpenAIRE Graph is a major impediment for beginners to familiarise with the underlying data model and explore its contents.
Working with the Graph in its full size typically requires access to a huge distributed computing infrastructure which cannot be easily accessible to everyone.
[The OpenAIRE Beginners Kit](https://doi.org/10.5281/zenodo.7490191) aims to address this issue. It consists of two components:
2022-12-30 15:40:16 +01:00
2024-01-17 10:36:15 +01:00
<!-- :::caution
This version is not accompanied with public dataset files, hence the files in this section are based on [v6.0.0](/docs/6.0.0/) of the Graph. The current data are only exposed via the [OpenAIRE Graph API](https://graph.openaire.eu/develop/) and added-value services that are built on top of this version of the Graph (e.g., the [OpenAIRE Explore](https://explore.openaire.eu/)). If you are interested to get bulk access to our latest data, please contact us via our [helpdesk](https://graph.openaire.eu/support).
::: -->
2022-12-30 15:40:16 +01:00
* A subset of the Graph composed of the research products published between 2022-06-29 and 2022-12-29, all the entities connected to them and the respective relationships.
2023-02-07 11:50:53 +01:00
* A Zeppelin notebook that demonstrates how you can use PySpark to analyse the Graph and get answers to some interesting research questions. A guide to Apache Zeppelin can be found [here](https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.6.5/bk_zeppelin-component-guide/content/ch_overview.html).