From d19f9e4342f7945a664f76d68ab917262c49c42f Mon Sep 17 00:00:00 2001 From: Claudio Atzori Date: Tue, 7 Feb 2023 11:58:00 +0100 Subject: [PATCH] Added link to a Zeppelin guide --- versioned_docs/version-5.1.0/downloads/beginners-kit.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/versioned_docs/version-5.1.0/downloads/beginners-kit.md b/versioned_docs/version-5.1.0/downloads/beginners-kit.md index 9555ee4..b6af1df 100644 --- a/versioned_docs/version-5.1.0/downloads/beginners-kit.md +++ b/versioned_docs/version-5.1.0/downloads/beginners-kit.md @@ -4,9 +4,9 @@ sidebar_position: 2 # Beginner's kit -The large size of the OpenAIRE Research Graph is a major impediment for beginners to familiarise with the underlying data model and explore its contents. +The large size of the OpenAIRE Research Graph is a major impediment for beginners to familiarise with the underlying data model and explore its contents. Working with the Graph in its full size typically requires access to a huge distributed computing infrastructure which cannot be easily accessible to everyone. [The OpenAIRE Beginner’s Kit]( https://doi.org/10.5281/zenodo.7490192) aims to address this issue. It consists of two components: * A subset of the Graph composed of the research products published between 2022-06-29 and 2022-12-29, all the entities connected to them and the respective relationships. -* A Zeppelin notebook that demonstrates how you can use PySpark to analyse the Graph and get answers to some interesting research questions. \ No newline at end of file +* A Zeppelin notebook that demonstrates how you can use PySpark to analyse the Graph and get answers to some interesting research questions. A guide to Apache Zeppelin can be found [here](https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.6.5/bk_zeppelin-component-guide/content/ch_overview.html). \ No newline at end of file