From 6e56aa1a4dcb9c715a4200503f1491b3f22f6916 Mon Sep 17 00:00:00 2001
From: Serafeim Chatzopoulos
Date: Wed, 21 Dec 2022 21:44:50 +0200
Subject: [PATCH 1/5] Add text to compatible sources - aggregation

---
 docs/data-provision/aggregation/compatible-sources.md | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/docs/data-provision/aggregation/compatible-sources.md b/docs/data-provision/aggregation/compatible-sources.md
index 3d6ac0b..48d831e 100644
--- a/docs/data-provision/aggregation/compatible-sources.md
+++ b/docs/data-provision/aggregation/compatible-sources.md
@@ -2,4 +2,10 @@
 sidebar_position: 1
 ---
 
-# OpenAIRE compatible sources
\ No newline at end of file
+# OpenAIRE compatible sources
+
+The OpenAIRE aggregator collects metadata records from content providers compliant with the OpenAIRE guidelines.
+
+The OpenAIRE Guidelines help repository managers expose publications, datasets and CRIS metadata via the OAI-PMH protocol in order to integrate with OpenAIRE infrastructure.
+
+You can find more information in https://guidelines.openaire.eu/en/latest/
\ No newline at end of file

From 099a500e88ec3b6911b394cc849d059c053738e1 Mon Sep 17 00:00:00 2001
From: Claudio Atzori
Date: Thu, 22 Dec 2022 16:21:00 +0100
Subject: [PATCH 2/5] added merge by id description

---
 docs/data-provision/merge-by-id.md | 27 ++++++++++++++++++++++++++-
 1 file changed, 26 insertions(+), 1 deletion(-)

diff --git a/docs/data-provision/merge-by-id.md b/docs/data-provision/merge-by-id.md
index fea9776..199500f 100644
--- a/docs/data-provision/merge-by-id.md
+++ b/docs/data-provision/merge-by-id.md
@@ -1,3 +1,28 @@
 # Merge by id
 
-TODO
\ No newline at end of file
+In the metadata aggregation system it is common to find the same record provided by
+different datasources and, sometimes, even inside the same datasource (especially in
+the case of aggregators). As the harmonisation processes are performed per datasource
+contents, the corresponding records are the output of different mapping implementations.
+This approach has the advantage of being deeply customisable to capture datasource-specific
+aspects, but it leaves room for inconsistencies when evaluating the different mappings
+across the various datasources.
+
+This phase is therefore responsible for compensating for such inconsistencies and performs
+a global grouping of every record available in the graph:
+
+- entities are grouped by [`id`](../data-model/entities/result#id)
+- relations are grouped by [`source`, `target`, `reltype`](../data-model/relationships#the-relationship-object)
+
+This ensures that the same record, possibly assigned to different types by different
+mappings, appears only once in the graph and under a single typing. In case of clashing
+identifiers, the properties are merged (including the provenance information), considering
+the following precedence order for the result typing:
+
+```
+publication > dataset > software > other
+```
+
+The same holds for relationships: as the same (e.g.) DOI-to-DOI citation relation could
+be aggregated from multiple sources, this grouping phase collapses all the different
+duplicates onto a single relation that nevertheless includes all the individual provenances.
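
Reviewer note: the grouping described in the merge-by-id text above can be illustrated with a minimal Python sketch. It is not the pipeline's actual implementation. The record layout (dictionaries with `id`, `type`, `provenance`, and `source`/`target`/`reltype` keys) and the function names `merge_entities` and `merge_relations` are assumptions made for illustration; only the grouping keys and the precedence order come from the patch.

```python
# Precedence for result typing as stated in the text: earlier entries win.
TYPE_PRECEDENCE = ["publication", "dataset", "software", "other"]


def _add_provenances(merged, record):
    """Append provenances not yet present, preserving first-seen order."""
    for prov in record["provenance"]:
        if prov not in merged["provenance"]:
            merged["provenance"].append(prov)


def merge_entities(records):
    """Group entity records by `id` (hypothetical record layout).

    Each group keeps the highest-precedence type and the union of all
    provenances contributed by the individual records.
    """
    grouped = {}
    for rec in records:
        merged = grouped.setdefault(
            rec["id"], {"id": rec["id"], "type": rec["type"], "provenance": []}
        )
        # A lower index in TYPE_PRECEDENCE means a higher precedence.
        if TYPE_PRECEDENCE.index(rec["type"]) < TYPE_PRECEDENCE.index(merged["type"]):
            merged["type"] = rec["type"]
        _add_provenances(merged, rec)
    return list(grouped.values())


def merge_relations(relations):
    """Group relations by (`source`, `target`, `reltype`), merging provenances."""
    grouped = {}
    for rel in relations:
        key = (rel["source"], rel["target"], rel["reltype"])
        merged = grouped.setdefault(
            key,
            {
                "source": rel["source"],
                "target": rel["target"],
                "reltype": rel["reltype"],
                "provenance": [],
            },
        )
        _add_provenances(merged, rel)
    return list(grouped.values())
```

In this sketch, two records that share an `id` but are typed `dataset` and `publication` collapse into a single `publication` carrying both provenances, and duplicate citation relations collapse into one relation that lists every contributing source. Types outside the precedence list are not handled here and would raise a `ValueError`.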
From 8e4172c1f70e6c8876889af1b12f65e574071c35 Mon Sep 17 00:00:00 2001
From: Claudio Atzori
Date: Thu, 22 Dec 2022 16:25:25 +0100
Subject: [PATCH 3/5] usage count text from Dimitris

---
 docs/data-provision/indicators-ingestion/usage-counts.md | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/docs/data-provision/indicators-ingestion/usage-counts.md b/docs/data-provision/indicators-ingestion/usage-counts.md
index d32ef6a..fd98a43 100644
--- a/docs/data-provision/indicators-ingestion/usage-counts.md
+++ b/docs/data-provision/indicators-ingestion/usage-counts.md
@@ -1,7 +1,7 @@
-# Usage counts
+# Usage Statistics Indicators
 
-Usage counts cover the needs of content providers and consumers offering added value to assist them in reaching their goals.
+Usage Statistics indicators for research products, such as publications, datasets, etc., are an important complement to other (traditional and alternative) bibliometric indicators, providing a comprehensive and recent view of the impact of such resources as well as of their authors, institutions and the platforms themselves. They take into account different levels of information: the usage of data sources, the usage of individual items in the context of their resource type, and the usage of individual web resources or files.
 
-They include metrics of usage activity of Open Access Repositories categorizing the data retrieved by country, number of downloads, number of views, number of repositories and all derivative quantitative open metrics, comprehensively.
+Usage Statistics Indicators are built by OpenAIRE's UsageCounts Service. The service collects usage data and consolidated usage statistics reports from its distributed network of data providers (repositories, e-journals, CRIS) using open standards and protocols, and delivers reliable, consolidated and comparable usage metrics, such as counts of item downloads and metadata views, conformant to the COUNTER Code of Practice.
 
-You can find more information about the UsageCounts service [here](https://usagecounts.openaire.eu/).
+You can find more information about the UsageCounts service [here](https://usagecounts.openaire.eu/).
\ No newline at end of file

From 070219b09557096e7b485e8fb53cb28476ffc23c Mon Sep 17 00:00:00 2001
From: Claudio Atzori
Date: Fri, 23 Dec 2022 12:11:59 +0100
Subject: [PATCH 4/5] added synthetic stats page

---
 docs/data-provision/stats.md | 12 ++++++++++++
 sidebars.js                  |  1 +
 2 files changed, 13 insertions(+)
 create mode 100644 docs/data-provision/stats.md

diff --git a/docs/data-provision/stats.md b/docs/data-provision/stats.md
new file mode 100644
index 0000000..9d0de86
--- /dev/null
+++ b/docs/data-provision/stats.md
@@ -0,0 +1,12 @@
+# Stats analysis
+
+The OpenAIRE Graph is also processed by a pipeline for extracting the statistics
+and producing the charts for funders, research initiatives, research infrastructures,
+and policymakers available on [MONITOR](https://monitor.openaire.eu).
+
+Based on the information available on the graph, OpenAIRE provides a set of
+indicators for monitoring the funding and research impact and the uptake of
+Open Science publishing practices, such as Open Access publishing of publications
+and datasets, availability of interlinks between research products, availability
+of post-print versions in institutional or thematic Open Access repositories, etc.
+
diff --git a/sidebars.js b/sidebars.js
index 2d484c2..f799348 100644
--- a/sidebars.js
+++ b/sidebars.js
@@ -151,6 +151,7 @@ const sidebars = {
         },
         { type: 'doc', id: 'data-provision/finalisation' },
         { type: 'doc', id: 'data-provision/indexing' },
+        { type: 'doc', id: 'data-provision/stats' }
       ]
     },
     {

From 29731b7be7870c67ae5db491eb9d8c55d430f38b Mon Sep 17 00:00:00 2001
From: Claudio Atzori
Date: Fri, 23 Dec 2022 12:13:43 +0100
Subject: [PATCH 5/5] added links to the explore, connect, provide portals.
Further adoption of the OpenAIRE Graph shorter wording

---
 docs/data-provision/indexing.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/data-provision/indexing.md b/docs/data-provision/indexing.md
index f56e69d..579c230 100644
--- a/docs/data-provision/indexing.md
+++ b/docs/data-provision/indexing.md
@@ -1,8 +1,8 @@
 # Indexing
 
-The final version of the OpenAIRE Research Graph is indexed on a Solr server that is used by the OpenAIRE portals (EXPLORE, CONNECT, PROVIDE) and APIs, the latter adopted by several third-party applications and organizations, such as:
+The final version of the OpenAIRE Research Graph is indexed on a Solr server that is used by the OpenAIRE portals ([EXPLORE](https://explore.openaire.eu), [CONNECT](https://connect.openaire.eu), [PROVIDE](https://provide.openaire.eu)) and APIs, the latter adopted by several third-party applications and organizations, such as:
 
-* The OpenAIRE Research Graph APIs and Portals will offer to the EOSC (European Open Science Cloud) an Open Science Resource Catalogue, keeping an up to date map of all research results (publications, datasets, software), services, organizations, projects, funders in Europe and beyond.
+* The OpenAIRE Graph APIs and Portals will offer to the EOSC (European Open Science Cloud) an Open Science Resource Catalogue, keeping an up to date map of all research results (publications, datasets, software), services, organizations, projects, funders in Europe and beyond.
 
 * DSpace & EPrints repositories can install the OpenAIRE plugin to expose OpenAIRE compliant metadata records via their OAI-PMH endpoint and offer to researchers the possibility to link their depositions to the funding project, by selecting it from the list of project provided by OpenAIRE.