Merge pull request 'dataset vs data' (#82) from dataset_vs_data into main

Reviewed-on: D-Net/openaire-graph-docs#82
This commit is contained in:
Claudio Atzori 2024-11-04 14:06:50 +01:00
commit 31e6f191ee
4 changed files with 16 additions and 16 deletions

View File

@ -4,7 +4,7 @@ sidebar_position: 2
# Data sources
OpenAIRE entity instances are created out of data collected from various data sources of different kinds, such as publication repositories, dataset archives, CRIS systems, funder databases, etc. Data sources export information packages (e.g., XML records, HTTP responses, RDF data, JSON) that may contain information on one or more of such entities and possibly relationships between them.
OpenAIRE entity instances are created out of data collected from various data sources of different kinds, such as publication repositories, research data archives, CRIS systems, funder databases, etc. Data sources export information packages (e.g., XML records, HTTP responses, RDF data, JSON) that may contain information on one or more of such entities and possibly relationships between them.
For example, a metadata record about a project carries information for the creation of a Project entity and its participants (as Organization entities). It is important, once each piece of information is extracted from such packages and inserted into the OpenAIRE information space as an entity, for such pieces to keep provenance information relative to the originating data source. This is to give visibility to the data source, but also to enable the reconstruction of the very same piece of information if problems arise.

View File

@ -9,7 +9,7 @@ In this page, we descibe the properties of the `ResearchProduct` object.
Moreover, there are the following sub-types of a `ResearchProduct`, that inherit all its properties and further extend it:
* [Publication](#publication)
* [Dataset](#dataset)
* [Data](#data)
* [Software](#software)
* [Other research product](#other-research-product)
@ -32,7 +32,7 @@ _Type: String • Cardinality: ONE_
Type of the research products. Possible types:
* `publication`
* `dataset`
* `data`
* `software`
* `other`
@ -59,7 +59,7 @@ Identifiers of the record at the original sources.
### mainTitle
_Type: String • Cardinality: ONE_
A name or title by which a research product is known. May be the title of a publication, of a dataset or the name of a piece of software.
A name or title by which a research product is known. It may be the title of a publication or the name of a piece of software.
```json
"mainTitle": "The fall of the innovation empire and its possible rise through open science"
@ -420,14 +420,14 @@ Container has information about the conference or journal where the research pro
"vol": "50"
}
```
### Dataset
### Data
Metadata records about research data (includes the subtypes listed [here](http://api.openaire.eu/vocabularies/dnet:result_typologies/dataset)).
#### size
_Type: String • Cardinality: ONE_
The declared size of the dataset.
The declared size of the research data.
```json
"size": "10129818"
@ -436,7 +436,7 @@ The declared size of the dataset.
#### version
_Type: String • Cardinality: ONE_
The version of the dataset.
The version of the research data.
```json
"version": "v1.3"
@ -445,7 +445,7 @@ The version of the dataset.
#### geolocation
_Type: [GeoLocation](other#geolocation) • Cardinality: MANY_
The list of geolocations associated with the dataset.
The list of geolocations associated with the research data.
```json
"geolocation": [

View File

@ -4,7 +4,7 @@ sidebar_position: 2
# Data sources
OpenAIRE entity instances are created out of data collected from various data sources of different kinds, such as publication repositories, dataset archives, CRIS systems, funder databases, etc. Data sources export information packages (e.g., XML records, HTTP responses, RDF data, JSON) that may contain information on one or more of such entities and possibly relationships between them.
OpenAIRE entity instances are created out of data collected from various data sources of different kinds, such as publication repositories, research data archives, CRIS systems, funder databases, etc. Data sources export information packages (e.g., XML records, HTTP responses, RDF data, JSON) that may contain information on one or more of such entities and possibly relationships between them.
For example, a metadata record about a project carries information for the creation of a Project entity and its participants (as Organization entities). It is important, once each piece of information is extracted from such packages and inserted into the OpenAIRE information space as an entity, for such pieces to keep provenance information relative to the originating data source. This is to give visibility to the data source, but also to enable the reconstruction of the very same piece of information if problems arise.

View File

@ -9,7 +9,7 @@ In this page, we descibe the properties of the `ResearchProduct` object.
Moreover, there are the following sub-types of a `ResearchProduct`, that inherit all its properties and further extend it:
* [Publication](#publication)
* [Dataset](#dataset)
* [Data](#data)
* [Software](#software)
* [Other research product](#other-research-product)
@ -32,7 +32,7 @@ _Type: String • Cardinality: ONE_
Type of the research products. Possible types:
* `publication`
* `dataset`
* `data`
* `software`
* `other`
@ -59,7 +59,7 @@ Identifiers of the record at the original sources.
### mainTitle
_Type: String • Cardinality: ONE_
A name or title by which a research product is known. May be the title of a publication, of a dataset or the name of a piece of software.
A name or title by which a research product is known. May be the title of a publication or the name of a piece of software.
```json
"mainTitle": "The fall of the innovation empire and its possible rise through open science"
@ -396,14 +396,14 @@ Container has information about the conference or journal where the research pro
"vol": "50"
}
```
### Dataset
### Data
Metadata records about research data (includes the subtypes listed [here](http://api.openaire.eu/vocabularies/dnet:result_typologies/dataset)).
#### size
_Type: String • Cardinality: ONE_
The declared size of the dataset.
The declared size of the research data.
```json
"size": "10129818"
@ -412,7 +412,7 @@ The declared size of the dataset.
#### version
_Type: String • Cardinality: ONE_
The version of the dataset.
The version of the research data.
```json
"version": "v1.3"
@ -421,7 +421,7 @@ The version of the dataset.
#### geolocation
_Type: [GeoLocation](other#geolocation) • Cardinality: MANY_
The list of geolocations associated with the dataset.
The list of geolocations associated with the research data.
```json
"geolocation": [