openaire-graph-docs/docs/data-model/entities/data-source.md

4.6 KiB

sidebar_position
2

Data source

OpenAIRE entity instances are created out of data collected from various data sources of different kinds, such as publication repositories, dataset archives, CRIS systems, funder databases, etc. Data sources export information packages (e.g., XML records, HTTP responses, RDF data, JSON) that may contain information on one or more of such entities and possibly relationships between them. For example, a metadata record about a project carries information for the creation of a Project entity and its participants (as Organization entities). It is important, once each piece of information is extracted from such packages and inserted into the OpenAIRE information space as an entity, for such pieces to keep provenance information relative to the originating data source. This is to give visibility to the data source, but also to enable the reconstruction of the very same piece of information if problems arise.

Definitions for the re3data specific elements from: https://gfzpublic.gfz-potsdam.de/rest/items/item_758898_6/component/file_775891/content


The DataSource object

id

Type: String • Cardinality: ONE

Main entity identifier, created according to OpenAIRE_entity_identifier_and_PID_mapping_policy.

originalId

Type: String • Cardinality: MANY

The list of original ids associated to the datasource.

pid

Type: ControlledField • Cardinality: MANY

The persistent identifiers for the datasource.

datasourcetype

Type: ControlledField • Cardinality: ONE

The datasource type (e.g. pubsrepository::institutional, Institutional Repository) as in the vocabulary dnet:datasource_typologies.

openairecompatibility

Type: String • Cardinality: ONE

The OpenAIRE compatibility of the ingested results, indicates which guidelines they are compliant to the vocabulary dnet:datasourceCompatibilityLevel.

officialname

Type: String • Cardinality: ONE

The official name of the datasource.

englishname

Type: String • Cardinality: ONE

The English name of the datasource.

websiteurl

Type: String • Cardinality: ONE

The URL of the website of the datasource.

logourl

Type: String • Cardinality: ONE

The URL of the logo for the datasource.

dateofvalidation

Type: String • Cardinality: ONE

The date of validation against the guidelines for the datasource records.

description

Type: String • Cardinality: ONE

The description for the datasource.

subjects

Type: String • Cardinality: ONE

The subjects of the contents provided by the datasource.

languages

Type: String • Cardinality: MANY

The languages of the contents provided by the datasource (OpenDOAR only).

contenttypes

Type: String • Cardinality: MANY

The typologies of the contents provided by the datasource (OpenDOAR only).

releasestartdate

Type: String • Cardinality: ONE

TODO

releaseenddate

Type: String • Cardinality: ONE

TODO

accessrights

Type: String • Cardinality: ONE

Open, restricted or closed.

uploadrights

Type: String • Cardinality: ONE

Open, restricted or closed.

databaseaccessrestriction

Type: String • Cardinality: ONE

All existing access restrictions to the research data repository. Allowed values are: feeRequired, registration, other (re3data only).

datauploadrestriction

Type: String • Cardinality: ONE

All existing restrictions to the data upload. (re3data only).

versioning

Type: Boolean • Cardinality: ONE

The research data repository supports versioning of research data. (re3data only).

citationguidelineurl

Type: String • Cardinality: ONE

The URL of the research data repository providing information on how to cite its research data. The DataCite citation format is recommended (http://www.datacite.org/whycitedata). (re3data only)

pidsystems

Type: String • Cardinality: ONE

certificates

Type: String • Cardinality: ONE

TODO

policies

Type: String • Cardinality: MANY

TODO

journal

Type: Container • Cardinality: ONE

TODO