openaire-graph-docs/versioned_docs/version-5.1.2/data-model/entities/data-source.md

7.8 KiB

sidebar_position
2

Data sources

OpenAIRE entity instances are created out of data collected from various data sources of different kinds, such as publication repositories, dataset archives, CRIS systems, funder databases, etc. Data sources export information packages (e.g., XML records, HTTP responses, RDF data, JSON) that may contain information on one or more of such entities and possibly relationships between them.

For example, a metadata record about a project carries information for the creation of a Project entity and its participants (as Organization entities). It is important, once each piece of information is extracted from such packages and inserted into the OpenAIRE information space as an entity, for such pieces to keep provenance information relative to the originating data source. This is to give visibility to the data source, but also to enable the reconstruction of the very same piece of information if problems arise.


The DataSource object

id

Type: String • Cardinality: ONE

The OpenAIRE id of the data source, created according to the OpenAIRE entity identifier and PID mapping policy.

"id": "10|issn___print::22c514d022b199c346e7f29ca06efc95"

originalId

Type: String • Cardinality: MANY

The list of original identifiers associated to the datasource.

"originalId": [
    "issn___print::2451-8271",
    ...
]

pid

Type: ControlledField • Cardinality: MANY

The persistent identifiers for the datasource.

"pid": [
    {
        "scheme": "DOI",
        "value": "10.5281/zenodo.4707307" 
    },
    ...
]

datasourcetype

Type: ControlledField • Cardinality: ONE

The datasource type; see the vocabulary dnet:datasource_typologies.

"datasourcetype": {
    "scheme": "pubsrepository::journal",
    "value": "Journal"
}

openairecompatibility

Type: String • Cardinality: ONE

The OpenAIRE compatibility of the ingested results, indicates which guidelines they are compliant according to the vocabulary dnet:datasourceCompatibilityLevel.

"openairecompatibility": "collected from a compatible aggregator"

officialname

Type: String • Cardinality: ONE

The official name of the datasource.

"officialname": "Recent Patents and Topics on Medical Imaging"

englishname

Type: String • Cardinality: ONE

The English name of the datasource.

"englishname": "Recent Patents and Topics on Medical Imaging"

websiteurl

Type: String • Cardinality: ONE

The URL of the website of the datasource.

"websiteurl": "http://dspace.unict.it/"

logourl

Type: String • Cardinality: ONE

The URL of the logo for the datasource.

"logourl": "https://impactum-journals.uc.pt/public/journals/26/pageHeaderLogoImage_en_US.png"

dateofvalidation

Type: String • Cardinality: ONE

The date of validation against the OpenAIRE guidelines for the datasource records.

"dateofvalidation": "2016-10-10"

description

Type: String • Cardinality: ONE

The description for the datasource.

"description": "Recent Patents on Medical Imaging publishes review and research articles, and guest edited single-topic issues on recent patents in the field of medical imaging. It provides an important and reliable source of current information on developments in the field. The journal is essential reading for all researchers involved in Medical Imaging."

subjects

Type: String • Cardinality: MANY

List of subjects associated to the datasource

"subjects": [
    "Medicine",
    "Imaging",
    ...
]

languages

Type: String • Cardinality: MANY

The languages present in the data source's content, as defined by OpenDOAR.

"languages":[ 
    "eng",
    ...
]

contenttypes

Type: String • Cardinality: MANY

Types of content in the data source, as defined by OpenDOAR

"contenttypes": [
    "Journal articles",
    ...
]

releasestartdate

Type: String • Cardinality: ONE

Releasing date of the data source, as defined by re3data.org.

"releasestartdate": "2010-07-24"

releaseenddate

Type: String • Cardinality: ONE

Date when the data source went offline or stopped ingesting new research data. As defined by re3data.org

"releaseenddate": "2016-03-28"

accessrights

Type: String • Cardinality: ONE

Type of access to the data source, as defined by re3data.org. Possible values: { open, restricted, closed }.

"accessrights": "open"

uploadrights

Type: String • Cardinality: ONE

Type of data upload, as defined by re3data.org; one of { open, restricted, closed }.

"uploadrights": "closed"

databaseaccessrestriction

Type: String • Cardinality: ONE

Access restrictions to the research data repository. Allowed values are: { feeRequired, registration, other }.

This field only applies for re3data data source; see re3data schema specification for more details.

"databaseaccessrestriction": "registration"

datauploadrestriction

Type: String • Cardinality: ONE

Upload restrictions applied by the datasource, as defined by re3data.org. One of { feeRequired, registration, other }.

This field only applies for re3data data source; see re3data schema specification for more details.

"datauploadrestriction": "feeRequired registration"

versioning

Type: Boolean • Cardinality: ONE

Whether the research data repository supports versioning: yes if the data source supports versioning, no otherwise.

This field only applies for re3data data source; see re3data schema specification for more details.

"versioning": true

citationguidelineurl

Type: String • Cardinality: ONE

The URL of the data source providing information on how to cite its items. The DataCite citation format is recommended (http://www.datacite.org/whycitedata).

This field only applies for re3data data source; see re3data schema specification for more details.

"citationguidelineurl": "https://physionet.org/about/#citation"

pidsystems

Type: String • Cardinality: ONE

The persistent identifier system that is used by the data source. As defined by re3data.org.

"pidsystems": "hdl"

certificates

Type: String • Cardinality: ONE

The certificate, seal or standard the data source complies with. As defined by re3data.org.

"certificates": "WDS"

policies

Type: String • Cardinality: MANY

Policies of the data source, as defined in OpenDOAR.

journal

Type: Container • Cardinality: ONE

Information about the journal, if this data source is of type Journal.

"container": {
    "edition": "",
    "iss": "5",
    "issnLinking": "",
    "issnOnline": "1873-7625",
    "issnPrinted":"2451-8271",
    "name": "Recent Patents and Topics on Imaging",
    "sp": "12",
    "ep": "22",
    "vol": "50"
}

missionstatementurl

Type: String • Cardinality: ONE

The URL of a mission statement describing the designated community of the data source. As defined by re3data.org

"missionstatementurl": "https://www.sigma2.no/content/nird-research-data-archive"