2022-09-02 18:12:15 +02:00
---
sidebar_position: 2
---
2023-04-05 17:28:12 +02:00
# Data sources
2022-09-07 12:11:05 +02:00
2022-09-23 16:19:32 +02:00
OpenAIRE entity instances are created out of data collected from various data sources of different kinds, such as publication repositories, dataset archives, CRIS systems, funder databases, etc. Data sources export information packages (e.g., XML records, HTTP responses, RDF data, JSON) that may contain information on one or more of such entities and possibly relationships between them.
2022-09-07 12:11:05 +02:00
2022-09-23 16:19:32 +02:00
For example, a metadata record about a project carries information for the creation of a Project entity and its participants (as Organization entities). It is important, once each piece of information is extracted from such packages and inserted into the OpenAIRE information space as an entity, for such pieces to keep provenance information relative to the originating data source. This is to give visibility to the data source, but also to enable the reconstruction of the very same piece of information if problems arise.
2022-09-09 17:38:08 +02:00
2022-09-07 12:11:05 +02:00
---
2022-09-09 17:38:08 +02:00
## The `DataSource` object
2022-09-07 12:11:05 +02:00
### id
_Type: String • Cardinality: ONE_
2022-10-06 13:57:46 +02:00
The OpenAIRE id of the data source, created according to the [OpenAIRE entity identifier and PID mapping policy ](../pids-and-identifiers ).
2022-09-23 16:19:32 +02:00
```json
2023-07-19 10:00:10 +02:00
"id": "issn___print::22c514d022b199c346e7f29ca06efc95"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
### originalId
_Type: String • Cardinality: MANY_
2022-09-23 16:19:32 +02:00
The list of original identifiers associated to the datasource.
```json
"originalId": [
"issn___print::2451-8271",
...
]
```
2022-09-07 12:11:05 +02:00
### pid
2022-09-23 16:19:32 +02:00
2022-09-07 12:11:05 +02:00
_Type: [ControlledField ](other#controlledfield ) • Cardinality: MANY_
The persistent identifiers for the datasource.
2022-09-23 16:19:32 +02:00
```json
"pid": [
{
"scheme": "DOI",
"value": "10.5281/zenodo.4707307"
},
...
]
```
2024-07-30 11:36:02 +02:00
### type
2022-09-07 12:11:05 +02:00
_Type: [ControlledField ](other#controlledfield ) • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
The datasource type; see the vocabulary [dnet:datasource_typologies ](https://api.openaire.eu/vocabularies/dnet:datasource_typologies ).
```json
2024-07-30 11:36:02 +02:00
"type": {
2022-09-23 16:19:32 +02:00
"scheme": "pubsrepository::journal",
"value": "Journal"
}
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### openaireCompatibility
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2024-01-17 13:39:34 +01:00
The OpenAIRE compatibility of the ingested research products, indicates which guidelines they are compliant according to the vocabulary [dnet:datasourceCompatibilityLevel ](https://api.openaire.eu/vocabularies/dnet:datasourceCompatibilityLevel ).
2022-09-23 16:19:32 +02:00
```json
2024-07-30 11:36:02 +02:00
"openaireCompatibility": "collected from a compatible aggregator"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### officialName
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
The official name of the datasource.
2022-09-23 16:19:32 +02:00
```json
2024-07-30 11:36:02 +02:00
"officialBame": "Recent Patents and Topics on Medical Imaging"
2022-09-23 16:19:32 +02:00
```
2024-07-30 11:36:02 +02:00
### englishName
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
The English name of the datasource.
2022-09-23 16:19:32 +02:00
```json
2024-07-30 11:36:02 +02:00
"englishName": "Recent Patents and Topics on Medical Imaging"
2022-09-23 16:19:32 +02:00
```
2024-07-30 11:36:02 +02:00
### websiteUrl
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
The URL of the website of the datasource.
2022-09-23 16:19:32 +02:00
```json
2024-07-30 11:36:02 +02:00
"websiteUrl": "http://dspace.unict.it/"
2022-09-23 16:19:32 +02:00
```
2024-07-30 11:36:02 +02:00
### logoUrl
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
The URL of the logo for the datasource.
2022-09-23 16:19:32 +02:00
```json
2024-07-30 11:36:02 +02:00
"logoUrl": "https://impactum-journals.uc.pt/public/journals/26/pageHeaderLogoImage_en_US.png"
2022-09-23 16:19:32 +02:00
```
2024-07-30 11:36:02 +02:00
### dateOfValidation
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
The date of validation against the OpenAIRE guidelines for the datasource records.
```json
2024-07-30 11:36:02 +02:00
"dateOfValidation": "2016-10-10"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
### description
_Type: String • Cardinality: ONE_
The description for the datasource.
2022-09-23 16:19:32 +02:00
```json
"description": "Recent Patents on Medical Imaging publishes review and research articles, and guest edited single-topic issues on recent patents in the field of medical imaging. It provides an important and reliable source of current information on developments in the field. The journal is essential reading for all researchers involved in Medical Imaging."
```
2022-09-07 12:11:05 +02:00
### subjects
2022-09-23 16:19:32 +02:00
_Type: String • Cardinality: MANY_
List of subjects associated to the datasource
2022-09-07 12:11:05 +02:00
2022-09-23 16:19:32 +02:00
```json
"subjects": [
"Medicine",
"Imaging",
...
]
```
2022-09-07 12:11:05 +02:00
### languages
_Type: String • Cardinality: MANY_
2022-09-23 16:19:32 +02:00
The languages present in the data source's content, as defined by OpenDOAR.
```json
2024-07-30 11:36:02 +02:00
"languages": [
2022-09-23 16:19:32 +02:00
"eng",
...
]
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### contentTypes
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: MANY_
2022-09-23 16:19:32 +02:00
Types of content in the data source, as defined by OpenDOAR
```json
2024-07-30 11:36:02 +02:00
"contentTypes": [
2022-09-23 16:19:32 +02:00
"Journal articles",
...
]
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### releaseStartDate
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Releasing date of the data source, as defined by re3data.org.
```json
2024-07-30 11:36:02 +02:00
"releaseStartDate": "2010-07-24"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### releaseEndDate
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Date when the data source went offline or stopped ingesting new research data. As defined by re3data.org
```json
2024-07-30 11:36:02 +02:00
"releaseEndDate": "2016-03-28"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### accessRights
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Type of access to the data source, as defined by re3data.org. Possible values: `{ open, restricted, closed }` .
```json
2024-07-30 11:36:02 +02:00
"accessRights": "open"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### uploadRights
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Type of data upload, as defined by re3data.org; one of `{ open, restricted, closed }` .
```json
2024-07-30 11:36:02 +02:00
"uploadRights": "closed"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### databaseAccessRestriction
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Access restrictions to the research data repository. Allowed values are: `{ feeRequired, registration, other }` .
This field only applies for re3data data source; see [re3data schema specification ](https://gfzpublic.gfz-potsdam.de/rest/items/item_758898_6/component/file_775891/content ) for more details.
```json
2024-07-30 11:36:02 +02:00
"databaseAccessRestriction": "registration"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### dataUploadRestriction
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Upload restrictions applied by the datasource, as defined by re3data.org. One of `{ feeRequired, registration, other }` .
This field only applies for re3data data source; see [re3data schema specification ](https://gfzpublic.gfz-potsdam.de/rest/items/item_758898_6/component/file_775891/content ) for more details.
```json
2024-07-30 11:36:02 +02:00
"dataUploadRestriction": "feeRequired registration"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
### versioning
_Type: Boolean • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Whether the research data repository supports versioning:
`yes` if the data source supports versioning, `no` otherwise.
This field only applies for re3data data source; see [re3data schema specification ](https://gfzpublic.gfz-potsdam.de/rest/items/item_758898_6/component/file_775891/content ) for more details.
```json
"versioning": true
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### citationGuidelineUrl
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
The URL of the data source providing information on how to cite its items. The DataCite citation format is recommended (http://www.datacite.org/whycitedata).
This field only applies for re3data data source; see [re3data schema specification ](https://gfzpublic.gfz-potsdam.de/rest/items/item_758898_6/component/file_775891/content ) for more details.
```json
2024-07-30 11:36:02 +02:00
"citationGuidelineUrl": "https://physionet.org/about/#citation"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
2024-07-30 11:36:02 +02:00
### pidSystems
2022-09-07 12:11:05 +02:00
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
The persistent identifier system that is used by the data source. As defined by re3data.org.
```json
2024-07-30 11:36:02 +02:00
"pidSystems": "hdl"
2022-09-23 16:19:32 +02:00
```
2022-09-07 12:11:05 +02:00
### certificates
_Type: String • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
The certificate, seal or standard the data source complies with. As defined by re3data.org.
```json
"certificates": "WDS"
```
2022-09-07 12:11:05 +02:00
### policies
_Type: String • Cardinality: MANY_
2022-09-23 16:19:32 +02:00
Policies of the data source, as defined in OpenDOAR.
2022-09-07 12:11:05 +02:00
### journal
_Type: [Container ](other#container ) • Cardinality: ONE_
2022-09-23 16:19:32 +02:00
Information about the journal, if this data source is of type Journal.
```json
"container": {
"edition": "",
"iss": "5",
"issnLinking": "",
"issnOnline": "1873-7625",
"issnPrinted":"2451-8271",
"name": "Recent Patents and Topics on Imaging",
"sp": "12",
"ep": "22",
"vol": "50"
}
```
2024-07-30 11:36:02 +02:00
### missionStatementUrl
2022-09-23 16:19:32 +02:00
_Type: String • Cardinality: ONE_
The URL of a mission statement describing the designated community of the data source. As defined by re3data.org
```json
2024-07-30 11:36:02 +02:00
"missionStatementUrl": "https://www.sigma2.no/content/nird-research-data-archive"
2022-09-23 16:19:32 +02:00
```