openaire-graph-docs/docs/changelog.md

247 lines
9.6 KiB
Markdown
Raw Permalink Normal View History

---
sidebar_position: 12
---
2022-11-11 18:15:55 +01:00
# Versions & changelog
2022-11-17 13:28:09 +01:00
## Versioning
2022-11-11 18:15:55 +01:00
2023-11-29 10:18:01 +01:00
Our versioning policy follows the [Semantic Versioning specification](https://semver.org/).
2022-11-11 18:15:55 +01:00
In our case, given a version `MAJOR.MINOR.PATCH`, we increment the:
* `MAJOR` version when the data model of the Graph changes
2022-11-17 13:28:09 +01:00
* `MINOR` version when the pipeline (e.g., different deduplication method, different implementation for an enrichment process) or major data sources change
2022-11-11 18:15:55 +01:00
* `PATCH` version when the graph data are updated
2022-11-17 13:28:09 +01:00
## Changelog
2022-11-11 18:15:55 +01:00
This section documents all notable changes for each graph version.
2023-02-03 12:35:26 +01:00
---
### v7.1.3
_Start Date: 2024-04-10 • Release Date: 2024-04-22 • Dataset release: **no**_
#### Added
- Introduced new Field of Science classifications, reaching a total of ~73Mi publications classified
- General increase of the funded scientific outputs, thanks to the full-text mining scanning new OpenAccess publications, some examples:
- European Commission - EC +7% (from 1.52Mi to 1.62Mi)
- Irish Research Council - IRC +7% (from 12.7K to 13.5K)
- French National Research Agency - ANR +5.8% (from 91.5K to 96.8K)
- National Institute of Health - NIH +5% (from 594K to 626K)
- UK Research and Innovation - UKRI +3.7% (from 434K to 450K)
- General increase of the scientific products with author affiliation information +2% (from 83.12Mi to 84.88Mi)
#### Changed
- Updated Crossref publications to include contents until March 2023
- Updated Datacite contents until March 2024
- Updated ORCID contents until March 2024
### v7.1.2
_Start Date: 2024-03-15 • Release Date: 2024-03-27 • Dataset release: **no**_
#### Added
- General increase of the funded scientific outputs, thanks to the full-text mining scanning new OpenAccess publications
#### Changed
- Updated Crossref publications to include contents until February 2023
- Updated Datacite contents until February 2024
- Updated ORCID contents until February 2024
2024-03-14 10:35:15 +01:00
### v7.1.1
_Start Date: 2024-02-23 • Release Date: 2024-03-06 • Dataset release: **no**_
#### Added
- Updated the content import criteria applied to Datacite, resulting in +13Mi Other Research Products (+167%)
- Introduced project PIDs; DOI currently available for grants funded by FCT and TWCF
#### Changed
- Scientific products typed as "Collection" categorized under "Research Data" instead of "Other Research Product".
- Updated Crossref publications to include contents until January 2023
- Updated Datacite contents until January 2024
2024-02-19 12:19:18 +01:00
### v7.1.0
_Start Date: 2024-01-30 • Release Date: 2024-02-20 • Dataset release: **no**_
#### Added
- The scientific products aggregated increased by ~5Mi records (+1.6%)
#### Changed
- A refined version of the deduplication strategy allowed to catch more duplicates among the scientific products, implying
2024-02-19 12:19:18 +01:00
a decrease of their total number of ~3.2Mi (-1.35%). More details about the deduplication algorithm are available [here](graph-production-workflow/deduplication/research-products).
2024-02-21 10:53:05 +01:00
- Updated Crossref publications to include contents until November 2023
- Updated Datacite contents until December 2023
2024-02-19 12:19:18 +01:00
2024-01-17 10:16:12 +01:00
### v7.0.0
2024-01-17 10:36:15 +01:00
_Start Date: 2023-12-18 • Release Date: 2024-01-06 • Dataset release: **yes**_
2024-01-17 10:16:12 +01:00
#### Added
- the scientific products increased by ~3Mi records (+1.26%)
- the number of relations increased by 28.6Mi (+1%)
- the funded contents increased by 5%, from 3.6Mi to 3,8Mi. Funders that recorded the highest increase include, for example, EC with +120K linked research products, and SFI with +1K products.
#### Changed
This graph release also introduces new fields to identify reseach products published using specific open access models, in diamond journals, and those that received public funding. These fields will also be added to the graph dataset in Zenodo. In details:
2024-01-17 10:16:12 +01:00
- `ResearchProduct.isGreen (true, false)`: indicates whether or not the researh product was published following the green open access model;
- `ResearchProduct.openAccesColor (bronze, gold, hybrid)`: indicates the specific open access model used for the publication;
- `ResearchProduct.isInDiamondJournal (true, false)`: indicates whether or not the research product was published in a diamond journal;
- `ResearchProduct.publicly-funded (true, false)`: indicates whether or not the grants acknowledged by the publication come from public funds.
2024-01-17 10:16:12 +01:00
2023-11-27 11:00:20 +01:00
### v6.2.2
2024-01-17 10:36:15 +01:00
_Start Date: 2023-11-07 • Release Date: 2023-11-23 • Dataset release: **no**_
2023-11-27 11:00:20 +01:00
2023-11-29 10:18:01 +01:00
#### Added
2023-11-27 11:00:20 +01:00
- Imported Opencitation's POCI dataset, containing citations among publications in PubMed
- Imported Affiliations from Crossref and from PubMed
- Imported Software Heritage identifiers for Software records
- Extended coverage of Irish funders imported from Crossref
- Peer reviewed material identified with a revised heuristic that allowed to improve the coverage
- Project references identified by TDM increased by ~10%
- Introduced new Field of Science classifications for ~40Mi publications
2023-11-29 10:18:01 +01:00
#### Changed
2023-11-27 11:00:20 +01:00
- Updated Crossref publications to include contents until October 2023
- Updated Datacite contents until October 2023
- Indicators regarding data source downloads and views taken by usage counts from September 2023
2023-10-31 15:26:29 +01:00
### v6.1.1
2024-01-17 10:36:15 +01:00
_Start Date: 2023-09-11 • Release Date: 2023-10-15 • Dataset release: **no**_
2023-10-31 15:26:29 +01:00
2023-11-29 10:18:01 +01:00
#### Added
- Affiliation (research product to organization) relations from Crossref
2023-10-31 15:26:29 +01:00
- Links to the full text of research products
2023-10-31 15:35:24 +01:00
- Cleaning for author and publisher names (get rid of tabs, CR characters, \n(s), escape double quotes)
2023-10-31 15:26:29 +01:00
#### Changed
2023-10-31 15:35:24 +01:00
- Projects without a grant code are removed
2023-10-31 15:26:29 +01:00
- Crossref dump from July 2023
- ORCID works without a DOI from March 2023
- Usage counts from July 2023
- Datacite contents from early July 2023
- OpenCitations relations from December 2022
2023-08-17 10:30:22 +02:00
### v6.0.0
2024-01-17 10:36:15 +01:00
_Start Date: 2023-07-26 • Release Date: 2023-08-16 • Dataset release: **yes**_
2023-08-16 20:02:52 +02:00
#### Changed
- [Relationship data model](./data-model/relationships/relationship-object): flattened properties source, sourceType, target, targetType
- BIP! indicators are now serialised as an array; see the updated model [here](./data-model/entities/other#bipindicators)
2023-08-16 20:02:52 +02:00
- Crossref dump from June 2023
- ORCID works without a DOI from June 2023
- Usage counts from June 2023
- Datacite contents from June 2023
- OpenCitations relations from January 2023
- BIP! indicators from June 2023
- New Datasources/Services were added, collected from an updated EOSC Service catalogue endpoint
2023-08-17 10:30:22 +02:00
2023-08-17 10:04:14 +02:00
### v5.2.0
2024-01-17 10:36:15 +01:00
_Start Date: 2023-07-03 • Release Date: 2023-07-17 • Dataset release: **no**_
2023-07-19 10:00:10 +02:00
#### Added
- Citations imported from Crossref & MAG
- FoS and SDG classifications introduced for ~16Mi research products
#### Changed
- Removed the numerical prefix from the OpenAIRE identifiers (```"20|openorgs____::..." --> "openorgs____::..."```)
- Dataset file names in the Zenodo depositions changed from `dump` to `dataset`
- Crossref dump from May 2023
- ORCID works without a DOI from June 2023
- Usage counts from April 2023
- Datacite contents from June 2023
- OpenCitations relations from January 2023
- Deduplication of the datasource
- Avoid duplicated organisation PIDs
2023-06-13 15:58:02 +02:00
### v5.1.3
2024-01-17 10:36:15 +01:00
_Start Date: 2023-05-22 • Release Date: 2023-06-12 • Dataset release: **no**_
2023-06-13 15:58:02 +02:00
#### Added
- Datasource and project level usage counts
#### Changed
- Crossref dump from April 2023
- ORCID works without a DOI from May 2023
- Usage counts from April 2023
- Datacite contents from May 2023
- OpenCitations relations from January 2023
- Deduplication of the datasource
### v5.1.2
2024-01-17 10:36:15 +01:00
_Start Date: 2023-03-20 • Release Date: 2023-04-04 • Dataset release: **no**_
#### Changed
- Crossref dump from February 2023
- ORCID works without a DOI from March 2023
- Usage counts from February 2023 (+76% Downloads per Datasource for 2023)
- Datacite contents from mid March 2023
- OpenCitations relations from January 2023
### v5.1.1
2024-01-17 10:36:15 +01:00
_Start Date: 2023-02-13 • Release Date: 2023-03-01 • Dataset release: **no**_
#### Added
- Revised SDG classification: improved coverage (+600K classified DOIs)
- General increase of the funded scientific outputs, thanks to the full text mining scanning new OpenAccess publications
- Integrated contents from
- [EMBL-EBIs Protein Data Bank in Europe](./graph-production-workflow/aggregation/non-compatible-sources/ebi)
- [UniProtKB/Swiss-Prot](./graph-production-workflow/aggregation/non-compatible-sources/uniprot)
#### Changed
- Crossref dump from January 2023
- ORCID works without a DOI from January 2023
- Usage counts from January 2023
- Datacite contents from mid February 2023
- OpenCitations relations from December 2022
2023-02-03 12:35:26 +01:00
### v5.1.0
2024-01-17 10:36:15 +01:00
_Start Date: 2023-01-16 • Release Date: 2023-01-30 • Dataset release: **no**_
2023-02-03 12:35:26 +01:00
#### Added
- Revised SDG classification: better accuracy, lower coverage (will improve in the next months)
#### Changed
2022-11-11 18:15:55 +01:00
2023-02-03 12:35:26 +01:00
- Crossref dump from December 2022
- ORCID works without a DOI from January 2023
- Usage counts from December 2022
- DataCite contents from January 2023
---
2022-11-11 18:15:55 +01:00
2022-11-17 13:28:09 +01:00
### v5.0.0
2022-11-11 18:15:55 +01:00
2024-01-17 10:36:15 +01:00
_Start Date: 2022-12-19 • Release Date: 2022-12-28 • Dataset release: **yes**_
2023-02-03 12:35:26 +01:00
2022-11-17 13:28:09 +01:00
#### Added
2022-11-11 18:15:55 +01:00
- [Impact & Usage indicators](./data-model/entities/research-product.md#indicators) at the level of the research product
- [Beginner's kit](./downloads/beginners-kit) in the Downloads section
- New relationship types were introduced; see the complete list [here](./data-model/relationships/relationship-types)
2022-12-30 22:00:07 +01:00
2022-11-17 13:28:09 +01:00
#### Changed
- FOS and SDGs were removed from the [ResearchProduct.subjects](./data-model/entities/research-product#subjects)
- Measures were removed from the [ResearchProduct.instance](./data-model/entities/research-product#instance)
2023-02-03 12:35:26 +01:00
- Updated DOIBoost to include publications from Crossref and the works from ORCID with a DOI until November 2022
- Added ORCID works without a DOI from November 2022
2022-11-11 18:15:55 +01:00