From f7e9e93209b2d435edfeba6504d2d515c1d9db7a Mon Sep 17 00:00:00 2001 From: mkallipo <95910739+mkallipo@users.noreply.github.com> Date: Fri, 26 Apr 2024 11:13:04 +0200 Subject: [PATCH] affiliation matching description update --- .../enrichment-by-mining/affiliation_matching.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/graph-production-workflow/enrichment-by-mining/affiliation_matching.md b/docs/graph-production-workflow/enrichment-by-mining/affiliation_matching.md index fadb3d7..a23b711 100644 --- a/docs/graph-production-workflow/enrichment-by-mining/affiliation_matching.md +++ b/docs/graph-production-workflow/enrichment-by-mining/affiliation_matching.md @@ -85,13 +85,13 @@ If multiple matches are found above the desired similarity thresholds, the algor * input * source of affiliations: JSON Crossref or XML Pubmed or Parquet DataCite files. - * organizations: [dix_acad.pkl](https://github.com/mkallipo/affiliation-matching/blob/main/dictionaries/dix_acad.pkl), [dix_mult](https://github.com/mkallipo/affiliation-matching/blob/main/dictionaries/dix_mult.pkl, [dix_city](https://github.com/mkallipo/affiliation-matching/blob/main/dictionaries/dix_city.pkl), [dix_country](https://github.com/mkallipo/affiliation-matching/blob/main/dictionaries/dix_country.pkl) (four pickled dictionaries with keys legalnames and alternativenames of organizations in the ROR database.) + * organizations: [dix_acad.pkl](https://github.com/openaire/affro/blob/main/dictionaries/dix_acad.pkl), [dix_mult](https://github.com/openaire/affro/blob/main/dictionaries/dix_mult.pkl), [dix_city](https://github.com/openaire/affro/blob/main/dictionaries/dix_city.pkl), [dix_country](https://github.com/openaire/affro/blob/main/dictionaries/dix_country.pkl) (four pickled dictionaries with keys legalnames and alternativenames of organizations in the ROR database.) * similarity thresholds: simU for universities, simG for other organizations (default values are simU = 0.64, simG = 0.87). cument-organization pairs which are used as a hint for matching affiliations * output - * [MatchedOrganization](https://github.com/openaire/iis/blob/master/iis-wf/iis-wf-affmatching/src/main/resources/eu/dnetlib/iis/wf/affmatching/model/MatchedOrganization.avdl) avro datastore location with matched publications with organizations. + * JSON file with ROR ids of organizations and corresponding similarity scores for each DOI. ***Limitations:*** -