From 1c668b7fd81073f4b52e27748d117a56fcc79161 Mon Sep 17 00:00:00 2001 From: LSmyrnaios Date: Mon, 3 Jul 2023 18:07:01 +0300 Subject: [PATCH] Eliminate the "ambiguous unicode character" warning on Gitea. --- .../enrichment-by-mining/enrichment-by-mining.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/graph-production-workflow/enrichment-by-mining/enrichment-by-mining.md b/docs/graph-production-workflow/enrichment-by-mining/enrichment-by-mining.md index b30718d..1561699 100644 --- a/docs/graph-production-workflow/enrichment-by-mining/enrichment-by-mining.md +++ b/docs/graph-production-workflow/enrichment-by-mining/enrichment-by-mining.md @@ -5,7 +5,7 @@ import DocCardList from '@theme/DocCardList'; **OpenAIRE** collects the full-texts of the publications, in order to apply TDM (Text and Data Mining) algorithms on them and enrich the Graph with inference links. -The collection of the full-texts is handled by the internal **PDF Aggregation Service**. This service uses the publications’ urls, from the OpenAIRE Graph and state-of-the-art algorithms, to crawl the web and try to locate and download the full-texts of the open access publications, while focusing on the most recent ones. It respects the servers of the repositories and publishers and avoids overloading them. +The collection of the full-texts is handled by the internal **PDF Aggregation Service**. This service uses the publications' urls, from the OpenAIRE Graph and state-of-the-art algorithms, to crawl the web and try to locate and download the full-texts of the open access publications, while focusing on the most recent ones. It respects the servers of the repositories and publishers and avoids overloading them. The service is orchestrating a distributed execution system, on the cloud, with multiple microservices running in parallel, in order to efficiently process and download a large number of publications. The microservices store the generated report records for the publications, in a database, and the full-texts in an S3 Object Store.