Claudio Atzori
62ff843334
adopting dhp-schemas:8.0.1 to support Auhtor's rawAffiliationString(s). Improved graph2hive implementation
2024-10-08 16:22:54 +02:00
Claudio Atzori
d5867a1992
merged #490
2024-10-08 15:39:59 +02:00
Miriam Baglioni
7e6d12fa77
[UsageCount] fixed error
...
(cherry picked from commit 9c9a9562ae
)
2024-10-01 15:55:07 +02:00
Miriam Baglioni
191fc3a461
[UsageCount] add check in case the datasource is not matched against those present in the graph
...
(cherry picked from commit b42bdd5fb3
)
2024-10-01 15:54:31 +02:00
Claudio Atzori
10696f2a44
reverted procedure for creating the UsageCounts actionset
2024-10-01 15:54:13 +02:00
Miriam Baglioni
e430826e00
[ImportOC] fix to move original folder instead of extracted ones
2024-09-30 15:10:10 +02:00
Miriam Baglioni
599e56dbc6
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
2024-09-25 17:28:23 +02:00
Claudio Atzori
6397141e56
code formatting
2024-09-25 15:27:32 +02:00
Claudio Atzori
e354f9853a
[OpenCitations] move the extracted contents under a backup path to avoid needing to re-download it in case of errors
2024-09-25 15:27:02 +02:00
Sandro La Bruzzo
6a097abc89
as described on ticket #9525
...
1. Changed the mapping applied to Crossref records: anything that has a relationship "is-review-of" must be mapped as publication of type "Review".
2. Force the hostedby of Crossref records with DOI prefix 10.3410 and 10.12703 to the H1 Connect data source.
2024-09-25 11:32:54 +02:00
Michele Artini
fa2532db30
fixed a bug with id
2024-09-25 09:38:50 +02:00
Michele Artini
b35d046fd2
fixed a bug with 'null' string
2024-09-24 15:18:54 +02:00
Miriam Baglioni
4d3e079590
Merge remote-tracking branch 'origin/beta' into beta
2024-09-24 14:26:29 +02:00
Michele Artini
2d7a7a962d
unit test @Disabled
2024-09-23 10:19:36 +02:00
Michele Artini
6b0f7cc8b0
skip urls with authentication
2024-09-23 10:16:53 +02:00
Michele Artini
339d8124f2
osf plugin: links to contributors and primaty_file
2024-09-20 08:44:05 +02:00
Michele Artini
52bb7af03b
use of dom4j
2024-09-19 14:59:05 +02:00
Michele Artini
9073b1159d
partial implementation of osfPreprints plugin + tests
2024-09-19 13:58:53 +02:00
Michele Artini
dcf09811a2
partial implementation of osfPreprints plugin
2024-09-19 12:42:45 +02:00
Michele Artini
a2fac78dcc
fixed a problem in incremental harvesting
2024-09-17 10:16:28 +02:00
Michele Artini
99b7adda0c
gtr2 unit test
2024-09-16 15:13:44 +02:00
Michele Artini
bb9cee4f40
implementation of gtr2Publications plugin
2024-09-16 14:16:56 +02:00
Miriam Baglioni
468f2aa5a5
[AffiliationAffRo]align beta with new affiliation from publisher webpage introduced in production. AffRo collectedfrom OpenAIRE to discriminate against WebCrawl
2024-08-12 18:10:46 +02:00
Miriam Baglioni
89fcf4086c
[Person]fix issue in affiliation relation id construction for person (missing ::)
2024-08-12 18:04:43 +02:00
Claudio Atzori
8e7ef79ce0
[bip affiliations] considers only DOI based records
2024-08-05 12:13:48 +02:00
Claudio Atzori
64740475d0
depending on dhp-schemas:7.0.1
2024-07-29 11:51:42 +02:00
Miriam Baglioni
1af6571474
merging with branch beta
2024-07-25 15:48:05 +02:00
Miriam Baglioni
c7f6669f1a
[webcrawl] the blacklist is now in json and no more in csv after the normalization process
2024-07-25 15:20:18 +02:00
Miriam Baglioni
7cff281d3e
[webcrawl] the blacklist is now in json and no more in csv after the normalization process
2024-07-25 15:16:42 +02:00
Miriam Baglioni
fc60661ac5
[webcrawl] added code and test (code/resource) to verify the deletion of the relations related to results put in blacklist
2024-07-25 12:25:14 +02:00
Miriam Baglioni
6f1801d7d1
[webcrawl]-
2024-07-23 17:34:48 +02:00
Miriam Baglioni
19806c2ae3
[SDG]fixed switch of methods
2024-07-23 17:12:55 +02:00
Miriam Baglioni
9573bf576d
[SDG]added code to ingest also the SDG without DOI
2024-07-23 12:47:57 +02:00
Miriam Baglioni
79985ad197
[Crossref]added mapping for DFG versus the unidentified project [ https://support.openaire.eu/issues/9926?next_issue_id=9924&prev_issue_id=9927#note-4 ]
2024-07-17 18:30:24 +02:00
Claudio Atzori
06e3985b77
merged from beta
2024-07-17 12:01:40 +02:00
Claudio Atzori
83327239de
fixed pom definitions, bumped dependency version for the dhp-schema module, removed unnecessary dependencies
2024-07-17 11:58:48 +02:00
Claudio Atzori
e39e8bbd47
Merge pull request '[WebCrawlAffiliation]remove from the creation of the action set the relations for pmc and pmid. Only doi are allowed' ( #462 ) from affiliationFromWebCrawlOnlyDOI into beta
...
Reviewed-on: D-Net/dnet-hadoop#462
2024-07-17 11:12:32 +02:00
Claudio Atzori
a65241fcaf
Merge pull request 'implementation of the new collector plugin: research_fi' ( #456 ) from research_fi_collector_plugin into beta
...
Reviewed-on: D-Net/dnet-hadoop#456
2024-07-17 10:25:38 +02:00
Claudio Atzori
c99f92efaa
Merge pull request '[beta] OpenAIRE Affiliation Inference' ( #452 ) from affRoFromRawString into beta
...
Reviewed-on: D-Net/dnet-hadoop#452
2024-07-17 10:24:39 +02:00
Miriam Baglioni
d96215cb9b
[UnpayWall]added othe : in the identifier construction
2024-07-16 18:17:32 +02:00
Miriam Baglioni
9246bdec1c
[WebCrawlAffiliation]remove from the creation of the action set the relations for pmc and pmid. Only doi are allowed
2024-07-16 14:07:37 +02:00
Claudio Atzori
61d1fa9b9f
[metadata collection] added -Dcom.sun.security.enableAIAcaIssuers=true as a default for metadata collection
2024-07-12 10:26:45 +02:00
Claudio Atzori
f9ed2ae33c
[metadata collection] added the possibility to specify the JAVA_HOME and the JAVA_OPTS parameters
2024-07-11 15:32:36 +02:00
Michele Artini
bbe52584f7
log message
2024-07-11 15:14:34 +02:00
Michele Artini
5cdba9172b
implementeation of the new collector plugin: research_fi
2024-07-10 14:53:13 +02:00
Miriam Baglioni
c465835061
[Person]new implementation for the extraction of the coAuthorship relations
2024-07-09 12:29:55 +02:00
Miriam Baglioni
814e650e12
[Irish Tender]changed the irish.json file according to comments #26 , #29 , and #34 for 9635
2024-07-04 12:24:28 +02:00
Miriam Baglioni
ddd20e7f8e
[Person]first implementation of the action set to include Person entity in the graph starting from the orcid data
2024-07-04 12:08:46 +02:00
Miriam Baglioni
9cbe966b4a
[AffiliationIngestion]refactoring
2024-06-29 18:35:49 +02:00
Miriam Baglioni
236b64d830
[AffiliationIngestion]Extended the ingestion of affiliation from open aire to include also links derived from Web Crawl. Extended the test. Inserted in Constatns the id and name of the webcrawl datasource to be used here and also in the ingestion of links from web crawl
2024-06-29 18:29:20 +02:00