Claudio Atzori
67e37f41fb
Merge pull request 'blacklist filtering moved before the cleanup phase in order to have case sensitive regex' ( #485 ) from dedup_blacklist_fix into beta
...
Reviewed-on: D-Net/dnet-hadoop#485
2024-10-28 09:42:51 +01:00
Claudio Atzori
46dbb62598
Merge pull request ' #9839 : include claimed affiliation relationships' ( #476 ) from claim-orgs into beta
...
Reviewed-on: D-Net/dnet-hadoop#476
2024-10-25 10:12:59 +02:00
Claudio Atzori
d3764265d5
Merge pull request '[dedup] avoid NPEs in the countryInference dedup utility' ( #475 ) from dedup_countryInference_NPE into beta
...
Reviewed-on: D-Net/dnet-hadoop#475
2024-10-25 10:12:06 +02:00
Claudio Atzori
4a9aeb6238
Merge pull request '9126-impact-indicators-wf-optimisation' ( #471 ) from 9126-impact-indicators-wf-optimisation into beta
...
Reviewed-on: D-Net/dnet-hadoop#471
2024-10-25 10:10:44 +02:00
Claudio Atzori
8172bee8c8
Merge pull request 'Minor fixes' ( #496 ) from beta_fixes_oct into beta
...
Reviewed-on: D-Net/dnet-hadoop#496
2024-10-25 10:09:56 +02:00
Miriam Baglioni
e75326d6ec
[FundersMatchFromCrossref] added match from CrossRef to DFG unidentified project
2024-10-25 09:13:54 +02:00
Giambattista Bloisi
6bc741715c
Fix OafMapperUtilsTest.testMergePubs
2024-10-23 14:02:45 +02:00
Giambattista Bloisi
aa7b8fd014
Use workingDir parameter for temporary data of ORCID enrichment
2024-10-23 14:02:17 +02:00
Giambattista Bloisi
0e34b0ece1
Fix imports: point them from the main distribution packages
2024-10-23 14:01:52 +02:00
Giambattista Bloisi
56b05cde0b
Revert the changes for IgnoreUndefined management in tree evaluation
2024-10-11 10:35:15 +02:00
Claudio Atzori
62ff843334
adopting dhp-schemas:8.0.1 to support Auhtor's rawAffiliationString(s). Improved graph2hive implementation
2024-10-08 16:22:54 +02:00
Claudio Atzori
d5867a1992
merged #490
2024-10-08 15:39:59 +02:00
Claudio Atzori
e5df68772d
[graph provision] fixed serialisation of the usage counts as measures in the XML records
2024-10-02 09:35:21 +02:00
Miriam Baglioni
7e6d12fa77
[UsageCount] fixed error
...
(cherry picked from commit 9c9a9562ae
)
2024-10-01 15:55:07 +02:00
Miriam Baglioni
191fc3a461
[UsageCount] add check in case the datasource is not matched against those present in the graph
...
(cherry picked from commit b42bdd5fb3
)
2024-10-01 15:54:31 +02:00
Claudio Atzori
10696f2a44
reverted procedure for creating the UsageCounts actionset
2024-10-01 15:54:13 +02:00
Claudio Atzori
5734b80861
Merge pull request 'datasource table creation split in steps' ( #489 ) from antonis.lempesis/dnet-hadoop:beta into beta
...
Reviewed-on: D-Net/dnet-hadoop#489
2024-09-30 16:34:38 +02:00
Antonis Lempesis
f3c179658a
datasource table creation split in steps
2024-09-30 17:12:21 +03:00
Miriam Baglioni
b18ad035c1
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
2024-09-30 15:10:44 +02:00
Miriam Baglioni
e430826e00
[ImportOC] fix to move original folder instead of extracted ones
2024-09-30 15:10:10 +02:00
Giambattista Bloisi
c45cae447a
Fix: invert the "natural" order when ordering by id lexicographically
2024-09-26 17:08:02 +02:00
Claudio Atzori
3fcafc7ed6
Merge pull request 'Latest institutions in monitor dbs' ( #472 ) from antonis.lempesis/dnet-hadoop:beta into beta
...
Reviewed-on: D-Net/dnet-hadoop#472
2024-09-26 09:49:01 +02:00
Miriam Baglioni
599e56dbc6
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
2024-09-25 17:28:23 +02:00
Claudio Atzori
6397141e56
code formatting
2024-09-25 15:27:32 +02:00
Claudio Atzori
e354f9853a
[OpenCitations] move the extracted contents under a backup path to avoid needing to re-download it in case of errors
2024-09-25 15:27:02 +02:00
Claudio Atzori
535a7b99f1
the metadata collection plugins using the HttpConnector2 class shall now retry instead of failing in case of UnknownHostException
2024-09-25 11:35:34 +02:00
Sandro La Bruzzo
6a097abc89
as described on ticket #9525
...
1. Changed the mapping applied to Crossref records: anything that has a relationship "is-review-of" must be mapped as publication of type "Review".
2. Force the hostedby of Crossref records with DOI prefix 10.3410 and 10.12703 to the H1 Connect data source.
2024-09-25 11:32:54 +02:00
Michele Artini
9754521847
Merge pull request 'fixed a bug with id' ( #486 ) from osfPreprints_plugin into beta
...
Reviewed-on: D-Net/dnet-hadoop#486
2024-09-25 10:02:24 +02:00
Michele Artini
fa2532db30
fixed a bug with id
2024-09-25 09:38:50 +02:00
Michele Artini
54f8b4da39
Merge pull request 'fixed a bug with 'null' string' ( #484 ) from osfPreprints_plugin into beta
...
Reviewed-on: D-Net/dnet-hadoop#484
2024-09-24 15:19:54 +02:00
Michele Artini
b35d046fd2
fixed a bug with 'null' string
2024-09-24 15:18:54 +02:00
Miriam Baglioni
4d3e079590
Merge remote-tracking branch 'origin/beta' into beta
2024-09-24 14:26:29 +02:00
Michele Artini
e941adbe2b
fixed a bug with topic ENRICH/MORE/SUBJECT/ARXIV
2024-09-24 08:57:37 +02:00
Michele Artini
fdbe629f49
removed the deletedByInference=true filter
2024-09-23 15:27:28 +02:00
Antonis Lempesis
619aa34a15
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
2024-09-23 15:25:59 +03:00
Antonis Lempesis
dbea7a4072
removed duplicate line
2024-09-23 14:57:11 +03:00
Antonis Lempesis
c9241dba0d
Merge pull request 'convert_hive_to_spark_actions' ( #1 ) from convert_hive_to_spark_actions into beta
...
Reviewed-on: antonis.lempesis/dnet-hadoop#1
2024-09-23 13:53:28 +02:00
Michele Artini
755a5aefcf
Merge pull request 'osfPreprints_plugin' ( #482 ) from osfPreprints_plugin into beta
...
Reviewed-on: D-Net/dnet-hadoop#482
2024-09-23 10:21:34 +02:00
Michele Artini
2d7a7a962d
unit test @Disabled
2024-09-23 10:19:36 +02:00
Michele Artini
6b0f7cc8b0
skip urls with authentication
2024-09-23 10:16:53 +02:00
Michele Artini
db6f137cf9
Merge pull request 'osfPreprints_plugin' ( #480 ) from osfPreprints_plugin into beta
...
Reviewed-on: D-Net/dnet-hadoop#480
2024-09-20 09:56:50 +02:00
Michele Artini
339d8124f2
osf plugin: links to contributors and primaty_file
2024-09-20 08:44:05 +02:00
Michele Artini
52bb7af03b
use of dom4j
2024-09-19 14:59:05 +02:00
Michele Artini
9073b1159d
partial implementation of osfPreprints plugin + tests
2024-09-19 13:58:53 +02:00
Michele Artini
dcf09811a2
partial implementation of osfPreprints plugin
2024-09-19 12:42:45 +02:00
Claudio Atzori
bfd05cdab2
run mergeResultsOfDifferentTypes only when checkDelegatedAuthority is true
2024-09-17 10:49:32 +02:00
Michele Artini
714a16854e
Merge pull request 'gtr2Publications_plugin' ( #477 ) from gtr2Publications_plugin into beta
...
Reviewed-on: D-Net/dnet-hadoop#477
2024-09-17 10:23:39 +02:00
Michele Artini
a2fac78dcc
fixed a problem in incremental harvesting
2024-09-17 10:16:28 +02:00
Michele Artini
99b7adda0c
gtr2 unit test
2024-09-16 15:13:44 +02:00
Michele Artini
bb9cee4f40
implementation of gtr2Publications plugin
2024-09-16 14:16:56 +02:00