Claudio Atzori
|
8f309b72ff
|
[dedup] using node names consistently across the workflow
|
2021-04-21 17:54:51 +02:00 |
Claudio Atzori
|
52244f813a
|
merging from enrico.ottonello/dnet-hadoop:orcid-no-doi
|
2021-04-21 12:24:09 +02:00 |
Sandro La Bruzzo
|
fd29307b84
|
updated workflow name
|
2021-04-21 09:21:41 +02:00 |
Claudio Atzori
|
815b9f4d56
|
[openorgs dedup] fixed workflow parameter declarations. Introduced support for resuming the execution from intermediate steps
|
2021-04-20 17:24:45 +02:00 |
Claudio Atzori
|
d0d477cca3
|
code formatting
|
2021-04-20 12:50:34 +02:00 |
miconis
|
0393cdce42
|
addition of alternative names in export queries
|
2021-04-20 12:45:21 +02:00 |
miconis
|
cadd0a5de8
|
modification of the queries for openorgs: they now consider also pending orgs
|
2021-04-20 12:06:56 +02:00 |
Sandro La Bruzzo
|
e06c7f32f6
|
updated id figshare as described in #6377
|
2021-04-20 10:18:07 +02:00 |
Sandro La Bruzzo
|
dbe0d0378e
|
resolved ticket #6377
|
2021-04-20 09:44:44 +02:00 |
Antonis Lempesis
|
625d993cd9
|
added step for observatory db
|
2021-04-20 02:31:06 +03:00 |
Antonis Lempesis
|
25d0512fbd
|
code cleanup
|
2021-04-20 01:43:23 +03:00 |
Sandro La Bruzzo
|
524e5f3092
|
Improved parallelization on transformation wf on hadoop
|
2021-04-19 15:17:25 +02:00 |
Sandro La Bruzzo
|
cdfe01bbae
|
improved parallelization on transformation job
|
2021-04-19 15:14:52 +02:00 |
Sandro La Bruzzo
|
3ae67b7a1d
|
Merge remote-tracking branch 'origin/stable_ids' into stable_ids
|
2021-04-16 17:36:57 +02:00 |
Sandro La Bruzzo
|
a16e5299f9
|
applied unique function on the final dataset
|
2021-04-16 17:36:48 +02:00 |
Claudio Atzori
|
45057440c1
|
code formatting
|
2021-04-16 17:28:25 +02:00 |
Enrico Ottonello
|
34ca792a55
|
Merge branch 'master' of https://code-repo.d4science.org/D-Net/dnet-hadoop into orcid-no-doi
|
2021-04-16 17:18:46 +02:00 |
Enrico Ottonello
|
27068aacd1
|
wf to move orcid-no-doi dataset on the folder ready the import
|
2021-04-16 17:17:47 +02:00 |
miconis
|
7ad573d023
|
bug fix: changed join in propagaterelations without applying filter on the id
|
2021-04-16 16:40:42 +02:00 |
Sandro La Bruzzo
|
67085da305
|
fixed NPE
|
2021-04-16 11:05:58 +02:00 |
Sandro La Bruzzo
|
644aa8f40c
|
Merge remote-tracking branch 'origin/stable_ids' into stable_ids
|
2021-04-16 09:14:26 +02:00 |
Sandro La Bruzzo
|
7d6a80e2f2
|
added new type on MAG mapping
|
2021-04-16 09:14:15 +02:00 |
Claudio Atzori
|
8704d32780
|
code formatting
|
2021-04-15 16:52:58 +02:00 |
Claudio Atzori
|
ba4b4c74d8
|
do not make the identifier prefix depend on the Handle
|
2021-04-15 16:48:26 +02:00 |
Claudio Atzori
|
906d50563c
|
Merge pull request 'properly invalidating impala metadata' (#105) from antonis.lempesis/dnet-hadoop:master into master
Reviewed-on: #105
|
2021-04-15 15:06:22 +02:00 |
Claudio Atzori
|
3d58f95522
|
[stats update] properly invalidating impala metadata
|
2021-04-15 15:03:05 +02:00 |
Antonis Lempesis
|
03d36fadea
|
properly invalidating impala metadata
|
2021-04-15 13:34:22 +03:00 |
miconis
|
f64e57c112
|
refactoring of the id generation, sparkcreatemergerels collects entities to create root id after a join
|
2021-04-15 10:59:24 +02:00 |
miconis
|
176a5e493d
|
Merge branch 'stable_ids' of code-repo.d4science.org:D-Net/dnet-hadoop into stable_ids
|
2021-04-14 18:06:34 +02:00 |
miconis
|
3525a8f504
|
id generation of representative record moved to the SparkCreateMergeRel job
|
2021-04-14 18:06:07 +02:00 |
Claudio Atzori
|
745fa92db8
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-04-14 10:14:00 +02:00 |
Claudio Atzori
|
083c2959dc
|
cleanup
|
2021-04-14 10:13:53 +02:00 |
Sandro La Bruzzo
|
3f77bfceb0
|
fixed test failure on jenkins
|
2021-04-14 10:03:01 +02:00 |
Claudio Atzori
|
3125cef545
|
code formatting
|
2021-04-14 09:11:54 +02:00 |
Sandro La Bruzzo
|
44a0064df6
|
Merge remote-tracking branch 'origin/stable_ids' into stable_ids
|
2021-04-13 17:48:12 +02:00 |
Sandro La Bruzzo
|
479abd10cb
|
Add into ORCID workflow a method that extracts orcid directly to the dump generated by Enrico
|
2021-04-13 17:47:43 +02:00 |
Claudio Atzori
|
710cd1e8f2
|
Merge pull request 'add xslt, personname cleaner' (#104) from andreas.czerniak/BrStableId_dnet-hadoop:stable_ids into stable_ids
Reviewed-on: #104
LGTM
|
2021-04-13 14:43:05 +02:00 |
Claudio Atzori
|
d1ca025b0b
|
[cleaning] remiving authors without fullname or providing 'deactivated' keyword. Removing test test titles
|
2021-04-13 14:32:41 +02:00 |
miconis
|
1542196a33
|
bug fix: starting node of duplicate scan wf changed
|
2021-04-13 10:15:43 +02:00 |
miconis
|
369ed1cd8a
|
bug fix: lookupurl parameter added to dedup record job
|
2021-04-13 09:08:05 +02:00 |
Andreas Czerniak
|
52fbece3b3
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/andreas.czerniak/BrStableId_dnet-hadoop into stable_ids
|
2021-04-13 07:05:09 +02:00 |
Andreas Czerniak
|
d7614c1f85
|
introduce new const
|
2021-04-13 07:04:27 +02:00 |
Andreas Czerniak
|
3b694074ff
|
add xslt, personname cleaner
|
2021-04-13 07:04:27 +02:00 |
Claudio Atzori
|
511c0521e5
|
[dedup] avoiding NPEs handling OpenOrg relations
|
2021-04-12 17:45:11 +02:00 |
Claudio Atzori
|
72dcadd8e6
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-04-12 17:32:09 +02:00 |
Claudio Atzori
|
902d05f548
|
[cleaning] avoiding NPEs handling null author PIDs
|
2021-04-12 17:31:40 +02:00 |
miconis
|
d442e25cbc
|
bug fix: ids in self mergerels are not marked deletedbyinference=true
|
2021-04-12 15:56:22 +02:00 |
miconis
|
dcff9cecdf
|
bug fix: ids in self mergerels are not marked deletedbyinference=true
|
2021-04-12 15:55:27 +02:00 |
Andreas Czerniak
|
34df35926c
|
add xslt, personname cleaner
|
2021-04-09 14:35:36 +02:00 |
miconis
|
11b22b2d23
|
bug fix in the query, it now exports only relations with non-hidden organizations
|
2021-04-08 11:51:47 +02:00 |