Claudio Atzori
|
6dddad86ee
|
[cleaning] title cleaning based on the me.xuender:unidecode library
|
2021-07-28 16:21:29 +02:00 |
Claudio Atzori
|
2fff24df55
|
code formatting
|
2021-07-28 11:34:19 +02:00 |
Miriam Baglioni
|
80d5b3b4de
|
DoiBoost AccessRight #4362 - removing commented code
|
2021-07-28 11:16:49 +02:00 |
Miriam Baglioni
|
5fe016dcbc
|
DoiBoost AccessRight #4362 - related to https://code-repo.d4science.org/D-Net/dnet-hadoop/pulls/126/files#issuecomment-4194
|
2021-07-28 11:14:28 +02:00 |
Miriam Baglioni
|
73ed7374a9
|
merging with branch beta
|
2021-07-28 11:05:16 +02:00 |
Miriam Baglioni
|
43e62fcae9
|
DoiBoost AccessRight #4362 - related to https://code-repo.d4science.org/D-Net/dnet-hadoop/pulls/126/files#issuecomment-4193
|
2021-07-28 11:04:55 +02:00 |
Sandro La Bruzzo
|
16c91203bd
|
implemented workflow of creation action set for scholexplorer
|
2021-07-28 10:30:49 +02:00 |
Miriam Baglioni
|
6c936943aa
|
merging with branch beta
|
2021-07-28 10:24:48 +02:00 |
Sandro La Bruzzo
|
825d9f0289
|
fixed datacite workflow starting from Importing delta
|
2021-07-27 16:09:46 +02:00 |
Claudio Atzori
|
5aa7d16d1b
|
updated assertions in eu.dnetlib.dhp.oa.graph.raw.MappersTest
|
2021-07-27 15:11:58 +02:00 |
Sandro La Bruzzo
|
848aabbb6c
|
minor fix
|
2021-07-25 12:06:41 +02:00 |
Sandro La Bruzzo
|
8fac10c91e
|
fixed wf definition for the creation of the final infospace of scholexplorer
|
2021-07-25 11:15:37 +02:00 |
Sandro La Bruzzo
|
3920c69bc8
|
change implementation of resolve Relation to generate jsonRdd in output
|
2021-07-25 09:51:36 +02:00 |
Claudio Atzori
|
a0393607a7
|
mapping funding relations from Datacite should be done according to the actual result identifier
|
2021-07-23 18:15:08 +02:00 |
Sandro La Bruzzo
|
d9e3b89937
|
implemented last part of workflows to generate scholixGraph
|
2021-07-23 16:38:32 +02:00 |
Sandro La Bruzzo
|
cfde63a7c3
|
fixed resolve relation join
|
2021-07-23 14:17:29 +02:00 |
Sandro La Bruzzo
|
4a439c3863
|
NPE fixed
|
2021-07-23 14:17:29 +02:00 |
Sandro La Bruzzo
|
ca74e8dd02
|
create a separate wf for resolving relation
|
2021-07-23 11:40:06 +02:00 |
Sandro La Bruzzo
|
43e9380cd3
|
update resolve relation to use the same format of openaire graph
|
2021-07-23 11:25:18 +02:00 |
Sandro La Bruzzo
|
058b636d4d
|
added control to check if the entity exists
|
2021-07-22 16:08:54 +02:00 |
Sandro La Bruzzo
|
62ae36a3d2
|
fixed NPE
|
2021-07-22 15:41:38 +02:00 |
Miriam Baglioni
|
1a5b114906
|
DoiBoost AccessRight #4362 - refactoring
|
2021-07-22 12:00:23 +02:00 |
Sandro La Bruzzo
|
31d2d6d41e
|
Scholexplorer: introduction of dedup openaire
|
2021-07-21 18:09:32 +02:00 |
Miriam Baglioni
|
b226ba4439
|
merging with branch beta
|
2021-07-21 09:46:40 +02:00 |
Claudio Atzori
|
10d7b4f0b4
|
filtering 'old' OpenAIRE ids from the entity.originalId[] array in the OAF -> XML serialization procedure
|
2021-07-20 11:52:05 +02:00 |
Miriam Baglioni
|
83fe31c92e
|
changed the name of the workflows
|
2021-07-19 18:19:14 +02:00 |
Miriam Baglioni
|
dd81c36b60
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-07-19 18:18:14 +02:00 |
Miriam Baglioni
|
54acc5373b
|
changed the name of the workflows
|
2021-07-19 18:18:09 +02:00 |
Miriam Baglioni
|
b420b11ed3
|
double the number of partitions in ProcessMag
|
2021-07-19 18:16:23 +02:00 |
Claudio Atzori
|
65934888a1
|
adding record identifier among the originalIds regardless of what IdentifierFactory produces
|
2021-07-19 17:52:52 +02:00 |
Claudio Atzori
|
0977baf41d
|
contents mapped from the stores with 'claim' interpretation will not change their identifier along their way towards the graph
|
2021-07-19 17:43:52 +02:00 |
Miriam Baglioni
|
662c396354
|
double the number of partitions in ConvertCrossrefToOaf
|
2021-07-19 12:41:14 +02:00 |
Miriam Baglioni
|
59530a14fb
|
DoiBoost AccessRight #4362 - set BestAccessRight with the usual comparator
|
2021-07-19 12:34:35 +02:00 |
Miriam Baglioni
|
199123b74b
|
DoiBoost AccessRight #4362 - Fixed issue on date formatting. Added test method and associated resource
|
2021-07-16 17:30:27 +02:00 |
Miriam Baglioni
|
3bc9a05bc9
|
merging with branch beta
|
2021-07-16 10:32:27 +02:00 |
Miriam Baglioni
|
34506df1b6
|
DoiBoost AccessRight #4362 - if the journal is open, the OPEN access right is set on all instances and the color is GOLD (overwriting any color already set in one of the previous steps)
|
2021-07-16 10:29:51 +02:00 |
Claudio Atzori
|
bf9e0d2d4f
|
Merge pull request 'orcid-no-doi' (#123) from enrico.ottonello/dnet-hadoop:orcid-no-doi into beta
Reviewed-on: D-Net/dnet-hadoop#123
|
2021-07-15 17:59:41 +02:00 |
Sandro La Bruzzo
|
7e2caafe84
|
Scholexplorer: fixed mapping typologies
|
2021-07-15 09:53:12 +02:00 |
Miriam Baglioni
|
4da46bb62f
|
merging with branch beta
|
2021-07-14 15:08:52 +02:00 |
Miriam Baglioni
|
09ad7b2a9e
|
DoiBoost AccessRight #4362 - Unpaywall mapped to OAF with OPEN instance (non oa are filtered out) (unknown hostedby) + map the color as it is
|
2021-07-14 14:45:21 +02:00 |
Miriam Baglioni
|
f4f7c6f9d3
|
DoiBoost AccessRight #4362 - Unpaywall mapped to OAF with OPEN instance (non oa are filtered out) (unknown hostedby) + map the color as it is
|
2021-07-14 14:44:54 +02:00 |
Miriam Baglioni
|
6222adf176
|
DoiBoost AccessRight #4362 - added resources and test for crossref mapping (licence part included)
|
2021-07-14 14:42:34 +02:00 |
Miriam Baglioni
|
981b1018f6
|
DoiBoost AccessRight #4362 - decide access right according to licence. Default access right is Unknown
|
2021-07-14 14:42:06 +02:00 |
Sandro La Bruzzo
|
3d8e2aa146
|
Code refactor:
- removed old workflows in doiboost
- split workflow of doiboost into preprocess and process
|
2021-07-14 14:37:06 +02:00 |
Miriam Baglioni
|
441701c85c
|
DoiBoost AccessRight #4362 - If multiple licenses are available, take the one applied to 'vor'
|
2021-07-14 14:14:50 +02:00 |
Sandro La Bruzzo
|
c35c117601
|
fixed process doiboost workflow:
- split OrcidToOAF into two phases, preprocess and process
- updated workflow used in production
|
2021-07-14 12:48:01 +02:00 |
Sandro La Bruzzo
|
bbe8193930
|
merged stable ids
|
2021-07-12 17:00:43 +02:00 |
Claudio Atzori
|
ae2b47b29d
|
[broker] added coalesce(1) on the stats dataset before storing it on postgres
|
2021-07-09 15:47:51 +02:00 |
Sandro La Bruzzo
|
57c74c73c6
|
fixed mistakes in oozie workflow
|
2021-07-09 12:28:09 +02:00 |
Sandro La Bruzzo
|
61ccb54fde
|
removed wrong loop on oozie wf
|
2021-07-09 12:17:57 +02:00 |