Miriam Baglioni
|
9bc4fd3b69
|
Patch FCT relations - fixed issue with join
|
2021-07-30 10:34:05 +02:00 |
Miriam Baglioni
|
2fc89fc9b5
|
Merge branch 'fct_project_id_replacement' of https://code-repo.d4science.org/D-Net/dnet-hadoop into fct_project_id_replacement
|
2021-07-30 10:20:43 +02:00 |
Claudio Atzori
|
081fe92a21
|
Merge branch 'fct_project_id_replacement' of https://code-repo.d4science.org/D-Net/dnet-hadoop into fct_project_id_replacement
|
2021-07-30 10:13:56 +02:00 |
Claudio Atzori
|
576693d782
|
added unit test for PatchRelationsApplication
|
2021-07-30 10:13:33 +02:00 |
Claudio Atzori
|
6e3554a45e
|
[provision] lowercase relation filter
|
2021-07-29 13:56:37 +02:00 |
Claudio Atzori
|
e725c88ebb
|
[raw_all] patching relation identifier phase to be run at the end, i.e. includes also claimed relations
|
2021-07-29 13:03:43 +02:00 |
Claudio Atzori
|
f83dd70e1c
|
Merge pull request 'Patch the identifiers (source/target) in the relations' (#125) from fct_project_id_replacement into master
Reviewed-on: D-Net/dnet-hadoop#125
|
2021-07-29 12:11:27 +02:00 |
Claudio Atzori
|
5f7330d407
|
Merge branch 'master' into fct_project_id_replacement
|
2021-07-29 11:38:22 +02:00 |
Claudio Atzori
|
1923c1ce21
|
replaced full join + filtering with a left join
|
2021-07-29 11:36:20 +02:00 |
Claudio Atzori
|
a9961a1835
|
[cleaning] title cleaning based on the me.xuender:unidecode library
|
2021-07-28 16:36:33 +02:00 |
Alessia Bardi
|
9594343725
|
code formatting after mvn compile
|
2021-07-28 11:41:34 +02:00 |
Claudio Atzori
|
d267dce520
|
[raw_all] added extra workflow step for patching the identifiers in the relations, given an id mapping dataset
|
2021-07-27 17:18:29 +02:00 |
Claudio Atzori
|
998b66855a
|
updated assertions in eu.dnetlib.dhp.oa.graph.raw.MappersTest
|
2021-07-27 15:11:37 +02:00 |
Claudio Atzori
|
5b6844b969
|
mapping funding relations from Datacite should be done according to the actual result identifier
|
2021-07-23 18:14:37 +02:00 |
Claudio Atzori
|
ffdb2a3ea3
|
[cleaning] fixed filtering function for missing titles
|
2021-07-23 11:55:55 +02:00 |
Alessia Bardi
|
9069958479
|
tests for enermaps
|
2021-07-20 19:31:43 +02:00 |
Claudio Atzori
|
77e8c6c7f7
|
filtering 'old' OpenAIRE ids from the entity.originalId[] array in the OAF -> XML searialization procedure
|
2021-07-20 11:51:33 +02:00 |
Claudio Atzori
|
5947cddafc
|
adding record identifier among the originalIds regardless of what IdentifierFactory produces
|
2021-07-19 17:52:24 +02:00 |
Miriam Baglioni
|
13cf444f85
|
Merge pull request 'force orginalId for claimed records' (#124) from forceOrginalId_claims into master
Reviewed-on: D-Net/dnet-hadoop#124
|
2021-07-19 17:41:58 +02:00 |
Claudio Atzori
|
5e5f65a3c3
|
contents mapped from the stores with 'claim' interpretation will not change their identifier along their way towards the graph
|
2021-07-19 15:56:55 +02:00 |
Claudio Atzori
|
9913b6073c
|
Merge pull request 'orcid-no-doi' (#123) from enrico.ottonello/dnet-hadoop:orcid-no-doi into master
Reviewed-on: D-Net/dnet-hadoop#123
|
2021-07-15 17:53:58 +02:00 |
Enrico Ottonello
|
2dc50c0999
|
added default value to process path
|
2021-07-14 17:02:22 +02:00 |
Enrico Ottonello
|
66604bb2b4
|
added absolute path to process folder
|
2021-07-14 16:44:51 +02:00 |
Enrico Ottonello
|
7840cc6526
|
merged with master
|
2021-07-14 15:33:59 +02:00 |
Enrico Ottonello
|
a65667d217
|
added publication to dataset even if no contributors
|
2021-07-14 15:07:07 +02:00 |
Sandro La Bruzzo
|
10068c00ea
|
Code refactor:
- removed old workflows in doiboost
- splitted workflow of doiboost in preprocess and process
|
2021-07-14 14:45:50 +02:00 |
Miriam Baglioni
|
1cdd09cd8e
|
Tentative fix for testing of Jenkins
|
2021-07-14 11:14:59 +02:00 |
Sandro La Bruzzo
|
4cb65bc64a
|
fixed process doiboost workflow:
- splitted OrcidToOAF into two phase preprocess and process
- updated workflow used in production
|
2021-07-14 09:44:32 +02:00 |
Claudio Atzori
|
734de62474
|
[doiboost] added workflow for the ActionSet update dedicated to production
|
2021-07-13 17:26:04 +02:00 |
Claudio Atzori
|
fa720c1da4
|
[doiboost] added workflow for the ActionSet update dedicated to production
|
2021-07-13 16:59:30 +02:00 |
Claudio Atzori
|
9629569e22
|
Merge branch 'master' of https://code-repo.d4science.org/D-Net/dnet-hadoop
|
2021-07-13 16:04:08 +02:00 |
Claudio Atzori
|
f13e11e3f7
|
[aggregation] datacite wf: defined parameter declaring the path used to store the OAF objects produced by the transformation phase
|
2021-07-13 16:04:02 +02:00 |
Miriam Baglioni
|
f5486ffb14
|
Fixed issues to tests
|
2021-07-13 14:07:45 +02:00 |
Claudio Atzori
|
e0061232e9
|
[aggregation] datacite wf: conditional creation of links, optional resume from intermediate phases
|
2021-07-13 13:41:21 +02:00 |
Claudio Atzori
|
28a66af425
|
updated URL in the issueManagement tag
|
2021-07-13 11:52:24 +02:00 |
Claudio Atzori
|
783988af06
|
depending on dhp-schemas:2.6.14 (release)
|
2021-07-13 11:17:25 +02:00 |
Claudio Atzori
|
9038fdc771
|
depending on dhp-schemas:2.7.14 (release)
|
2021-07-12 17:46:12 +02:00 |
Sandro La Bruzzo
|
bbe8193930
|
merged stable ids
|
2021-07-12 17:00:43 +02:00 |
Claudio Atzori
|
ae2b47b29d
|
[broker] added coalesce(1) on the stats dataset before storing it on postgres
|
2021-07-09 15:47:51 +02:00 |
Sandro La Bruzzo
|
57c74c73c6
|
fixed mistakes in oozie workflow
|
2021-07-09 12:28:09 +02:00 |
Sandro La Bruzzo
|
61ccb54fde
|
removed wrong loop on oozie wf
|
2021-07-09 12:17:57 +02:00 |
Sandro La Bruzzo
|
9f5a0f3ab6
|
moved wf indexing of Scholexplorer in dhp-graph-provision
|
2021-07-09 12:06:43 +02:00 |
Sandro La Bruzzo
|
09fccf8000
|
added workflow to serialize scholix and summary in json
|
2021-07-09 11:01:42 +02:00 |
Sandro La Bruzzo
|
0ea576745f
|
updated CreateInputGraph because ggenerics don't work on Spark Dataset
|
2021-07-09 10:29:24 +02:00 |
Sandro La Bruzzo
|
cd17e19044
|
implemented branch workflow to import datacite and crossref in scholexplorer
|
2021-07-08 21:20:19 +02:00 |
Miriam Baglioni
|
c30f3ce647
|
merge doi normalization
|
2021-07-08 19:20:02 +02:00 |
Sandro La Bruzzo
|
8a034e46e1
|
updated baseline workflow
|
2021-07-08 11:11:41 +02:00 |
Claudio Atzori
|
b7b8e0986e
|
[raw_all] The claim merge procedure includes the claimed contexts in the merged result
|
2021-07-08 10:42:31 +02:00 |
Sandro La Bruzzo
|
0799ac9fb6
|
fixed wrong path
|
2021-07-08 10:36:37 +02:00 |
Sandro La Bruzzo
|
4d53402712
|
extended ebiLinks to create a dataset before generation of OAF
|
2021-07-08 10:26:21 +02:00 |