Miriam Baglioni
|
cf758f4f91
|
added normalization step for the doi
|
2021-06-30 10:03:15 +02:00 |
Miriam Baglioni
|
801763a0fa
|
there is no more the need to lower case the doi since it is done in the first step. Also changed the creation of the id by using the factory
|
2021-06-29 19:07:23 +02:00 |
Miriam Baglioni
|
a74de1cda2
|
added normalization step to the doi
|
2021-06-29 18:51:11 +02:00 |
Miriam Baglioni
|
06074ea7d3
|
added normalization step to the doi
|
2021-06-29 18:46:08 +02:00 |
Miriam Baglioni
|
8b8ffe82dc
|
added step of normalization for the doi
|
2021-06-29 18:41:39 +02:00 |
Miriam Baglioni
|
50cc21d92e
|
Added method to normalize doi values (lower case, remove all preceeding 10., filtering out doi not starting with 10.)
|
2021-06-29 18:35:28 +02:00 |
Claudio Atzori
|
6d3f960238
|
Merge pull request 'added the missing indicators files' (#120) from antonis.lempesis/dnet-hadoop:stable_ids into stable_ids
Reviewed-on: #120
|
2021-06-29 15:57:39 +02:00 |
Antonis Lempesis
|
ae18171212
|
Merge branch 'stable_ids' into stable_ids
|
2021-06-29 15:33:39 +02:00 |
Antonis Lempesis
|
87f14a3899
|
added the missing indicators files
|
2021-06-29 16:31:51 +03:00 |
Sandro La Bruzzo
|
db933ebd21
|
Merge remote-tracking branch 'origin/stable_ids' into stable_id_scholexplorer
|
2021-06-29 14:16:12 +02:00 |
Sandro La Bruzzo
|
7e08655e5f
|
added relation dates in all scholexplorer Datasources
|
2021-06-29 12:02:03 +02:00 |
Sandro La Bruzzo
|
075055eaca
|
added relation dates in bio mapping
|
2021-06-29 10:33:09 +02:00 |
Sandro La Bruzzo
|
f36f92287d
|
implemented mapping from Crossref Event Data to Oaf
|
2021-06-29 10:21:23 +02:00 |
Claudio Atzori
|
986a8011ec
|
Merge pull request 'copied latest changes from old fork: indicators+monitor institutions' (#119) from antonis.lempesis/dnet-hadoop:stable_ids into stable_ids
Reviewed-on: #119
|
2021-06-29 08:49:12 +02:00 |
Antonis Lempesis
|
018c4eb52c
|
copied latest changes from old fork: indicators+monitor institutions
|
2021-06-28 23:46:52 +03:00 |
Sandro La Bruzzo
|
511ec14c63
|
implemented mapping from EBI and Scholix Resolved to OAF
|
2021-06-28 22:04:22 +02:00 |
Claudio Atzori
|
af42377d0e
|
HttpClient used in metadata collection retries on 502, 503, 504
|
2021-06-28 09:34:30 +02:00 |
Sandro La Bruzzo
|
ad50415167
|
Merge remote-tracking branch 'origin/stable_ids' into stable_id_scholexplorer
|
2021-06-24 17:20:50 +02:00 |
Sandro La Bruzzo
|
80e15cc455
|
implemented mapping from uniprot, pdb and ebi links
|
2021-06-24 17:20:00 +02:00 |
Claudio Atzori
|
67afd06cd1
|
[cleaning] cleaning instance.pid and instance.alternateidentifier using the same procedure used to clean result.pid
|
2021-06-24 12:10:17 +02:00 |
Claudio Atzori
|
2e8fd2c531
|
cleanup
|
2021-06-23 14:38:24 +02:00 |
Claudio Atzori
|
4dc9ebf217
|
[raw_all] fixed unit test
|
2021-06-23 14:38:07 +02:00 |
Claudio Atzori
|
50fc5a64a0
|
[raw_all] Aggregator graph creation merges claims (updates) with the corresponding entity
|
2021-06-23 11:49:42 +02:00 |
Claudio Atzori
|
5edcc6832a
|
applying sonarLint suggestions
|
2021-06-23 09:53:29 +02:00 |
Sandro La Bruzzo
|
080a280bea
|
added pdb to Oaf Transformation
|
2021-06-21 16:23:59 +02:00 |
Sandro La Bruzzo
|
1dc0c59e20
|
merged fix thai dates from stable_ids
|
2021-06-21 10:39:46 +02:00 |
Sandro La Bruzzo
|
dc66cf615b
|
Merge branch 'stable_id_scholexplorer' of code-repo.d4science.org:D-Net/dnet-hadoop into stable_id_scholexplorer
|
2021-06-21 09:38:33 +02:00 |
Sandro La Bruzzo
|
507e42102a
|
added pdb to oaf class
|
2021-06-21 09:36:40 +02:00 |
Sandro La Bruzzo
|
a167543637
|
Merge branch 'stable_ids' of code-repo.d4science.org:D-Net/dnet-hadoop into stable_id_scholexplorer
|
2021-06-21 09:14:11 +02:00 |
Sandro La Bruzzo
|
4fe7b75644
|
renamed packages
|
2021-06-18 16:41:24 +02:00 |
Sandro La Bruzzo
|
3990165d05
|
changed typologies of unresolved relation
|
2021-06-18 11:43:59 +02:00 |
Claudio Atzori
|
2dd5449c13
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-06-18 10:08:15 +02:00 |
Claudio Atzori
|
fd54ecf7bd
|
bumped dhp-schemas dependency version
|
2021-06-18 10:08:07 +02:00 |
Miriam Baglioni
|
180d671127
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-06-18 09:46:18 +02:00 |
Miriam Baglioni
|
13c96622c9
|
-
|
2021-06-18 09:45:16 +02:00 |
Miriam Baglioni
|
b486ae498f
|
added test and test resource to verify the generation of the date of acceptance from the input extracted from the dump
|
2021-06-18 09:43:32 +02:00 |
Miriam Baglioni
|
464c2ddde3
|
changed to split in two steps the generation of the crossref dataset
|
2021-06-18 09:42:31 +02:00 |
Miriam Baglioni
|
6aca0d8ebb
|
added kryo encoding for input files
|
2021-06-18 09:42:07 +02:00 |
Miriam Baglioni
|
3585e53da3
|
changed to split in two steps the generation of the crossref dataset
|
2021-06-18 09:41:23 +02:00 |
Claudio Atzori
|
41b551562e
|
applying PR#115 (DatePicker) on stable_ids
|
2021-06-17 09:33:50 +02:00 |
Sandro La Bruzzo
|
3100166d29
|
Merge remote-tracking branch 'origin/stable_ids' into stable_id_scholexplorer
|
2021-06-16 16:22:16 +02:00 |
Claudio Atzori
|
74833d04f1
|
Merge branch 'pids_beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into stable_ids
|
2021-06-16 15:54:18 +02:00 |
Claudio Atzori
|
7243a40c88
|
code formatting
|
2021-06-16 15:03:03 +02:00 |
Sandro La Bruzzo
|
dfcf78cf24
|
removed wrong code
|
2021-06-16 14:57:42 +02:00 |
Sandro La Bruzzo
|
cc0f2b11fb
|
Implemented mapping from pubmed baseline to OAF
|
2021-06-16 14:56:24 +02:00 |
Miriam Baglioni
|
95885bcf12
|
forces executor Executor memory and driver executor memory to be 7G (trying to avoid OOM)
|
2021-06-16 10:17:52 +02:00 |
Miriam Baglioni
|
2550a73981
|
-
|
2021-06-16 10:04:41 +02:00 |
Miriam Baglioni
|
1c47c0d786
|
modified the number of executors trying to avoid OOM exception
|
2021-06-15 21:05:39 +02:00 |
Miriam Baglioni
|
7deac55138
|
added one option for resume from in the wf
|
2021-06-15 18:38:20 +02:00 |
Antonis Lempesis
|
f7c0b80e35
|
storing result_instance as parquet
|
2021-06-15 14:45:48 +03:00 |