Miriam Baglioni
|
b0d86d32b0
|
added list of authors to be merged
|
2021-07-08 18:56:29 +02:00 |
Miriam Baglioni
|
abe546e5ba
|
added resource files to test the author merger with empty crossref and other merging providers (related to DoiBoostAuthorMerger)
|
2021-07-08 18:55:55 +02:00 |
Miriam Baglioni
|
bf24f588e2
|
Added test for empty author list for crossref and other merging providers (related to DoiBoostAuthorMerger)
|
2021-07-08 18:55:13 +02:00 |
Miriam Baglioni
|
96255fa647
|
-
|
2021-07-08 18:54:27 +02:00 |
Miriam Baglioni
|
0e47e94099
|
Added variable to verify if crossref is base for the merging of authors (related to DoiBoostAuthorMerger)
|
2021-07-08 18:54:07 +02:00 |
Miriam Baglioni
|
434aa6380b
|
Adding description of the merging process for DoiBoost (related to DoiBoostAuthorMerger) - to be refined
|
2021-07-08 18:53:15 +02:00 |
Miriam Baglioni
|
e0e80cde22
|
Added class to store the most similar author list to be enriched w.r.t. one enriching author (related to DoiBoostAuthorMerger)
|
2021-07-08 18:52:25 +02:00 |
Miriam Baglioni
|
97e0c27db9
|
Added check for empty author list. If crossref is empty, the longest from all the merging providers is taken. If crossref is not empty, crossref is chosen as base for the enrichment
|
2021-07-08 15:27:05 +02:00 |
Miriam Baglioni
|
3ed90420e4
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-07-05 16:48:19 +02:00 |
Miriam Baglioni
|
7498e63174
|
added resource files for testing of DoiBoostAuthorMerger
|
2021-07-05 16:26:46 +02:00 |
Miriam Baglioni
|
22ce947335
|
added resource files for testing of DoiBoostAuthorMerger
|
2021-07-05 16:26:17 +02:00 |
Miriam Baglioni
|
f64f5d9e23
|
first implementation and test class for the specific Author Merger for doiboost. First change: crossref as base to be enriched. Modified the normalization function to remove accents from words
|
2021-07-05 16:24:47 +02:00 |
Miriam Baglioni
|
238d692a0a
|
apply specific AuthorMerger for doiboost
|
2021-07-05 16:23:33 +02:00 |
Miriam Baglioni
|
7177c25261
|
added check for null value during doi normalization
|
2021-07-05 16:22:38 +02:00 |
Miriam Baglioni
|
0892cad4e8
|
the normalization of the content of value was not visible outside the block. Moved the doi normalization to the point where the value is returned
|
2021-07-05 16:21:42 +02:00 |
Claudio Atzori
|
350a0823bd
|
Merge pull request 'using organization ids instead of names in monitor db creation' (#121) from antonis.lempesis/dnet-hadoop:stable_ids into stable_ids
Reviewed-on: D-Net/dnet-hadoop#121
|
2021-07-05 11:07:39 +02:00 |
Antonis Lempesis
|
89e6f46682
|
using organization ids instead of names in monitor db creation
|
2021-07-05 12:00:00 +03:00 |
Miriam Baglioni
|
bc34347643
|
added assertions to verify doi normalization
|
2021-06-30 14:37:08 +02:00 |
Miriam Baglioni
|
86f47afcc7
|
slight modification of the resource to accommodate also doi normalization tests
|
2021-06-30 14:36:49 +02:00 |
Miriam Baglioni
|
03767ea8e6
|
slight modification of the resource to accommodate also doi normalization tests
|
2021-06-30 13:21:24 +02:00 |
Miriam Baglioni
|
f8eec0ca9a
|
added resource to test the normalization of doi during the import of MAG
|
2021-06-30 13:19:54 +02:00 |
Miriam Baglioni
|
149f85ddf5
|
added tests for the normalization of the dois
|
2021-06-30 13:00:52 +02:00 |
Miriam Baglioni
|
e487b5544c
|
added tests for the normalization of the dois
|
2021-06-30 12:57:11 +02:00 |
Miriam Baglioni
|
1503ccbbb5
|
added tests for the normalization of the dois
|
2021-06-30 12:55:37 +02:00 |
Miriam Baglioni
|
1299bfb357
|
Added class to test the normalization of doi
|
2021-06-30 12:53:27 +02:00 |
Miriam Baglioni
|
cf758f4f91
|
added normalization step for the doi
|
2021-06-30 10:03:15 +02:00 |
Miriam Baglioni
|
801763a0fa
|
there is no longer any need to lower case the doi since it is done in the first step. Also changed the creation of the id to use the factory
|
2021-06-29 19:07:23 +02:00 |
Miriam Baglioni
|
a74de1cda2
|
added normalization step to the doi
|
2021-06-29 18:51:11 +02:00 |
Miriam Baglioni
|
06074ea7d3
|
added normalization step to the doi
|
2021-06-29 18:46:08 +02:00 |
Miriam Baglioni
|
8b8ffe82dc
|
added step of normalization for the doi
|
2021-06-29 18:41:39 +02:00 |
Miriam Baglioni
|
50cc21d92e
|
Added method to normalize doi values (lower case, remove everything preceding 10., filter out dois not starting with 10.)
|
2021-06-29 18:35:28 +02:00 |
Claudio Atzori
|
6d3f960238
|
Merge pull request 'added the missing indicators files' (#120) from antonis.lempesis/dnet-hadoop:stable_ids into stable_ids
Reviewed-on: D-Net/dnet-hadoop#120
|
2021-06-29 15:57:39 +02:00 |
Antonis Lempesis
|
ae18171212
|
Merge branch 'stable_ids' into stable_ids
|
2021-06-29 15:33:39 +02:00 |
Antonis Lempesis
|
87f14a3899
|
added the missing indicators files
|
2021-06-29 16:31:51 +03:00 |
Claudio Atzori
|
986a8011ec
|
Merge pull request 'copied latest changes from old fork: indicators+monitor institutions' (#119) from antonis.lempesis/dnet-hadoop:stable_ids into stable_ids
Reviewed-on: D-Net/dnet-hadoop#119
|
2021-06-29 08:49:12 +02:00 |
Antonis Lempesis
|
018c4eb52c
|
copied latest changes from old fork: indicators+monitor institutions
|
2021-06-28 23:46:52 +03:00 |
Claudio Atzori
|
af42377d0e
|
HttpClient used in metadata collection retries on 502, 503, 504
|
2021-06-28 09:34:30 +02:00 |
Claudio Atzori
|
67afd06cd1
|
[cleaning] cleaning instance.pid and instance.alternateidentifier using the same procedure used to clean result.pid
|
2021-06-24 12:10:17 +02:00 |
Claudio Atzori
|
2e8fd2c531
|
cleanup
|
2021-06-23 14:38:24 +02:00 |
Claudio Atzori
|
4dc9ebf217
|
[raw_all] fixed unit test
|
2021-06-23 14:38:07 +02:00 |
Claudio Atzori
|
50fc5a64a0
|
[raw_all] Aggregator graph creation merges claims (updates) with the corresponding entity
|
2021-06-23 11:49:42 +02:00 |
Claudio Atzori
|
5edcc6832a
|
applying sonarLint suggestions
|
2021-06-23 09:53:29 +02:00 |
Claudio Atzori
|
2dd5449c13
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-06-18 10:08:15 +02:00 |
Claudio Atzori
|
fd54ecf7bd
|
bumped dhp-schemas dependency version
|
2021-06-18 10:08:07 +02:00 |
Miriam Baglioni
|
180d671127
|
Merge branch 'stable_ids' of https://code-repo.d4science.org/D-Net/dnet-hadoop into stable_ids
|
2021-06-18 09:46:18 +02:00 |
Miriam Baglioni
|
13c96622c9
|
-
|
2021-06-18 09:45:16 +02:00 |
Miriam Baglioni
|
b486ae498f
|
added test and test resource to verify the generation of the date of acceptance from the input extracted from the dump
|
2021-06-18 09:43:32 +02:00 |
Miriam Baglioni
|
464c2ddde3
|
split the generation of the crossref dataset into two steps
|
2021-06-18 09:42:31 +02:00 |
Miriam Baglioni
|
6aca0d8ebb
|
added kryo encoding for input files
|
2021-06-18 09:42:07 +02:00 |
Miriam Baglioni
|
3585e53da3
|
split the generation of the crossref dataset into two steps
|
2021-06-18 09:41:23 +02:00 |
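
The doi normalization described in commit 50cc21d92e (lower case, drop whatever precedes "10.", discard values without a "10." prefix) and the null check from commit 7177c25261 can be illustrated with a minimal sketch. This is not the project's code: the function name `normalize_doi`, the constant `DOI_PREFIX`, and the whitespace trimming are assumptions made for illustration only.

```python
from typing import Optional

# Hypothetical constant: every DOI is expected to start with this prefix.
DOI_PREFIX = "10."


def normalize_doi(raw: Optional[str]) -> Optional[str]:
    """Return a normalized DOI, or None when the input is not usable.

    Illustrative sketch only; names and behavior are assumptions,
    not the implementation referenced in the commit log.
    """
    # Null check before any string operation.
    if raw is None:
        return None
    # Lower-case the value (trimming whitespace is an added assumption).
    lowered = raw.strip().lower()
    # Locate the first "10." and drop everything that precedes it.
    start = lowered.find(DOI_PREFIX)
    if start < 0:
        # No "10." anywhere: the value is filtered out.
        return None
    return lowered[start:]
```

Under these assumptions, a value such as `https://doi.org/10.1234/ABC` would be reduced to `10.1234/abc`, while a value containing no `10.` at all would be dropped.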