Enrico Ottonello
|
d0df02062c
|
raised runtimeexception on record without title or url
|
2022-04-06 13:19:58 +02:00 |
Enrico Ottonello
|
7fc5b97871
|
skipped record without title or url
|
2022-04-06 13:08:48 +02:00 |
Enrico Ottonello
|
a203c33693
|
added disprot constants configuration
|
2022-04-06 12:48:24 +02:00 |
Enrico Ottonello
|
98178b3165
|
custom deserializer for property value type working for both ped and disprot
|
2022-04-05 10:47:17 +02:00 |
Enrico Ottonello
|
f11dfc51f7
|
fix resolved url format, added alternate identifier from original pid
|
2022-03-22 16:39:21 +01:00 |
Enrico Ottonello
|
afe84c4244
|
added subjects to oaf generation
|
2022-03-18 18:10:39 +01:00 |
Enrico Ottonello
|
e53a606afc
|
added date of collection, resource type as workflow parameter
|
2022-03-15 17:36:48 +01:00 |
Enrico Ottonello
|
bd37f14941
|
added working ocean configuration
|
2022-03-03 14:38:21 +01:00 |
Enrico Ottonello
|
29ee1b9d82
|
added datasource key to workflow parameter to properly choose collected from and id values
|
2022-03-03 12:31:29 +01:00 |
Enrico Ottonello
|
e57216a1fa
|
added oozie workflow to generate bioschema dataset on hdfs
|
2022-03-02 16:58:10 +01:00 |
Enrico Ottonello
|
f28d7e3b9d
|
added spark dataset creation
|
2022-03-02 12:12:37 +01:00 |
Enrico Ottonello
|
7f9636ef00
|
added alternateIdentifiers to oaf
|
2022-02-25 14:42:08 +01:00 |
Enrico Ottonello
|
2f5caef77b
|
resolution of generated relations url to uniprot and pubmed datasources
|
2022-02-24 16:59:50 +01:00 |
Enrico Ottonello
|
2bc79c50f8
|
mapping bioschema to oaf
|
2022-02-22 11:46:29 +01:00 |
Enrico Ottonello
|
446f81ee60
|
wf to generate oaf from bioschema json datacite
|
2022-02-22 11:42:57 +01:00 |
Claudio Atzori
|
5226d0a100
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2022-02-18 15:21:07 +01:00 |
Claudio Atzori
|
401dd38074
|
code formatting
|
2022-02-18 15:19:15 +01:00 |
Sandro La Bruzzo
|
891781ee3f
|
Merge branch 'beta' of code-repo.d4science.org:D-Net/dnet-hadoop into beta
|
2022-02-18 11:11:32 +01:00 |
Sandro La Bruzzo
|
d3f03abd51
|
fixed wrong json path
|
2022-02-18 11:11:17 +01:00 |
Claudio Atzori
|
89c7313fc5
|
Merge branch 'beta' into hierarchical_orgs_relations
|
2022-02-17 10:30:04 +01:00 |
Sandro La Bruzzo
|
3aa2020b24
|
added script to regenerate hostedBy Map following instruction defined on ticket #7539
updated hosted By Map
|
2022-02-15 11:05:27 +01:00 |
Miriam Baglioni
|
be64055cfe
|
[OpenCitation] changed the name of destination folders
|
2022-02-14 15:49:44 +01:00 |
Miriam Baglioni
|
1490867cc7
|
[OpenCitation] cleaning of the COCI model
|
2022-02-14 14:52:12 +01:00 |
Miriam Baglioni
|
5c4043dba8
|
[OpenCitation] refactoring
|
2022-02-08 16:23:05 +01:00 |
Miriam Baglioni
|
759ed519f2
|
[OpenCitation] added logic to avoid the genration of self citations relations
|
2022-02-08 16:15:34 +01:00 |
Miriam Baglioni
|
b071f8e415
|
[OpenCitation] change to extract in json format each folder just onece
|
2022-02-08 15:37:28 +01:00 |
Miriam Baglioni
|
fbc28ee8c3
|
[OpenCitation] change the integration logic to consider dois with commas inside
|
2022-02-07 18:32:08 +01:00 |
Miriam Baglioni
|
73eba34d42
|
[UnresolvedEntities] Changed the way to merge the unresolved because the new merge removed the dataInfo from the merged result. Added also data info for subjects
|
2022-02-01 08:38:41 +01:00 |
Claudio Atzori
|
b37bc277c4
|
reintroduced the hostedby patching to the datacite records
|
2022-01-21 09:15:13 +01:00 |
Miriam Baglioni
|
e7d5a39c03
|
[BipFinderInstanceLevel] added tests in test class
|
2022-01-12 17:25:04 +01:00 |
Miriam Baglioni
|
4993666d73
|
[BipFinderInstanceLevel] changed creation of the instance to allow to enrich existing instances with same pid
|
2022-01-12 16:53:47 +01:00 |
Sandro La Bruzzo
|
57e2c4b749
|
formatted code
|
2022-01-12 09:40:28 +01:00 |
Claudio Atzori
|
dcd282977c
|
pulled from beta
|
2022-01-11 16:59:41 +01:00 |
Claudio Atzori
|
4f212652ca
|
scalafmt: code formatting
|
2022-01-11 16:57:48 +01:00 |
Miriam Baglioni
|
b7e450070b
|
[SDG-FOS] to import SDG file not considering the header
|
2022-01-07 12:13:26 +01:00 |
Miriam Baglioni
|
639190370a
|
mergin with branch beta
|
2022-01-07 11:29:25 +01:00 |
Miriam Baglioni
|
adccc2346a
|
[SDG-FOS] to lower case for the doi
|
2022-01-07 11:28:50 +01:00 |
Claudio Atzori
|
58f8998e3d
|
OAF-store-graph mdstores: save them in text format
|
2022-01-04 15:02:09 +01:00 |
Claudio Atzori
|
174c3037e1
|
OAF-store-graph mdstores: save them in text format
|
2022-01-04 14:40:16 +01:00 |
Claudio Atzori
|
045d767013
|
OAF-store-graph mdstores: save them in text format
|
2022-01-04 14:23:01 +01:00 |
Claudio Atzori
|
a6977197b3
|
serialise records in the OAF-store-graph mdstores in json format. Read them again in the graph construction phase using a tolerant parser to support backward compatible changes in the evolution of the schema
|
2022-01-03 17:25:26 +01:00 |
Miriam Baglioni
|
92fd69e25d
|
[SDG-FOS] alternative way to get input data to avoid OOM error while getting csv
|
2022-01-03 15:23:06 +01:00 |
Miriam Baglioni
|
7a1b440413
|
[SDG] logic to create unresolved entities out of SDG input. This changes also some classes related to FOS to reuse the same code. The code under createunresolvedentities create results with the merged update of the the inputs provided (bip at the level of the isntance, fos and sdg for subjects)
|
2021-12-23 13:24:28 +01:00 |
Miriam Baglioni
|
2a67ee13ec
|
[SDG] added model class
|
2021-12-23 10:37:52 +01:00 |
Miriam Baglioni
|
10579c0dd0
|
[FOS]fixed doi value in test
|
2021-12-22 23:10:16 +01:00 |
Miriam Baglioni
|
6116fc5d40
|
[FOS]added logic to include only different subjects. Test refactoring and extention
|
2021-12-22 23:04:22 +01:00 |
Miriam Baglioni
|
b81efb6a9d
|
[FOS]changed the mapping between the csv and the model. Changed Test classes and resources
|
2021-12-22 21:40:35 +01:00 |
Miriam Baglioni
|
de6c4c8968
|
[FOS]creation of the unresolved entities: remove the split for the doi: no more needed since each row is related to one doi
|
2021-12-22 16:44:44 +01:00 |
Miriam Baglioni
|
34ac56565d
|
refactoring
|
2021-12-22 16:28:11 +01:00 |
Miriam Baglioni
|
20ef1d657f
|
refactoring
|
2021-12-22 16:26:36 +01:00 |