Claudio Atzori
|
078169b922
|
cleanup
|
2024-03-15 09:56:04 +01:00 |
Claudio Atzori
|
af154d4456
|
implemented changes from #9497: sort abstracts by string length, included author fullnames in the related results, expanded instance details within each children/result XML element
|
2024-03-14 16:21:23 +01:00 |
Claudio Atzori
|
7863c92466
|
expanded paper abstract in the result/children XML element (ticket #9497)
|
2024-03-13 16:25:31 +01:00 |
Claudio Atzori
|
eb5887cb9a
|
including related organization url in the XML record serialization (ticket #9498)
|
2024-03-13 14:46:00 +01:00 |
Claudio Atzori
|
db66555ebb
|
WIP: updated provision workflow to create a JSON based representation of the payload
|
2024-03-12 09:56:09 +01:00 |
Claudio Atzori
|
d4871b31e8
|
WIP: extended provision workflow to create the JSON based payload
|
2024-03-08 11:43:20 +01:00 |
Claudio Atzori
|
6fcf872daa
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into index_records
|
2024-02-28 10:27:28 +01:00 |
Claudio Atzori
|
3f07390a58
|
WIP
|
2024-02-28 10:10:10 +01:00 |
Sandro La Bruzzo
|
7d806a434c
|
formatted code
|
2024-02-28 09:31:58 +01:00 |
Sandro La Bruzzo
|
b63994dcc4
|
Merge remote-tracking branch 'origin/beta' into orcid_update
|
2024-02-28 09:11:18 +01:00 |
Sandro La Bruzzo
|
915a76a796
|
following the comment on the pull requests:
- Added #NUM_OF_THREADS complete job in the queue at the end of the main loop to avoid deadlock
|
2024-02-28 09:10:55 +01:00 |
Sandro La Bruzzo
|
a712df1e1d
|
Merge remote-tracking branch 'origin/beta' into orcid_update
|
2024-02-23 10:12:25 +01:00 |
Sandro La Bruzzo
|
b32a9d1994
|
Implemented workflow for updating table , added step to check if the new generated table is valid
|
2024-02-23 10:04:28 +01:00 |
Michele Artini
|
3268570b2c
|
mapping of project PIDs
|
2024-02-22 14:47:21 +01:00 |
Claudio Atzori
|
a63b091bae
|
Merge branch 'beta' into import_orps_fix
|
2024-02-15 15:01:56 +01:00 |
Claudio Atzori
|
d85d2df6ad
|
[graph raw] fixed mapping of the original resource type from the Datacite format
|
2024-02-09 10:20:20 +01:00 |
Giambattista Bloisi
|
b19643f6eb
|
Dedup aliases, created when a dedup in a previous build has been merged in a new dedup, need to be marked as "deletedbyinference", since they are "merged" in the new dedup
|
2024-02-08 15:34:59 +01:00 |
Claudio Atzori
|
38c9001147
|
fixed import of ORPs stored on HDFS in the internal graph format (e.g. Datacite)
|
2024-02-07 17:02:05 +01:00 |
Claudio Atzori
|
fd17c1f17c
|
[actiosets] fixed join type
|
2024-02-05 16:55:36 +02:00 |
Claudio Atzori
|
009dcf6aea
|
[actiosets] introduced support for the PromoteAction strategy
|
2024-02-05 16:43:40 +02:00 |
Claudio Atzori
|
42f5506306
|
[orcid enrichment] fixed directory cleanup before distcp
|
2024-02-05 09:45:36 +02:00 |
Alessia Bardi
|
f2a08d8cc2
|
test for Italian records from IRS repositories
|
2024-01-30 19:20:14 +01:00 |
Miriam Baglioni
|
a5995ab557
|
[orcid-enrichment] change the value of parameters.
|
2024-01-29 18:19:48 +01:00 |
Claudio Atzori
|
926903b06b
|
Merge branch 'beta' into stats_with_spark_sql
|
2024-01-29 09:11:45 +01:00 |
Giambattista Bloisi
|
078df0b4d1
|
Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf
|
2024-01-26 21:56:55 +01:00 |
Claudio Atzori
|
ce3200263e
|
Merge branch 'beta' into crossref_missing_author_fix
|
2024-01-26 15:57:04 +01:00 |
Sandro La Bruzzo
|
e889808daa
|
Fixed problem on missing author in crossref Mapping
|
2024-01-26 12:19:04 +01:00 |
Sandro La Bruzzo
|
0386f36385
|
Added workflow to update ORCID and replaced some parsing, because the update works and employments xml differs from the dump one.
|
2024-01-25 19:40:59 +01:00 |
Antonis Lempesis
|
a7115cfa9e
|
max mem of joins (hive.mapjoin.followby.gby.localtask.max.memory.usage) now 80%, up from 55%.
|
2024-01-25 15:13:16 +01:00 |
Claudio Atzori
|
9b13c22e5d
|
[graph provision] retrieve all the context information by adding all=true to the requests issued to thr API
|
2024-01-23 15:36:08 +01:00 |
Sandro La Bruzzo
|
43e0bba7ed
|
logg added during download
|
2024-01-23 15:04:49 +01:00 |
Claudio Atzori
|
f87f3a6483
|
[graph provision] updated param specification for the XML converter job
|
2024-01-23 08:54:37 +01:00 |
Claudio Atzori
|
6fd25cf549
|
code formatting
|
2024-01-23 08:47:12 +01:00 |
Claudio Atzori
|
f76852f385
|
Merge branch 'beta' into update_pivots_table
|
2024-01-22 16:37:22 +01:00 |
Claudio Atzori
|
1c6db320f4
|
[graph provision] obtain context info from the context API instead from the ISLookUp service
|
2024-01-22 15:53:17 +01:00 |
Claudio Atzori
|
2655eea5bc
|
[orcid enrichment] drop paths before copying the non-modifyed contents
|
2024-01-19 16:28:05 +01:00 |
Claudio Atzori
|
c6b3401596
|
increased shuffle partitions for publications in the country propagation workflow
|
2024-01-19 10:15:39 +01:00 |
Miriam Baglioni
|
bcc0a13981
|
[enrichment single step] adding <end> element in wf definition
|
2024-01-18 17:39:14 +01:00 |
Miriam Baglioni
|
6af536541d
|
[enrichment single step] moving parameter file in correct location
|
2024-01-18 15:35:40 +01:00 |
Miriam Baglioni
|
a12a3eb143
|
-
|
2024-01-18 15:18:10 +01:00 |
Miriam Baglioni
|
82e9e262ee
|
[enrichment single step] remove parameter from execution
|
2024-01-17 17:38:03 +01:00 |
Miriam Baglioni
|
67ce2d54be
|
[enrichment single step] refactoring to fix issues in disappeared result type
|
2024-01-17 16:50:00 +01:00 |
Miriam Baglioni
|
59eaccbd87
|
[enrichment single step] refactoring to fix issue in disappeared result type
|
2024-01-15 17:49:54 +01:00 |
Giambattista Bloisi
|
21a14fcd80
|
Reusable RunSQLSparkJob for executing SQL in Spark through Oozie Spark Actions
Implements pivots table update oozie workflow
|
2024-01-15 10:18:14 +01:00 |
Sandro La Bruzzo
|
e0753f19da
|
Fixed error of connection timeout
|
2024-01-13 09:27:08 +01:00 |
sandro.labruzzo
|
e328bc0ade
|
fixed missing parameter on download update
|
2024-01-12 16:18:20 +01:00 |
Miriam Baglioni
|
f612125939
|
fix issue on FoS integration. Removing the null values from FoS
|
2024-01-12 10:20:28 +01:00 |
Claudio Atzori
|
cb9e739484
|
Merge branch 'beta' into resource_types
|
2024-01-11 16:29:41 +01:00 |
Claudio Atzori
|
2753044d13
|
refined mapping for the extraction of the original resource type
|
2024-01-11 16:28:26 +01:00 |
Giambattista Bloisi
|
3c66e3bd7b
|
Create dedup record for "merged" pivots
Do not create dedup records for group that have more than 20 different acceptance date
|
2024-01-10 22:59:52 +01:00 |