Claudio Atzori
|
39a2afe8b5
|
[graph provision] fixed XML serialization of the usage counts measures, renamed workflow actions to better reflect their role
|
2024-05-09 13:54:42 +02:00 |
Claudio Atzori
|
18aa323ee9
|
cleanup unused classes, adjustments in the oozie wf definition
|
2024-05-08 11:36:46 +02:00 |
Claudio Atzori
|
b4e3389432
|
fixed property mapping creating the RelatedEntity transient objects. spark cores & memory adjustments. Code formatting
|
2024-05-07 16:25:17 +02:00 |
Giambattista Bloisi
|
711048ceed
|
PrepareRelationsJob rewritten to use Spark Dataframe API and Windowing functions
|
2024-05-07 15:44:33 +02:00 |
Claudio Atzori
|
26363060ed
|
fixed id prefix creation for the fosnodoi records, again
|
2024-05-03 15:53:52 +02:00 |
Claudio Atzori
|
0486227185
|
[cleaning] deactivating the cleaning of FOS subjects found in the metadata provided by repositories
|
2024-05-03 14:31:12 +02:00 |
Claudio Atzori
|
a5d13d5d27
|
code formatting
|
2024-05-03 14:14:34 +02:00 |
Claudio Atzori
|
e1a0fb8933
|
fixed id prefix creation for the fosnodoi records
|
2024-05-03 14:14:18 +02:00 |
Giambattista Bloisi
|
69c5efbd8b
|
Fix: when applying enrichments with no instance information, the resulting merged entity was generated with no instance instead of keeping the original information
|
2024-05-03 13:57:56 +02:00 |
Claudio Atzori
|
00ad21d814
|
Merge pull request 'preparations for dhp-common beta release 1.2.5' (#433) from beta-release-1.2.5 into beta
Reviewed-on: D-Net/dnet-hadoop#433
|
2024-05-02 11:28:19 +02:00 |
Claudio Atzori
|
4355f64810
|
reverted to version 1.2.5-SNAPSHOT
|
2024-05-02 11:23:53 +02:00 |
Claudio Atzori
|
66680b8b9a
|
refactoring of common utilities
|
2024-05-02 11:16:58 +02:00 |
Claudio Atzori
|
dcf23b3d06
|
Merge branch 'beta' into beta-release-1.2.5
|
2024-05-02 10:01:49 +02:00 |
Claudio Atzori
|
11bd89e132
|
[enrichment] use sparkExecutorMemory to also define the memoryOverhead
|
2024-05-01 08:32:59 +02:00 |
Claudio Atzori
|
e96c2c1606
|
[ranking wf] set spark.executor.memoryOverhead to fine tune the resource consumption
|
2024-04-30 16:23:25 +02:00 |
Claudio Atzori
|
50c18f7a0b
|
[dedup wf] revised memory settings to address the increased volume of input contents
|
2024-04-30 12:34:16 +02:00 |
Claudio Atzori
|
c08a58bba8
|
Merge pull request 'Miscellaneous related to changes in MergeUtils' (#429) from misc_fixes_merge_entities into beta
Reviewed-on: D-Net/dnet-hadoop#429
|
2024-04-24 08:55:37 +02:00 |
Claudio Atzori
|
e2937db385
|
Merge branch 'beta' into misc_fixes_merge_entities
|
2024-04-24 08:55:28 +02:00 |
Giambattista Bloisi
|
1878199dae
|
Miscellaneous fixes:
- in Merge By ID pick by preference those records coming from delegated Authorities
- fix various tests
- close spark session in SparkCreateSimRels
|
2024-04-24 08:12:45 +02:00 |
Claudio Atzori
|
c3053ef34d
|
using version 1.2.5-beta for the release
|
2024-04-23 14:52:32 +02:00 |
Claudio Atzori
|
b5bcab13ec
|
using version 1.2.5-beta for the release
|
2024-04-23 14:36:39 +02:00 |
Claudio Atzori
|
425c9afc36
|
using version 1.2.5-beta for the release
|
2024-04-23 14:30:04 +02:00 |
Claudio Atzori
|
93dd9cc639
|
code formatting
|
2024-04-23 11:28:00 +02:00 |
Miriam Baglioni
|
6189879643
|
[NOAMI] removed entry for Irish Research eLibrary (IReL) Care Board from the list of funders.
|
2024-04-23 11:09:18 +02:00 |
Claudio Atzori
|
c57cff2d6d
|
Merge pull request '[WebCrawl] adding affiliation relations from web information' (#428) from WebCrowlBeta into beta
Reviewed-on: D-Net/dnet-hadoop#428
|
2024-04-23 09:36:15 +02:00 |
Miriam Baglioni
|
7de114bda0
|
[WebCrawl] addressing comments from PR
|
2024-04-22 13:52:50 +02:00 |
Claudio Atzori
|
eb4692e4ee
|
Merge branch 'beta' into WebCrowlBeta
|
2024-04-22 11:40:24 +02:00 |
Claudio Atzori
|
24a83fc24f
|
avoid NPEs in common Oaf merge utilities
|
2024-04-22 11:39:44 +02:00 |
Miriam Baglioni
|
776c898c4b
|
[WebCrawl] adding affiliation relations from web information
|
2024-04-22 11:04:17 +02:00 |
Claudio Atzori
|
5857fd38c1
|
avoid NPEs in common Oaf merge utilities
|
2024-04-21 08:29:09 +02:00 |
Claudio Atzori
|
0656ab2838
|
code formatting
|
2024-04-20 08:10:58 +02:00 |
Claudio Atzori
|
ab7f0855af
|
fixed query reading projects from the aggregator DB
|
2024-04-20 08:10:32 +02:00 |
Claudio Atzori
|
7a7e313157
|
updated schema version
|
2024-04-19 17:30:25 +02:00 |
Claudio Atzori
|
e5879b68c7
|
[transformative agreement] including result-funder relations in the information imported from the TRs
|
2024-04-19 17:14:18 +02:00 |
Claudio Atzori
|
3a027e97a7
|
[graph indexing] sets spark memoryOverhead in the join operations to the same value used for the executor memory
|
2024-04-19 16:59:58 +02:00 |
Sandro La Bruzzo
|
b72c3139e2
|
replaced the deprecated Ignore annotation with Disabled
|
2024-04-19 14:52:40 +02:00 |
Claudio Atzori
|
57c678d904
|
integrating changes from PR#424
|
2024-04-18 11:38:35 +02:00 |
Claudio Atzori
|
5ab8cd1794
|
Various fixes for the stats DB update workflow, step16-createIndicatorsTables.sql
|
2024-04-18 11:28:18 +02:00 |
Claudio Atzori
|
b554c41cc7
|
Merge pull request 'doidoost_dismiss' (#418) from doidoost_dismiss into beta
Reviewed-on: D-Net/dnet-hadoop#418
|
2024-04-17 12:01:11 +02:00 |
Claudio Atzori
|
ac8747582c
|
Merge branch 'beta' into doidoost_dismiss
|
2024-04-17 12:01:01 +02:00 |
Claudio Atzori
|
0db7e4ae9a
|
Merge pull request 'Refinements to PR #404: refactoring the Oaf records merge utilities into dhp-common' (#422) from revised_merge_logic into beta
Reviewed-on: D-Net/dnet-hadoop#422
|
2024-04-17 11:58:26 +02:00 |
Giambattista Bloisi
|
8ac167e420
|
Refinements to PR #404: refactoring the Oaf records merge utilities into dhp-common
|
2024-04-16 17:18:28 +02:00 |
Miriam Baglioni
|
0625b9061f
|
removed the funder id 100011062 (Asian Spinal Cord Network), wrongly associated with Ireland
|
2024-04-16 15:26:53 +02:00 |
Miriam Baglioni
|
9eeb9f5d32
|
merging with branch beta
|
2024-04-16 15:24:40 +02:00 |
Claudio Atzori
|
589bce3520
|
Merge pull request '[pBETA] Improvements to copying data from ocean to impala' (#421) from antonis.lempesis/dnet-hadoop:beta into beta
Reviewed-on: D-Net/dnet-hadoop#421
|
2024-04-16 14:22:32 +02:00 |
Sandro La Bruzzo
|
a5ddd8dfbb
|
Added Action set generation for the MAG organization
|
2024-04-16 13:39:15 +02:00 |
Giambattista Bloisi
|
da333e9f4d
|
Merge pull request 'Enhance Dedup authors matching with algorithms used for ORCID enhancements (task 9690)' (#419) from dedup_authorsmatch_bytoken into beta
Reviewed-on: D-Net/dnet-hadoop#419
|
2024-04-16 10:24:11 +02:00 |
Claudio Atzori
|
43fd1de681
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2024-04-16 09:42:05 +02:00 |
Claudio Atzori
|
d070db4a32
|
added a couple more invalid author names
|
2024-04-16 09:41:59 +02:00 |
Michele Artini
|
78b9d84e4a
|
test
|
2024-04-16 09:41:16 +02:00 |