Claudio Atzori
cf7d9a32ab
disable autoBroadcastJoin in the cleaning workflow
2024-11-15 09:17:28 +01:00
Claudio Atzori
5f512f510e
code formatting
2024-11-15 09:16:51 +01:00
Claudio Atzori
4a3b173ca2
defaults to 0000 - Unknown in case the instance type lookup in the dnet:result_typologies doesn't find a corresponding result type binding
2024-11-13 16:27:00 +01:00
Claudio Atzori
6c5df761e2
enforce resulttype based on the dnet:result_typologies vocabulary and upon merge
2024-11-12 14:18:04 +01:00
Claudio Atzori
f7bb53fe78
[orcid enrichment] added missing workflow parameter: workingDir
2024-11-07 01:04:43 +01:00
Claudio Atzori
a42c8b7c85
person table directory produced by the workflows raw_all and merge graphs
2024-10-30 11:25:17 +01:00
Claudio Atzori
323c76eafc
patch relations job: removed non necessary logging
2024-10-30 07:35:30 +01:00
Claudio Atzori
499892b67c
[graph raw] rule out empty PIDs
2024-10-29 09:51:30 +01:00
Claudio Atzori
e4abe55988
merged person_through_the_graph & code formatting
2024-10-28 11:01:49 +01:00
Claudio Atzori
d71df6de19
Merge pull request 'affroNewModelonBeta' ( #494 ) from affroNewModelonBeta into beta
...
Reviewed-on: D-Net/dnet-hadoop#494
2024-10-28 10:48:34 +01:00
Claudio Atzori
46dbb62598
Merge pull request ' #9839 : include claimed affiliation relationships' ( #476 ) from claim-orgs into beta
...
Reviewed-on: D-Net/dnet-hadoop#476
2024-10-25 10:12:59 +02:00
Giambattista Bloisi
aa7b8fd014
Use workingDir parameter for temporary data of ORCID enrichment
2024-10-23 14:02:17 +02:00
Giambattista Bloisi
0e34b0ece1
Fix imports: point them from the main distribution packages
2024-10-23 14:01:52 +02:00
Miriam Baglioni
821540f94a
[personEntity] updated the property file to include also the db parameters. The same for the wf definition. Refactoring for compilation
2024-10-22 10:13:30 +02:00
Miriam Baglioni
2b27afaec8
[createASfromAffRo] refactoring after compilation
2024-10-18 16:22:51 +02:00
Claudio Atzori
62ff843334
adopting dhp-schemas:8.0.1 to support Auhtor's rawAffiliationString(s). Improved graph2hive implementation
2024-10-08 16:22:54 +02:00
Claudio Atzori
d5867a1992
merged #490
2024-10-08 15:39:59 +02:00
Alessia
07e6e7b4d6
#9839 : include claimed affiliation relationships
2024-09-16 13:41:56 +02:00
Miriam Baglioni
45605f93ae
merging with branch beta
2024-08-12 18:03:10 +02:00
Miriam Baglioni
985ca15264
[openaire-affiliation]removes matchings without DOI
2024-08-05 12:10:40 +02:00
Claudio Atzori
0bf76f2a34
[graph provision] added person to the graph2hive workflow
2024-08-05 09:35:07 +02:00
Claudio Atzori
9486e21a44
copy or process the person records throughout the graph pipeline
2024-07-30 14:25:31 +02:00
Claudio Atzori
d771a883f9
[dedup] updated sql query used to read organizations from the OpenOrgs DB to include their typology
2024-07-25 09:53:48 +02:00
Michele Artini
d27e9ea50f
added ODF invisible stores in raw_all workflow
2024-07-23 09:56:27 +02:00
Michele De Bonis
4f4c73d65b
minor change: addition of missing parameter in sql query
2024-07-22 15:19:02 +02:00
Claudio Atzori
11fe3a4fe0
[graph resolution] use sparkExecutorMemory to define also the memoryOverhead
2024-06-11 14:21:17 +02:00
Claudio Atzori
3776327a8c
hostedby patching to work with the updated Crossref contents, resolved conflict
2024-06-10 15:24:12 +02:00
Claudio Atzori
ec79405cc9
[graph raw] set organization type from openorgs
2024-06-07 11:30:31 +02:00
Claudio Atzori
92c3abd5a4
[graph cleaning] use sparkExecutorMemory to define also the memoryOverhead
2024-06-06 10:44:33 +02:00
Claudio Atzori
73bd1938a5
[graph2hive] use sparkExecutorMemory to define also the memoryOverhead
2024-06-05 12:17:35 +02:00
Sandro La Bruzzo
103e2652b3
merged beta
2024-05-17 14:43:07 +02:00
Sandro La Bruzzo
a87f9ea643
fixed scholexplorer bug
2024-05-17 14:16:43 +02:00
Sandro La Bruzzo
6efab4d88e
fixed scholexplorer bug
2024-05-16 16:19:18 +02:00
Claudio Atzori
0486227185
[cleaning] deactivating the cleaning of FOS subjects found in the metadata provided by repositories
2024-05-03 14:31:12 +02:00
Sandro La Bruzzo
26bf8e763a
merged from beta
2024-05-02 15:20:23 +02:00
Sandro La Bruzzo
0646d0d064
Updated main sparkApplication to avoid to require master variable
2024-05-02 15:15:03 +02:00
Claudio Atzori
4355f64810
reverted to version 1.2.5-SNAPSHOT
2024-05-02 11:23:53 +02:00
Claudio Atzori
66680b8b9a
refactoring of common utilities
2024-05-02 11:16:58 +02:00
Claudio Atzori
dcf23b3d06
Merge branch 'beta' into beta-release-1.2.5
2024-05-02 10:01:49 +02:00
Sandro La Bruzzo
133ead1e3e
updated new version of scholexplorer Generation
2024-04-29 09:00:30 +02:00
Sandro La Bruzzo
9cd3bc0f10
Added a new generation of the dump for scholexplorer tested with last version of spark, and strongly refactored
2024-04-26 16:02:07 +02:00
Giambattista Bloisi
1878199dae
Miscellaneous fixes:
...
- in Merge By ID pick by preference those records coming from delegated Authorities
- fix various tests
- close spark session in SparkCreateSimRels
2024-04-24 08:12:45 +02:00
Claudio Atzori
c3053ef34d
using version 1.2.5-beta for the release
2024-04-23 14:52:32 +02:00
Claudio Atzori
b5bcab13ec
using version 1.2.5-beta for the release
2024-04-23 14:36:39 +02:00
Claudio Atzori
425c9afc36
using version 1.2.5-beta for the release
2024-04-23 14:30:04 +02:00
Claudio Atzori
0656ab2838
code formatting
2024-04-20 08:10:58 +02:00
Claudio Atzori
ab7f0855af
fixed query reading projects from the aggregator DB
2024-04-20 08:10:32 +02:00
Giambattista Bloisi
8ac167e420
Refinements to PR #404 : refactoring the Oaf records merge utilities into dhp-common
2024-04-16 17:18:28 +02:00
Giambattista Bloisi
43b454399f
- Bug fix in matchOrderedTokenAndAbbreviations algorithms where tokens with same initial character were always considered equal
...
- AuthorsMatch exploits the new matching strategy used for ORCID enhancements in #PR398: split author names in tokens, order the tokens, then check for matches of ordered full tokens or abbreviations
2024-04-15 18:19:29 +02:00
Claudio Atzori
ef52128c55
included new stats* workflows in parent pom list of modules, code formatting
2024-03-26 10:42:10 +01:00