Claudio Atzori
|
dade7d5bb8
|
minor changes
|
2024-12-06 10:02:07 +01:00 |
Michele De Bonis
|
bde59a7c8f
|
implementation of the utilities for the inclusion of raids in the graph
|
2024-12-05 11:09:30 +01:00 |
Claudio Atzori
|
b95672b420
|
mergeUtils set the result identifier when enforcing the result type
|
2024-11-15 09:16:18 +01:00 |
Claudio Atzori
|
4a3b173ca2
|
defaults to 0000 - Unknown in case the instance type lookup in the dnet:result_typologies doesn't find a corresponding result type binding
|
2024-11-13 16:27:00 +01:00 |
Claudio Atzori
|
07f267bb10
|
fix vocabulary lookup in mergeutils
|
2024-11-13 08:14:26 +01:00 |
Claudio Atzori
|
8088943399
|
Merge pull request 'enforce resulttype' (#506) from merge_resulttypes into beta
Reviewed-on: D-Net/dnet-hadoop#506
|
2024-11-12 14:20:22 +01:00 |
Claudio Atzori
|
6c5df761e2
|
enforce resulttype based on the dnet:result_typologies vocabulary and upon merge
|
2024-11-12 14:18:04 +01:00 |
Giambattista Bloisi
|
8f5171557e
|
Remove ORCID information when the same ORCID ID is used multiple times in the same result for different authors
|
2024-11-07 12:22:34 +01:00 |
Claudio Atzori
|
a877c76d70
|
make MergeUtils.selectOldestDate less prone to errors when receiving invalid date formats
|
2024-10-30 11:24:25 +01:00 |
Claudio Atzori
|
26cdc7e439
|
Avoid NPEs in MergeUtils
|
2024-10-30 07:35:47 +01:00 |
Claudio Atzori
|
5ca031c8d6
|
[graph raw] rule out empty PIDs
|
2024-10-29 13:48:41 +01:00 |
Claudio Atzori
|
e4abe55988
|
merged person_through_the_graph & code formatting
|
2024-10-28 11:01:49 +01:00 |
Miriam Baglioni
|
0fb6af5586
|
Updated main pom dependency against dhp-schema, from 8.0.1 to 9.0.0. The new fields included in the updated schema module are populated by the Solr JSON payload mapping, which also limits the number of authors serialised to 200.
|
2024-10-25 16:28:50 +02:00 |
Miriam Baglioni
|
c921cf7ee0
|
[personEntity] removed the deletedbyinference results (not indexed, but still in the graph). Changed the writing mode: append instead of overwrite
|
2024-10-24 09:57:20 +02:00 |
Giambattista Bloisi
|
6bc741715c
|
Fix OafMapperUtilsTest.testMergePubs
|
2024-10-23 14:02:45 +02:00 |
Claudio Atzori
|
d5867a1992
|
merged #490
|
2024-10-08 15:39:59 +02:00 |
Giambattista Bloisi
|
c45cae447a
|
Fix: invert the "natural" order when ordering by id lexicographically
|
2024-09-26 17:08:02 +02:00 |
Claudio Atzori
|
3fcafc7ed6
|
Merge pull request 'Latest institutions in monitor dbs' (#472) from antonis.lempesis/dnet-hadoop:beta into beta
Reviewed-on: D-Net/dnet-hadoop#472
|
2024-09-26 09:49:01 +02:00 |
Claudio Atzori
|
535a7b99f1
|
the metadata collection plugins using the HttpConnector2 class shall now retry instead of failing in case of UnknownHostException
|
2024-09-25 11:35:34 +02:00 |
Claudio Atzori
|
d1cadc77c9
|
[graph provision] person serialisation, limit the number of authorships and coauthorships before expanding the payloads
|
2024-09-24 10:57:20 +02:00 |
Claudio Atzori
|
e0ff84baf0
|
[graph provision] person serialisation, limit the number of authorships and coauthorships before expanding the payloads
|
2024-09-23 10:29:46 +02:00 |
Claudio Atzori
|
23e0ab3a7c
|
run mergeResultsOfDifferentTypes only when checkDelegatedAuthority is true
|
2024-09-17 15:36:10 +02:00 |
Claudio Atzori
|
bfd05cdab2
|
run mergeResultsOfDifferentTypes only when checkDelegatedAuthority is true
|
2024-09-17 10:49:32 +02:00 |
Claudio Atzori
|
9486e21a44
|
copy or process the person records throughout the graph pipeline
|
2024-07-30 14:25:31 +02:00 |
Claudio Atzori
|
5aa7847ea6
|
consider the transformative agreement text when merging results
|
2024-07-16 10:38:50 +02:00 |
Claudio Atzori
|
1180d78b71
|
make entity level pids unique by pidType:pidValue
|
2024-07-04 09:41:12 +02:00 |
Claudio Atzori
|
7d3292551b
|
ignore dates containing 'null's
|
2024-07-02 15:44:31 +02:00 |
Lampros Smyrnaios
|
fe2275a9b0
|
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into convert_hive_to_spark_actions
# Conflicts:
# dhp-workflows/dhp-stats-update/src/main/resources/eu/dnetlib/dhp/oa/graph/stats/oozie_app/scripts/step14.sql
|
2024-06-25 20:17:47 +03:00 |
Claudio Atzori
|
a8d68c9d29
|
avoid NPEs
|
2024-06-11 14:19:24 +02:00 |
Claudio Atzori
|
ce2364743a
|
applying changes from PR#442: Fix for missing collectedfrom after dedup
|
2024-06-06 10:43:43 +02:00 |
Claudio Atzori
|
f70dc76b61
|
minor
|
2024-06-06 10:43:10 +02:00 |
Lampros Smyrnaios
|
a644a6f4fe
|
Catch Spark-sql errors and show a log with the statement that failed.
|
2024-05-29 12:10:11 +03:00 |
Giambattista Bloisi
|
73316d8c83
|
Add jaxb and jaxws dependencies when compiling with spark-34 profile as they are required to run with jdk > 8
|
2024-05-28 14:14:51 +02:00 |
Sandro La Bruzzo
|
f1fe363b19
|
merged again from beta (I hope for the last time)
|
2024-05-22 11:08:52 +02:00 |
Sandro La Bruzzo
|
66c1ffc866
|
merged again from beta (I hope for the last time)
|
2024-05-22 11:02:46 +02:00 |
Sandro La Bruzzo
|
103e2652b3
|
merged beta
|
2024-05-17 14:43:07 +02:00 |
Sandro La Bruzzo
|
6efab4d88e
|
fixed scholexplorer bug
|
2024-05-16 16:19:18 +02:00 |
Claudio Atzori
|
a5d13d5d27
|
code formatting
|
2024-05-03 14:14:34 +02:00 |
Giambattista Bloisi
|
69c5efbd8b
|
Fix: when applying enrichments with no instance information the resulting merge entity was generated with no instance instead of keeping the original information
|
2024-05-03 13:57:56 +02:00 |
Sandro La Bruzzo
|
db358ad0d2
|
code formatted
|
2024-05-02 15:25:57 +02:00 |
Sandro La Bruzzo
|
26bf8e763a
|
merged from beta
|
2024-05-02 15:20:23 +02:00 |
Sandro La Bruzzo
|
0646d0d064
|
Updated main sparkApplication to avoid to require master variable
|
2024-05-02 15:15:03 +02:00 |
Claudio Atzori
|
4355f64810
|
reverted to version 1.2.5-SNAPSHOT
|
2024-05-02 11:23:53 +02:00 |
Claudio Atzori
|
66680b8b9a
|
refactoring of common utilities
|
2024-05-02 11:16:58 +02:00 |
Claudio Atzori
|
dcf23b3d06
|
Merge branch 'beta' into beta-release-1.2.5
|
2024-05-02 10:01:49 +02:00 |
Sandro La Bruzzo
|
9cd3bc0f10
|
Added a new generation of the dump for scholexplorer tested with last version of spark, and strongly refactored
|
2024-04-26 16:02:07 +02:00 |
Claudio Atzori
|
e2937db385
|
Merge branch 'beta' into misc_fixes_merge_entities
|
2024-04-24 08:55:28 +02:00 |
Giambattista Bloisi
|
1878199dae
|
Miscellaneous fixes:
- in Merge By ID pick by preference those records coming from delegated Authorities
- fix various tests
- close spark session in SparkCreateSimRels
|
2024-04-24 08:12:45 +02:00 |
Sandro La Bruzzo
|
0d628cd62b
|
merged again from beta
|
2024-04-23 17:34:55 +02:00 |
Claudio Atzori
|
c3053ef34d
|
using version 1.2.5-beta for the release
|
2024-04-23 14:52:32 +02:00 |