Claudio Atzori
9e8849b753
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
2024-11-13 20:41:51 +01:00
sandro.labruzzo
4778a70478
Merge remote-tracking branch 'origin/beta' into pubmed_fix
2024-11-13 16:28:39 +01:00
Claudio Atzori
4a3b173ca2
defaults to 0000 - Unknown in case the instance type lookup in the dnet:result_typologies doesn't find a corresponding result type binding
2024-11-13 16:27:00 +01:00
sandro.labruzzo
ac0a94d62d
updated pubmed parser to add also ORCID id and affiliation string to authors
2024-11-13 16:26:59 +01:00
Giambattista Bloisi
5ee8881646
Merge pull request '[danishfunders] added link for danish funders versus the unidentified project for IRFD (501100004836) CF (501100002808) and NNF(501100009708)' ( #502 ) from danishFunders_crossrefmap into beta
...
Reviewed-on: D-Net/dnet-hadoop#502
2024-11-13 12:01:38 +01:00
Miriam Baglioni
fb1f0f8850
[danishfunders] added the possibility to link also versus a specif award if present in the metadata
2024-11-13 12:00:33 +01:00
Giambattista Bloisi
5b4d821bf9
Merge pull request 'Crossref: generate canonical openaire id for results in affiliation relationship' ( #507 ) from fix_crossref_affiliations into beta
...
Reviewed-on: D-Net/dnet-hadoop#507
2024-11-13 11:01:37 +01:00
Giambattista Bloisi
03c262ccb9
Crossref: generate canonical openaire id for results in affiliation relationship
2024-11-13 10:56:17 +01:00
sandro.labruzzo
a1d5ad5c26
code formatted
2024-11-13 09:51:13 +01:00
sandro.labruzzo
b0478c380e
merged conflicts on beta
2024-11-13 09:43:16 +01:00
Claudio Atzori
07f267bb10
fix vocabulary lookup in mergeutils
2024-11-13 08:14:26 +01:00
Claudio Atzori
8088943399
Merge pull request 'enforce resulttype' ( #506 ) from merge_resulttypes into beta
...
Reviewed-on: D-Net/dnet-hadoop#506
2024-11-12 14:20:22 +01:00
Claudio Atzori
6c5df761e2
enforce resulttype based on the dnet:result_typologies vocabulary and upon merge
2024-11-12 14:18:04 +01:00
Claudio Atzori
9f7a606ddd
Merge pull request 'betaFixPerson' ( #505 ) from betaFixPerson into beta
...
Reviewed-on: D-Net/dnet-hadoop#505
2024-11-12 14:09:22 +01:00
Miriam Baglioni
250f101779
[person] fixed issue in creating project identifier for the graph for person->project relations
2024-11-11 16:04:06 +01:00
Miriam Baglioni
f1ea9da5bc
[person] checked type in inferenceprovenance
2024-11-11 15:37:56 +01:00
Miriam Baglioni
b0283fe94c
[person] fix provenance of pid in person when it is orcid (classid entityregistry to avoid the cleaning put orcid_pending)
2024-11-11 14:57:57 +01:00
sandro.labruzzo
474f365286
removed wrong test
2024-11-11 12:37:27 +01:00
sandro.labruzzo
19ce783e58
renamed workflow
2024-11-11 12:28:02 +01:00
Sandro La Bruzzo
0d0904f4ec
updated workflow baseline to direct transform on OAF
2024-11-11 10:27:23 +01:00
Giambattista Bloisi
f31f22801f
Merge pull request 'Remove ORCID information when the same ORCID ID is used multiple times in the same result for different authors' ( #503 ) from clean_clashing_orcids into beta
...
Reviewed-on: D-Net/dnet-hadoop#503
2024-11-08 09:31:11 +01:00
Miriam Baglioni
6fd9ec8566
[danishfunders] added link for danish funders versus the unidentified project for IRFD (501100004836) CF (501100002808) and NNF(501100009708)
2024-11-07 13:55:31 +01:00
Giambattista Bloisi
8f5171557e
Remove ORCID information when the same ORCID ID is used multiple times in the same result for different authors
2024-11-07 12:22:34 +01:00
Claudio Atzori
f7bb53fe78
[orcid enrichment] added missing workflow parameter: workingDir
2024-11-07 01:04:43 +01:00
Claudio Atzori
973aa7dca6
[dedup] force the Relation schema when reading the merge rels
2024-11-06 12:29:06 +01:00
Sandro La Bruzzo
c1cef5d685
removed old library joda time replaced with standard java.time introduced in java 8
2024-11-05 10:38:40 +01:00
Sandro La Bruzzo
a8ed5a3b04
Organized getters and setters in the PMArticle class for better readability and maintainability.
2024-11-04 17:45:28 +01:00
Claudio Atzori
a42c8b7c85
person table directory produced by the workflows raw_all and merge graphs
2024-10-30 11:25:17 +01:00
Claudio Atzori
a877c76d70
make MergeUtils.selectOldestDate less prone to errors when receiving invalid date formats
2024-10-30 11:24:25 +01:00
Claudio Atzori
26cdc7e439
Avoid NPEs in MergeUtils
2024-10-30 07:35:47 +01:00
Claudio Atzori
323c76eafc
patch relations job: removed non necessary logging
2024-10-30 07:35:30 +01:00
Miriam Baglioni
69aee609ef
[bulktag] align type to community api
2024-10-29 15:53:04 +01:00
Claudio Atzori
5ca031c8d6
[graph raw] rule out empty PIDs
2024-10-29 13:48:41 +01:00
Claudio Atzori
499892b67c
[graph raw] rule out empty PIDs
2024-10-29 09:51:30 +01:00
Claudio Atzori
e4504fd98d
[Person] fixed project identifier creation
2024-10-28 15:32:09 +01:00
Claudio Atzori
9b4415cb67
using _the right_ scala 2.11 converters
2024-10-28 13:56:25 +01:00
Claudio Atzori
e6ca382deb
using scala 2.11 converters
2024-10-28 13:52:06 +01:00
Claudio Atzori
940735921f
Merge pull request 'Fill mergedIds field and filter mergerels with dedup records actually created' ( #500 ) from mergedids into beta
...
Reviewed-on: D-Net/dnet-hadoop#500
2024-10-28 13:43:09 +01:00
Giambattista Bloisi
56224e034a
Fill the new mergedIds field when generating dedup records
...
Filter out dedup records composed of invisible records only
Filter out mergerels that have not been used when creating the dedup record (ungrouping of cliques)
2024-10-28 13:31:01 +01:00
Miriam Baglioni
5916346ba1
[TransformativeAgreement] fix to remove the file downloaded from a previous run of the workflow
2024-10-28 12:18:50 +01:00
Claudio Atzori
e4abe55988
merged person_through_the_graph & code formatting
2024-10-28 11:01:49 +01:00
Claudio Atzori
d71df6de19
Merge pull request 'affroNewModelonBeta' ( #494 ) from affroNewModelonBeta into beta
...
Reviewed-on: D-Net/dnet-hadoop#494
2024-10-28 10:48:34 +01:00
Claudio Atzori
1cdcd07a7e
Merge pull request 'dhp-schema upgrade & provision mapping 2' ( #499 ) from beta_provision_alignment_9.0.0 into beta
...
Reviewed-on: D-Net/dnet-hadoop#499
2024-10-28 10:44:08 +01:00
Claudio Atzori
6fd50266f1
translate 'otherresearchproduct' into 'other' when setting the related record type
2024-10-28 10:42:46 +01:00
Claudio Atzori
dffa376eb6
Merge pull request 'dhp-schema upgrade & provision mapping' ( #498 ) from beta_provision_alignment_9.0.0 into beta
...
Reviewed-on: D-Net/dnet-hadoop#498
2024-10-28 10:03:24 +01:00
Claudio Atzori
32fa579b80
[graph provision] select the longest abstract
2024-10-28 10:03:02 +01:00
Claudio Atzori
67e37f41fb
Merge pull request 'blacklist filtering moved before the cleanup phase in order to have case sensitive regex' ( #485 ) from dedup_blacklist_fix into beta
...
Reviewed-on: D-Net/dnet-hadoop#485
2024-10-28 09:42:51 +01:00
Miriam Baglioni
0fb6af5586
Updated main pom dependency against dhp-schema, from 8.0.1 to 9.0.0. The new fields included in the updated schema module are populated by the Solr JSON payload mapping, which also limits the number of authors serialised to 200.
2024-10-25 16:28:50 +02:00
Claudio Atzori
dcba5ad32a
Merge pull request 'person_through_the_graph_newDevelopments' ( #497 ) from person_through_the_graph_newDevelopments into person_through_the_graph
...
Reviewed-on: D-Net/dnet-hadoop#497
2024-10-25 10:20:40 +02:00
Claudio Atzori
46dbb62598
Merge pull request ' #9839 : include claimed affiliation relationships' ( #476 ) from claim-orgs into beta
...
Reviewed-on: D-Net/dnet-hadoop#476
2024-10-25 10:12:59 +02:00