Claudio Atzori
9e8849b753
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
2024-11-13 20:41:51 +01:00
sandro.labruzzo
4778a70478
Merge remote-tracking branch 'origin/beta' into pubmed_fix
2024-11-13 16:28:39 +01:00
Claudio Atzori
4a3b173ca2
defaults to 0000 - Unknown in case the instance type lookup in the dnet:result_typologies doesn't find a corresponding result type binding
2024-11-13 16:27:00 +01:00
sandro.labruzzo
ac0a94d62d
updated pubmed parser to add also ORCID id and affiliation string to authors
2024-11-13 16:26:59 +01:00
Giambattista Bloisi
5ee8881646
Merge pull request '[danishfunders] added link for danish funders versus the unidentified project for IRFD (501100004836) CF (501100002808) and NNF(501100009708)' ( #502 ) from danishFunders_crossrefmap into beta
...
Reviewed-on: #502
2024-11-13 12:01:38 +01:00
Miriam Baglioni
fb1f0f8850
[danishfunders] added the possibility to link also versus a specif award if present in the metadata
2024-11-13 12:00:33 +01:00
Giambattista Bloisi
5b4d821bf9
Merge pull request 'Crossref: generate canonical openaire id for results in affiliation relationship' ( #507 ) from fix_crossref_affiliations into beta
...
Reviewed-on: #507
2024-11-13 11:01:37 +01:00
Giambattista Bloisi
03c262ccb9
Crossref: generate canonical openaire id for results in affiliation relationship
2024-11-13 10:56:17 +01:00
sandro.labruzzo
a1d5ad5c26
code formatted
2024-11-13 09:51:13 +01:00
sandro.labruzzo
b0478c380e
merged conflicts on beta
2024-11-13 09:43:16 +01:00
Claudio Atzori
07f267bb10
fix vocabulary lookup in mergeutils
2024-11-13 08:14:26 +01:00
Claudio Atzori
8088943399
Merge pull request 'enforce resulttype' ( #506 ) from merge_resulttypes into beta
...
Reviewed-on: #506
2024-11-12 14:20:22 +01:00
Claudio Atzori
6c5df761e2
enforce resulttype based on the dnet:result_typologies vocabulary and upon merge
2024-11-12 14:18:04 +01:00
Claudio Atzori
9f7a606ddd
Merge pull request 'betaFixPerson' ( #505 ) from betaFixPerson into beta
...
Reviewed-on: #505
2024-11-12 14:09:22 +01:00
Miriam Baglioni
250f101779
[person] fixed issue in creating project identifier for the graph for person->project relations
2024-11-11 16:04:06 +01:00
Miriam Baglioni
f1ea9da5bc
[person] checked type in inferenceprovenance
2024-11-11 15:37:56 +01:00
Miriam Baglioni
b0283fe94c
[person] fix provenance of pid in person when it is orcid (classid entityregistry to avoid the cleaning put orcid_pending)
2024-11-11 14:57:57 +01:00
sandro.labruzzo
474f365286
removed wrong test
2024-11-11 12:37:27 +01:00
sandro.labruzzo
19ce783e58
renamed workflow
2024-11-11 12:28:02 +01:00
Sandro La Bruzzo
0d0904f4ec
updated workflow baseline to direct transform on OAF
2024-11-11 10:27:23 +01:00
Giambattista Bloisi
f31f22801f
Merge pull request 'Remove ORCID information when the same ORCID ID is used multiple times in the same result for different authors' ( #503 ) from clean_clashing_orcids into beta
...
Reviewed-on: #503
2024-11-08 09:31:11 +01:00
Miriam Baglioni
6fd9ec8566
[danishfunders] added link for danish funders versus the unidentified project for IRFD (501100004836) CF (501100002808) and NNF(501100009708)
2024-11-07 13:55:31 +01:00
Miriam Baglioni
b9875f0095
[orcidenrichment] fixing issue
2024-11-07 13:36:24 +01:00
Giambattista Bloisi
8f5171557e
Remove ORCID information when the same ORCID ID is used multiple times in the same result for different authors
2024-11-07 12:22:34 +01:00
Claudio Atzori
f7bb53fe78
[orcid enrichment] added missing workflow parameter: workingDir
2024-11-07 01:04:43 +01:00
Miriam Baglioni
227e84be99
[orcidenrichment] fixing issue
2024-11-06 16:36:34 +01:00
Miriam Baglioni
e4f89f9800
[orcidenrichment] refactoring
2024-11-06 14:15:34 +01:00
Claudio Atzori
973aa7dca6
[dedup] force the Relation schema when reading the merge rels
2024-11-06 12:29:06 +01:00
Miriam Baglioni
939c84ede6
[orcidenrichment] refactoring
2024-11-06 10:16:54 +01:00
Miriam Baglioni
07a51c7361
[orcidenrichment] refactoring
2024-11-05 14:11:06 +01:00
Sandro La Bruzzo
c1cef5d685
removed old library joda time replaced with standard java.time introduced in java 8
2024-11-05 10:38:40 +01:00
Sandro La Bruzzo
a8ed5a3b04
Organized getters and setters in the PMArticle class for better readability and maintainability.
2024-11-04 17:45:28 +01:00
Miriam Baglioni
cbef9ecbd6
[OrcidPropagation] alignemnt of property file with new parameters
2024-11-04 12:42:11 +01:00
Miriam Baglioni
d2fc392814
[OrcidPropagation] new preparation step to use the authornamedisambiguation employed for orcid enrichment.
2024-11-04 12:41:32 +01:00
Giambattista Bloisi
aeaedeed01
Draft SparkPropagateOrcidAuthors
2024-10-30 15:23:12 +01:00
Giambattista Bloisi
d67f125614
Move AuthorMatchers in dhp-common
2024-10-30 15:23:05 +01:00
Claudio Atzori
a42c8b7c85
person table directory produced by the workflows raw_all and merge graphs
2024-10-30 11:25:17 +01:00
Claudio Atzori
a877c76d70
make MergeUtils.selectOldestDate less prone to errors when receiving invalid date formats
2024-10-30 11:24:25 +01:00
Claudio Atzori
26cdc7e439
Avoid NPEs in MergeUtils
2024-10-30 07:35:47 +01:00
Claudio Atzori
323c76eafc
patch relations job: removed non necessary logging
2024-10-30 07:35:30 +01:00
Miriam Baglioni
69aee609ef
[bulktag] align type to community api
2024-10-29 15:53:04 +01:00
Claudio Atzori
5ca031c8d6
[graph raw] rule out empty PIDs
2024-10-29 13:48:41 +01:00
Claudio Atzori
499892b67c
[graph raw] rule out empty PIDs
2024-10-29 09:51:30 +01:00
Claudio Atzori
e4504fd98d
[Person] fixed project identifier creation
2024-10-28 15:32:09 +01:00
Claudio Atzori
9b4415cb67
using _the right_ scala 2.11 converters
2024-10-28 13:56:25 +01:00
Claudio Atzori
e6ca382deb
using scala 2.11 converters
2024-10-28 13:52:06 +01:00
Claudio Atzori
940735921f
Merge pull request 'Fill mergedIds field and filter mergerels with dedup records actually created' ( #500 ) from mergedids into beta
...
Reviewed-on: #500
2024-10-28 13:43:09 +01:00
Giambattista Bloisi
56224e034a
Fill the new mergedIds field when generating dedup records
...
Filter out dedup records composed of invisible records only
Filter out mergerels that have not been used when creating the dedup record (ungrouping of cliques)
2024-10-28 13:31:01 +01:00
Miriam Baglioni
5916346ba1
[TransformativeAgreement] fix to remove the file downloaded from a previous run of the workflow
2024-10-28 12:18:50 +01:00
Claudio Atzori
e4abe55988
merged person_through_the_graph & code formatting
2024-10-28 11:01:49 +01:00