Miriam Baglioni
2d45f125a7
[bulktag subcommunities] refactoring and addition of new properties
2024-12-20 09:06:55 +01:00
Miriam Baglioni
2570023590
[Subcommunities] modified bulktagging workflow to include the new parameters
2024-11-21 14:47:17 +01:00
Miriam Baglioni
c0729ac279
[Subcommunities] added remapping to master datasource
2024-11-21 14:36:26 +01:00
Miriam Baglioni
ab96983647
[Subcommunities] added remapping to representative organization
2024-11-21 12:35:05 +01:00
Miriam Baglioni
0656ed568d
[Subcommunities] remove not needed methods used to create datasourceCommunityMap
2024-11-21 11:05:58 +01:00
Miriam Baglioni
ba9f1982b3
[Subcommunities] used the two new access point to directly get the organizationCOmmunityMap and the datasourceCommunityMap
2024-11-21 11:04:58 +01:00
Miriam Baglioni
9ee061ee90
[Subcommunities] added to the list of the communities also the sub community identifiers
2024-11-21 11:02:52 +01:00
Miriam Baglioni
896de42598
[CommunityAPI] use of new access point to directly get the organizationCommunityMap and the datasouceCommunityMap for all the communities and subcommunities. To be changed in the propagation code when implemented in the APIs
2024-11-20 17:44:33 +01:00
Miriam Baglioni
3081cad1d3
[CommunityAPI] refactoring
2024-11-20 14:03:59 +01:00
Miriam Baglioni
6beb94adee
[SubCommunity] Extention of the Utils methods to add also the associations between the subcommunities and organization/project/datasources
2024-11-20 10:59:49 +01:00
Miriam Baglioni
9dbcf19efb
[SubCommunity] Extention of communityApis to add also the associations between the subcommunities and organization/project/datasources
2024-11-20 09:16:33 +01:00
Miriam Baglioni
cea2de2c37
[SubCommunity] Extention of CommunityAPIs fro bulk tagging
2024-11-19 14:50:42 +01:00
Claudio Atzori
cf7d9a32ab
disable autoBroadcastJoin in the cleaning workflow
2024-11-15 09:17:28 +01:00
Claudio Atzori
5f512f510e
code formatting
2024-11-15 09:16:51 +01:00
Claudio Atzori
b95672b420
mergeUtils set the result identifier when enforcing the result type
2024-11-15 09:16:18 +01:00
Claudio Atzori
9e8849b753
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
2024-11-13 20:41:51 +01:00
Claudio Atzori
4a3b173ca2
defaults to 0000 - Unknown in case the instance type lookup in the dnet:result_typologies doesn't find a corresponding result type binding
2024-11-13 16:27:00 +01:00
Giambattista Bloisi
5ee8881646
Merge pull request '[danishfunders] added link for danish funders versus the unidentified project for IRFD (501100004836) CF (501100002808) and NNF(501100009708)' ( #502 ) from danishFunders_crossrefmap into beta
...
Reviewed-on: D-Net/dnet-hadoop#502
2024-11-13 12:01:38 +01:00
Miriam Baglioni
fb1f0f8850
[danishfunders] added the possibility to link also versus a specif award if present in the metadata
2024-11-13 12:00:33 +01:00
Giambattista Bloisi
5b4d821bf9
Merge pull request 'Crossref: generate canonical openaire id for results in affiliation relationship' ( #507 ) from fix_crossref_affiliations into beta
...
Reviewed-on: D-Net/dnet-hadoop#507
2024-11-13 11:01:37 +01:00
Giambattista Bloisi
03c262ccb9
Crossref: generate canonical openaire id for results in affiliation relationship
2024-11-13 10:56:17 +01:00
Claudio Atzori
07f267bb10
fix vocabulary lookup in mergeutils
2024-11-13 08:14:26 +01:00
Claudio Atzori
8088943399
Merge pull request 'enforce resulttype' ( #506 ) from merge_resulttypes into beta
...
Reviewed-on: D-Net/dnet-hadoop#506
2024-11-12 14:20:22 +01:00
Claudio Atzori
6c5df761e2
enforce resulttype based on the dnet:result_typologies vocabulary and upon merge
2024-11-12 14:18:04 +01:00
Claudio Atzori
9f7a606ddd
Merge pull request 'betaFixPerson' ( #505 ) from betaFixPerson into beta
...
Reviewed-on: D-Net/dnet-hadoop#505
2024-11-12 14:09:22 +01:00
Miriam Baglioni
250f101779
[person] fixed issue in creating project identifier for the graph for person->project relations
2024-11-11 16:04:06 +01:00
Miriam Baglioni
f1ea9da5bc
[person] checked type in inferenceprovenance
2024-11-11 15:37:56 +01:00
Miriam Baglioni
b0283fe94c
[person] fix provenance of pid in person when it is orcid (classid entityregistry to avoid the cleaning put orcid_pending)
2024-11-11 14:57:57 +01:00
Giambattista Bloisi
f31f22801f
Merge pull request 'Remove ORCID information when the same ORCID ID is used multiple times in the same result for different authors' ( #503 ) from clean_clashing_orcids into beta
...
Reviewed-on: D-Net/dnet-hadoop#503
2024-11-08 09:31:11 +01:00
Miriam Baglioni
6fd9ec8566
[danishfunders] added link for danish funders versus the unidentified project for IRFD (501100004836) CF (501100002808) and NNF(501100009708)
2024-11-07 13:55:31 +01:00
Giambattista Bloisi
8f5171557e
Remove ORCID information when the same ORCID ID is used multiple times in the same result for different authors
2024-11-07 12:22:34 +01:00
Claudio Atzori
f7bb53fe78
[orcid enrichment] added missing workflow parameter: workingDir
2024-11-07 01:04:43 +01:00
Claudio Atzori
973aa7dca6
[dedup] force the Relation schema when reading the merge rels
2024-11-06 12:29:06 +01:00
Claudio Atzori
a42c8b7c85
person table directory produced by the workflows raw_all and merge graphs
2024-10-30 11:25:17 +01:00
Claudio Atzori
a877c76d70
make MergeUtils.selectOldestDate less prone to errors when receiving invalid date formats
2024-10-30 11:24:25 +01:00
Claudio Atzori
26cdc7e439
Avoid NPEs in MergeUtils
2024-10-30 07:35:47 +01:00
Claudio Atzori
323c76eafc
patch relations job: removed non necessary logging
2024-10-30 07:35:30 +01:00
Miriam Baglioni
69aee609ef
[bulktag] align type to community api
2024-10-29 15:53:04 +01:00
Claudio Atzori
5ca031c8d6
[graph raw] rule out empty PIDs
2024-10-29 13:48:41 +01:00
Claudio Atzori
499892b67c
[graph raw] rule out empty PIDs
2024-10-29 09:51:30 +01:00
Claudio Atzori
e4504fd98d
[Person] fixed project identifier creation
2024-10-28 15:32:09 +01:00
Claudio Atzori
9b4415cb67
using _the right_ scala 2.11 converters
2024-10-28 13:56:25 +01:00
Claudio Atzori
e6ca382deb
using scala 2.11 converters
2024-10-28 13:52:06 +01:00
Claudio Atzori
940735921f
Merge pull request 'Fill mergedIds field and filter mergerels with dedup records actually created' ( #500 ) from mergedids into beta
...
Reviewed-on: D-Net/dnet-hadoop#500
2024-10-28 13:43:09 +01:00
Giambattista Bloisi
56224e034a
Fill the new mergedIds field when generating dedup records
...
Filter out dedup records composed of invisible records only
Filter out mergerels that have not been used when creating the dedup record (ungrouping of cliques)
2024-10-28 13:31:01 +01:00
Miriam Baglioni
5916346ba1
[TransformativeAgreement] fix to remove the file downloaded from a previous run of the workflow
2024-10-28 12:18:50 +01:00
Claudio Atzori
e4abe55988
merged person_through_the_graph & code formatting
2024-10-28 11:01:49 +01:00
Claudio Atzori
d71df6de19
Merge pull request 'affroNewModelonBeta' ( #494 ) from affroNewModelonBeta into beta
...
Reviewed-on: D-Net/dnet-hadoop#494
2024-10-28 10:48:34 +01:00
Claudio Atzori
1cdcd07a7e
Merge pull request 'dhp-schema upgrade & provision mapping 2' ( #499 ) from beta_provision_alignment_9.0.0 into beta
...
Reviewed-on: D-Net/dnet-hadoop#499
2024-10-28 10:44:08 +01:00
Claudio Atzori
6fd50266f1
translate 'otherresearchproduct' into 'other' when setting the related record type
2024-10-28 10:42:46 +01:00