Miriam Baglioni
b0283fe94c
[person] fix provenance of pid in person when it is orcid (classid entityregistry, to avoid the cleaning setting orcid_pending)
2024-11-11 14:57:57 +01:00
Claudio Atzori
f7bb53fe78
[orcid enrichment] added missing workflow parameter: workingDir
2024-11-07 01:04:43 +01:00
Claudio Atzori
973aa7dca6
[dedup] force the Relation schema when reading the merge rels
2024-11-06 12:29:06 +01:00
Claudio Atzori
a42c8b7c85
person table directory produced by the workflows raw_all and merge graphs
2024-10-30 11:25:17 +01:00
Claudio Atzori
323c76eafc
patch relations job: removed unnecessary logging
2024-10-30 07:35:30 +01:00
Miriam Baglioni
69aee609ef
[bulktag] align type to community api
2024-10-29 15:53:04 +01:00
Claudio Atzori
499892b67c
[graph raw] rule out empty PIDs
2024-10-29 09:51:30 +01:00
Claudio Atzori
e4504fd98d
[Person] fixed project identifier creation
2024-10-28 15:32:09 +01:00
Claudio Atzori
9b4415cb67
using _the right_ scala 2.11 converters
2024-10-28 13:56:25 +01:00
Claudio Atzori
e6ca382deb
using scala 2.11 converters
2024-10-28 13:52:06 +01:00
Claudio Atzori
940735921f
Merge pull request 'Fill mergedIds field and filter mergerels with dedup records actually created' (#500) from mergedids into beta
...
Reviewed-on: D-Net/dnet-hadoop#500
2024-10-28 13:43:09 +01:00
Giambattista Bloisi
56224e034a
Fill the new mergedIds field when generating dedup records
...
Filter out dedup records composed of invisible records only
Filter out mergerels that have not been used when creating the dedup record (ungrouping of cliques)
2024-10-28 13:31:01 +01:00
Miriam Baglioni
5916346ba1
[TransformativeAgreement] fix to remove the file downloaded from a previous run of the workflow
2024-10-28 12:18:50 +01:00
Claudio Atzori
e4abe55988
merged person_through_the_graph & code formatting
2024-10-28 11:01:49 +01:00
Claudio Atzori
d71df6de19
Merge pull request 'affroNewModelonBeta' (#494) from affroNewModelonBeta into beta
...
Reviewed-on: D-Net/dnet-hadoop#494
2024-10-28 10:48:34 +01:00
Claudio Atzori
1cdcd07a7e
Merge pull request 'dhp-schema upgrade & provision mapping 2' (#499) from beta_provision_alignment_9.0.0 into beta
...
Reviewed-on: D-Net/dnet-hadoop#499
2024-10-28 10:44:08 +01:00
Claudio Atzori
6fd50266f1
translate 'otherresearchproduct' into 'other' when setting the related record type
2024-10-28 10:42:46 +01:00
Claudio Atzori
dffa376eb6
Merge pull request 'dhp-schema upgrade & provision mapping' (#498) from beta_provision_alignment_9.0.0 into beta
...
Reviewed-on: D-Net/dnet-hadoop#498
2024-10-28 10:03:24 +01:00
Claudio Atzori
32fa579b80
[graph provision] select the longest abstract
2024-10-28 10:03:02 +01:00
Claudio Atzori
67e37f41fb
Merge pull request 'blacklist filtering moved before the cleanup phase in order to have case sensitive regex' (#485) from dedup_blacklist_fix into beta
...
Reviewed-on: D-Net/dnet-hadoop#485
2024-10-28 09:42:51 +01:00
Miriam Baglioni
0fb6af5586
Updated main pom dependency against dhp-schema, from 8.0.1 to 9.0.0. The new fields included in the updated schema module are populated by the Solr JSON payload mapping, which also limits the number of authors serialised to 200.
2024-10-25 16:28:50 +02:00
Claudio Atzori
46dbb62598
Merge pull request '#9839: include claimed affiliation relationships' (#476) from claim-orgs into beta
...
Reviewed-on: D-Net/dnet-hadoop#476
2024-10-25 10:12:59 +02:00
Claudio Atzori
4a9aeb6238
Merge pull request '9126-impact-indicators-wf-optimisation' (#471) from 9126-impact-indicators-wf-optimisation into beta
...
Reviewed-on: D-Net/dnet-hadoop#471
2024-10-25 10:10:44 +02:00
Claudio Atzori
8172bee8c8
Merge pull request 'Minor fixes' (#496) from beta_fixes_oct into beta
...
Reviewed-on: D-Net/dnet-hadoop#496
2024-10-25 10:09:56 +02:00
Miriam Baglioni
1fce7d5a0f
[Person] remove the isolated nodes from the person set
2024-10-25 10:05:17 +02:00
Miriam Baglioni
842cc75dae
[AffRo] fix name
2024-10-25 09:42:52 +02:00
Miriam Baglioni
e75326d6ec
[FundersMatchFromCrossref] added match from CrossRef to DFG unidentified project
2024-10-25 09:13:54 +02:00
Miriam Baglioni
32f444984e
[person] -
2024-10-24 17:51:42 +02:00
Miriam Baglioni
cab8f1135f
[affroNewModel] -
2024-10-24 17:44:33 +02:00
Miriam Baglioni
c93bf82487
[affroNewModel] extended wf definition
2024-10-24 17:34:34 +02:00
Miriam Baglioni
a7699558ed
[person] -
2024-10-24 16:15:12 +02:00
Miriam Baglioni
01679c935a
[person] added test class to be implemented
2024-10-24 15:27:06 +02:00
Miriam Baglioni
c773421cc7
[person] added new substep in propagation workflow main
2024-10-24 14:44:13 +02:00
Miriam Baglioni
cf07ed9058
[person] refactoring
2024-10-24 14:35:14 +02:00
Miriam Baglioni
c921cf7ee0
[personEntity] removed the deletedbyinference results (not indexed, but still in the graph). Changed the writing mode: append instead of overwrite
2024-10-24 09:57:20 +02:00
Giambattista Bloisi
aa7b8fd014
Use workingDir parameter for temporary data of ORCID enrichment
2024-10-23 14:02:17 +02:00
Giambattista Bloisi
0e34b0ece1
Fix imports: point them from the main distribution packages
2024-10-23 14:01:52 +02:00
Miriam Baglioni
aac5eb3499
[personEntity] changed the data info for the relations with projects. added missing parameters to the job.properties file
2024-10-22 11:54:16 +02:00
Miriam Baglioni
821540f94a
[personEntity] updated the property file to include also the db parameters. The same for the wf definition. Refactoring for compilation
2024-10-22 10:13:30 +02:00
Miriam Baglioni
09a2c93fc7
[personEntity] added relations with projects extracting the info from the database
2024-10-21 16:21:15 +02:00
Miriam Baglioni
ce4ee1189f
[personEntity] create entity for each profile in orcid even without works. Added validated true to each relation coming from orcid data
2024-10-21 14:38:15 +02:00
Miriam Baglioni
2b27afaec8
[createASfromAffRo] refactoring after compilation
2024-10-18 16:22:51 +02:00
Miriam Baglioni
0e5dd14538
[createASfromAffRo] adding the provenance datasource used to get the relation (no datasource can be webcrawl = publisher, rawaff means oalex)
2024-10-18 16:22:21 +02:00
Claudio Atzori
62ff843334
adopting dhp-schemas:8.0.1 to support Author's rawAffiliationString(s). Improved graph2hive implementation
2024-10-08 16:22:54 +02:00
Claudio Atzori
d5867a1992
merged #490
2024-10-08 15:39:59 +02:00
Claudio Atzori
e5df68772d
[graph provision] fixed serialisation of the usage counts as measures in the XML records
2024-10-02 09:35:21 +02:00
Miriam Baglioni
7e6d12fa77
[UsageCount] fixed error
...
(cherry picked from commit 9c9a9562ae)
2024-10-01 15:55:07 +02:00
Miriam Baglioni
191fc3a461
[UsageCount] add check in case the datasource is not matched against those present in the graph
...
(cherry picked from commit b42bdd5fb3)
2024-10-01 15:54:31 +02:00
Claudio Atzori
10696f2a44
reverted procedure for creating the UsageCounts actionset
2024-10-01 15:54:13 +02:00
Claudio Atzori
5734b80861
Merge pull request 'datasource table creation split in steps' (#489) from antonis.lempesis/dnet-hadoop:beta into beta
...
Reviewed-on: D-Net/dnet-hadoop#489
2024-09-30 16:34:38 +02:00