Commit Graph

5573 Commits

Author SHA1 Message Date
Miriam Baglioni 071cc95afb [orcipPropagation] test to verify the merge for multiple enrichment from multiple sources. It covers also the check for persistency of other identifiers types 2024-12-20 14:41:09 +01:00
Miriam Baglioni 29611b2091 [orcipPropagation]rewritten in scala. generategraph again not abstract 2024-12-20 12:40:33 +01:00
Miriam Baglioni 1853da1e2c [orcipPropagation] moved from joinWith to join not lo loose the schema, and removed the field matched.col(id) since there is no more Seq(id) 2024-12-20 11:35:11 +01:00
Miriam Baglioni 7752d47776 [orcipPropagation]changes for merge for propagation 2024-12-20 10:20:32 +01:00
Miriam Baglioni a9ccd00483 [orcidPropagatio] - 2024-12-20 08:35:17 +01:00
Miriam Baglioni 3021dfda77 [orcidPropagatio] - 2024-12-19 15:14:09 +01:00
Miriam Baglioni ec4a90f669 [orcidPropagatio] added specific classid and classname for pid qualifier 2024-12-19 11:03:09 +01:00
Miriam Baglioni 345d69d11b [orcidPropagatio] changed the test to include parameter and changed the values for the new resources 2024-12-19 11:02:42 +01:00
Miriam Baglioni 60cfaf119b [orcidPropagatio] added specific orcid propagation classid and classname in the qualifier 2024-12-19 11:02:11 +01:00
Miriam Baglioni df5f1caa7a [orcidPropagatio] added classid and classname for qualifier of the pid 2024-12-19 11:01:45 +01:00
Miriam Baglioni d9be4a36d4 [orcidPropagatio] chenaged resources since the code assumes to have bidirectionality in the relations 2024-12-19 11:01:25 +01:00
Miriam Baglioni 668706a2e3 [orcidPropagatio] changed to avoid searching relations only within the same type of results 2024-12-19 11:00:55 +01:00
Miriam Baglioni 40c002b112 Merge remote-tracking branch 'origin/propagateorcid' into propagateorcid
# Conflicts:
#	dhp-common/src/main/java/eu/dnetlib/dhp/common/enrichment/Constants.java
#	dhp-common/src/main/scala/eu/dnetlib/dhp/common/author/SparkEnrichWithOrcidAuthors.scala
#	dhp-common/src/main/scala/eu/dnetlib/dhp/utils/ORCIDAuthorEnricher.scala
#	dhp-workflows/dhp-enrichment/src/main/java/eu/dnetlib/dhp/PropagationConstant.java
#	dhp-workflows/dhp-enrichment/src/main/java/eu/dnetlib/dhp/orcidtoresultfromsemrel/OrcidAuthors.java
#	dhp-workflows/dhp-enrichment/src/main/java/eu/dnetlib/dhp/orcidtoresultfromsemrel/SparkPropagateOrcidAuthor.java
#	dhp-workflows/dhp-graph-mapper/src/main/scala/eu/dnetlib/dhp/enrich/orcid/SparkEnrichGraphWithOrcidAuthors.scala
2024-12-18 09:23:21 +01:00
Giambattista Bloisi 71fe0374dc Revise propagation tests 2024-12-17 16:01:03 +01:00
Giambattista Bloisi d095b31ea8 [orcidenrichment] Fix lambda to avoid requiring serialization on enclosing class 2024-12-17 15:17:11 +01:00
Giambattista Bloisi 6260526fa1 [orcidenrichment] Fix imports and formatting 2024-12-17 15:17:11 +01:00
Giambattista Bloisi 64f4d7fb71 [orcidenrichment] When comparing authors manage the case of hyphenation and punctuations characters and normalizes utf strings 2024-12-17 15:17:11 +01:00
Giambattista Bloisi e03e8a39c0 [orcidenrichment] Do not match in case of ambiguity: two authors match and at least one of them has affiliation string 2024-12-17 15:17:11 +01:00
Miriam Baglioni fbc19ce4a8 [orcidenrichment] fixing issue 2024-12-17 15:17:11 +01:00
Miriam Baglioni 0a0f820dc7 [orcidenrichment] fixing issue 2024-12-17 15:17:11 +01:00
Miriam Baglioni f9531e0406 [orcidenrichment] refactoring 2024-12-17 15:17:11 +01:00
Miriam Baglioni 1b4bbb2691 [orcidenrichment] refactoring 2024-12-17 15:17:11 +01:00
Miriam Baglioni da9bbdede4 [orcidenrichment] refactoring 2024-12-17 15:17:11 +01:00
Miriam Baglioni eb83a34f64 [OrcidPropagation] alignemnt of property file with new parameters 2024-12-17 15:17:11 +01:00
Miriam Baglioni 0cae085786 [OrcidPropagation] new preparation step to use the authornamedisambiguation employed for orcid enrichment. 2024-12-17 15:17:11 +01:00
Giambattista Bloisi 43a9fe1ef4 Draft SparkPropagateOrcidAuthors 2024-12-17 15:17:11 +01:00
Giambattista Bloisi 36ca0b123e Move AuthorMatchers in dhp-common 2024-12-17 15:17:11 +01:00
sandro.labruzzo dccbcfd36c code formatted 2024-12-13 11:48:32 +01:00
sandro.labruzzo b039952d97 bug fixed on zenodo plugin 2024-12-13 10:43:27 +01:00
Miriam Baglioni 29a2a29666 Merge pull request '[research_fi] added plugin name to collectorplugins' (#519) from beta_researchfi into beta
Reviewed-on: #519
2024-12-12 09:12:05 +01:00
Miriam Baglioni 1b1fb9f1c2 [research_fi] added plugin name to collectorplugins 2024-12-11 16:38:02 +01:00
Giambattista Bloisi 101d9e830d JsonListMatch do not lower the extracted strings
Fix test configurations and assertions
2024-12-11 15:59:13 +01:00
Sandro La Bruzzo dd6ed31383 Merge remote-tracking branch 'origin/beta' into beta 2024-12-06 14:23:58 +01:00
Sandro La Bruzzo 0d05006114 code formatted 2024-12-06 14:23:47 +01:00
Claudio Atzori e4b814b3f1 code formatting 2024-12-06 13:58:39 +01:00
Claudio Atzori 5c7f7fb3b8 Merge pull request 'Add Collector Plugin for Zenodo Dumps' (#516) from zenodo_dump_collection into beta
Reviewed-on: #516
2024-12-06 13:51:10 +01:00
Claudio Atzori 9e6b1f2f24 Merge pull request 'Communities_patents' (#514) from Communities_patents into beta
Reviewed-on: #514
2024-12-06 13:50:43 +01:00
Miriam Baglioni 666155bafa [communityfromsemrelpropagation] changed resource to have deletedbyinference = false. 2024-12-06 12:26:41 +01:00
Miriam Baglioni ee84db7a6a [communityfromsemrelpropagation] added filtering to remove the deletedbyinference and invisible results 2024-12-06 12:20:13 +01:00
Claudio Atzori 77308ed525 Merge pull request 'Crossref Enhancements:' (#511) from crossref_mapping_improvement into beta
Reviewed-on: #511
2024-12-06 11:48:57 +01:00
Miriam Baglioni 302c4d044e Merge branch 'beta' into crossref_mapping_improvement 2024-12-06 11:45:37 +01:00
Claudio Atzori 60da306830 Merge pull request 'raid actionset wf' (#517) from raid_actionset into beta
Reviewed-on: #517
2024-12-06 10:04:20 +01:00
Claudio Atzori 8a5ba8df45 minor changes 2024-12-06 10:03:11 +01:00
Claudio Atzori dade7d5bb8 minor changes 2024-12-06 10:02:07 +01:00
Claudio Atzori f57446ad16 merge from beta 2024-12-06 09:50:42 +01:00
Michele De Bonis 1c144a4dcb minor change 2024-12-06 09:18:10 +01:00
Sandro La Bruzzo fd1038b44d removed a sneaky break that was committed by mistake. 2024-12-06 09:12:06 +01:00
Giambattista Bloisi fed13e083e Fix: do not import joda
formatting
2024-12-05 15:21:32 +01:00
Michele De Bonis 6af3fd16b6 attributes fixes 2024-12-05 14:39:42 +01:00
Michele De Bonis bde59a7c8f implementation of the utilities for the inclusion of raids in the graph 2024-12-05 11:09:30 +01:00