Miriam Baglioni
|
5c5a195e97
|
refactoring and fixing issue on property name
|
2023-10-23 11:26:17 +02:00 |
Miriam Baglioni
|
70b78a40c7
|
removed file from different propagation
|
2023-10-20 15:50:49 +02:00 |
Miriam Baglioni
|
f206ff42d6
|
modified code to use the the API. Removing not needed parameters. Rewritten the code to exploit the parallel stream on the entity types
|
2023-10-20 15:49:41 +02:00 |
Miriam Baglioni
|
34358afe75
|
modified resource file, workflow anf default-config. Add 3g of memory Overhead and specified the shuffle partition in the wf confiduration. Removed the multiple instantiation in the wf because of different implementation of the spark job
|
2023-10-20 15:48:27 +02:00 |
Miriam Baglioni
|
18bfff8af3
|
adding test classes and modifying test for bulktag
|
2023-10-20 15:47:03 +02:00 |
Miriam Baglioni
|
69dac91659
|
adding the new code to use the API instead of the Information Service
|
2023-10-20 15:45:52 +02:00 |
Claudio Atzori
|
242d647146
|
cleanup & docs
|
2023-10-12 12:23:44 +02:00 |
Claudio Atzori
|
af3ffad6c4
|
[AMF] docs
|
2023-10-12 10:07:52 +02:00 |
Claudio Atzori
|
d13bb534f0
|
Merge branch 'master' into fix_8997
|
2023-10-02 11:03:18 +02:00 |
Sandro La Bruzzo
|
9c3ab11d5b
|
Merge branch 'master' of code-repo.d4science.org:D-Net/dnet-hadoop
|
2023-09-25 15:29:19 +02:00 |
Sandro La Bruzzo
|
423ef30676
|
minor fix on the aggregation of uniprot and pdb
|
2023-09-25 15:28:58 +02:00 |
Claudio Atzori
|
4853c19b5e
|
code formatting
|
2023-09-20 15:53:21 +02:00 |
Giambattista Bloisi
|
1f226d1dce
|
Fix defect #8997: GenerateEventsJob is generating huge amounts of logs because broker entity similarity calculation consistently failed
|
2023-09-20 15:42:00 +02:00 |
Alessia Bardi
|
6186cdc2cc
|
Use v5 of the UNIBI Gold ISSN list in test
|
2023-09-19 14:47:01 +02:00 |
Alessia Bardi
|
d94b9bebf7
|
Merge branch 'master' of https://code-repo.d4science.org/D-Net/dnet-hadoop
|
2023-09-19 13:38:45 +02:00 |
Alessia Bardi
|
19abba8fa7
|
tests for d4science catalog
|
2023-09-19 13:38:25 +02:00 |
Serafeim Chatzopoulos
|
2aed5a74be
|
Run CC and RAM sequentieally in dhp-impact-indicators WF
|
2023-09-12 22:31:50 +03:00 |
Alessia Bardi
|
77a2199837
|
updated test for EOSC comunity
|
2023-09-08 11:05:49 +02:00 |
Claudio Atzori
|
265180bfd2
|
added Archive ouverte UNIGE (ETHZ.UNIGENF, opendoar____::1400) to the Datacite hostedBy_map
|
2023-09-07 11:20:35 +02:00 |
Claudio Atzori
|
da0e9828f7
|
resolved conflicts for PR#337
|
2023-09-06 11:28:46 +02:00 |
Claudio Atzori
|
adec6692ca
|
Merge branch 'beta' into invisible_relations
|
2023-09-04 16:13:06 +02:00 |
Claudio Atzori
|
15666e86a8
|
added collectedfrom to the affiliation relations imported from Crossref
|
2023-09-04 15:56:06 +02:00 |
Claudio Atzori
|
5b06c9d06f
|
[graph raw] datainfo.invisible set as true only for entities
|
2023-09-04 15:15:24 +02:00 |
Serafeim Chatzopoulos
|
7de0164c26
|
Fix import of affiliations relations from Crossref
|
2023-09-04 16:04:41 +03:00 |
Giambattista Bloisi
|
6b1c05d118
|
Add sparkExecutorMemoryOverhead workflow config to set off-heap memory for Spark actions. If not explicitly set it is defaulted to 1Gb
|
2023-08-29 16:04:19 +02:00 |
Claudio Atzori
|
bf35280ea6
|
code formatting
|
2023-08-29 11:11:00 +02:00 |
Claudio Atzori
|
58665a246c
|
Merge branch 'beta' into propagate_relation_rewrite
|
2023-08-29 10:47:02 +02:00 |
Claudio Atzori
|
f437be80ad
|
[impact indicators] adjusted paths in the bip ranker wf parameters
|
2023-08-29 09:03:03 +02:00 |
Giambattista Bloisi
|
d012aec0b3
|
Revert PropagateRelation's argument name from outputPath to graphOutputPath in consistency workflow (#8964)
|
2023-08-28 22:44:54 +02:00 |
Giambattista Bloisi
|
a860e19423
|
Fix ensure all relations are written out, not only those managed by dedup
|
2023-08-28 15:36:02 +02:00 |
Giambattista Bloisi
|
0d7b2bf83d
|
Rewrite SparkPropagateRelation exploiting Dataframe API
|
2023-08-28 10:34:54 +02:00 |
Miriam Baglioni
|
9c8b41475a
|
Merge pull request '8172_impact_indicators_workflow' (#284) from 8172_impact_indicators_workflow into beta
Reviewed-on: D-Net/dnet-hadoop#284
|
2023-08-14 15:50:48 +02:00 |
Serafeim Chatzopoulos
|
97c1ba8918
|
Merge actionsets of results and projects
|
2023-08-11 15:56:53 +03:00 |
Giambattista Bloisi
|
95cd2b9b1e
|
Make filterInvisible a mandatory parameter of DispathEntitiesSparkJob
Make filterInvisible a mandatory parameter of both dedup/consistency and graph/group oozie workflows
|
2023-08-10 11:53:48 +02:00 |
Giambattista Bloisi
|
fab9920271
|
DispatchEntitiesSparkJob: manage all entity types together, support filtering by dataInfo.invisible flag
|
2023-08-09 15:41:43 +02:00 |
Miriam Baglioni
|
c25ac21e5e
|
Merge pull request 'graph cleaning, suggestions from ticket 8898' (#325) from cleaning_8898 into beta
Reviewed-on: D-Net/dnet-hadoop#325
|
2023-08-08 11:14:19 +02:00 |
Miriam Baglioni
|
c334fe2438
|
Merge pull request 'Add a "CleanRelation" action after the PropagateRelation to filter out all relations that have been deleted by inference or that are pointing to dangling entities' (#328) from cleanup_relations_after_dedup into beta
Reviewed-on: D-Net/dnet-hadoop#328
|
2023-08-08 09:49:12 +02:00 |
Miriam Baglioni
|
0e2f855807
|
Merge pull request 'Updates Promotion DBs' (#321) from antonis.lempesis/dnet-hadoop:beta into beta
Reviewed-on: D-Net/dnet-hadoop#321
|
2023-08-07 12:09:16 +02:00 |
Miriam Baglioni
|
18fbe52b20
|
Merge pull request 'Import affiliation relations from Crossref' (#320) from 8876 into beta
Reviewed-on: D-Net/dnet-hadoop#320
|
2023-08-07 10:45:30 +02:00 |
Giambattista Bloisi
|
97b6d1dc45
|
Filter ids by dataInfo.deletedbyinference and DataInfo.invisible flags
Filter relations also by dataInfo.invisible flag
|
2023-08-07 10:24:11 +02:00 |
Giambattista Bloisi
|
af49424b59
|
Add a "CleanRelation" action after the PropagateRelation to filter out all relations that have been deleyted by inference or that are pointing to dangling entities
|
2023-08-04 14:27:39 +02:00 |
Claudio Atzori
|
0bc74e2000
|
code formatting
|
2023-08-02 11:52:10 +02:00 |
Claudio Atzori
|
11ffb9bd68
|
rule out records with NULL dataInfo
|
2023-07-31 12:35:33 +02:00 |
Claudio Atzori
|
ccac6a7f75
|
rule out records with NULL dataInfo
|
2023-07-31 12:35:05 +02:00 |
Serafeim Chatzopoulos
|
7cefe2665b
|
Remove unnecessary classes
|
2023-07-28 19:14:39 +03:00 |
Serafeim Chatzopoulos
|
26a92ce762
|
Merge branch '8876' of https://code-repo.d4science.org/D-Net/dnet-hadoop into 8876
|
2023-07-28 19:03:57 +03:00 |
Serafeim Chatzopoulos
|
ebfba38ab6
|
Add changes from code review
|
2023-07-28 19:03:47 +03:00 |
Serafeim Chatzopoulos
|
eb8684a8cf
|
Merge branch 'beta' into 8876
|
2023-07-28 13:39:33 +02:00 |
Claudio Atzori
|
a72b9e96ac
|
expand the instance level fulltext in the XML records
|
2023-07-27 14:57:38 +02:00 |
Claudio Atzori
|
59764145bb
|
cherry picked & fixed commit 270df939c4
|
2023-07-25 17:39:00 +02:00 |