Claudio Atzori
|
6856ab28ab
|
Merge pull request 'SWH_integration' (#343) from SWH_integration into beta
Reviewed-on: D-Net/dnet-hadoop#343
|
2023-10-06 14:15:56 +02:00 |
Claudio Atzori
|
3c23d5f9bc
|
Merge branch 'beta' into SWH_integration
|
2023-10-06 14:15:38 +02:00 |
Claudio Atzori
|
858931ccb6
|
[SWH] compress the output actionset
|
2023-10-06 14:03:33 +02:00 |
Claudio Atzori
|
f759b18bca
|
[SWH] aligned parameter name
|
2023-10-06 13:43:20 +02:00 |
Claudio Atzori
|
eed9fe0902
|
code formatting
|
2023-10-06 12:31:17 +02:00 |
Claudio Atzori
|
7f27111b1f
|
Merge branch 'importpoci' into beta
|
2023-10-06 12:23:28 +02:00 |
Claudio Atzori
|
73c49b8d26
|
Merge branch 'beta' into SWH_integration
|
2023-10-06 12:21:51 +02:00 |
Sandro La Bruzzo
|
13f332ce77
|
ignored jenv prop
|
2023-10-06 10:40:05 +02:00 |
Serafeim Chatzopoulos
|
1bb83b9188
|
Add prefix in SWH ID
|
2023-10-04 20:31:45 +03:00 |
Claudio Atzori
|
ee8a39e7d2
|
cleanup and refinements
|
2023-10-04 12:32:05 +02:00 |
Serafeim Chatzopoulos
|
e9f24df21c
|
Move SWH API Key from constants to workflow param
|
2023-10-03 20:57:57 +03:00 |
Serafeim Chatzopoulos
|
cae75fc75d
|
Add SWH in the collectedFrom field
|
2023-10-03 16:55:10 +03:00 |
Serafeim Chatzopoulos
|
b49a3ac9b2
|
Add actionsetsPath as a global WF param
|
2023-10-03 15:43:38 +03:00 |
Serafeim Chatzopoulos
|
24c43e0c60
|
Restructure workflow parameters
|
2023-10-03 15:11:58 +03:00 |
Serafeim Chatzopoulos
|
9f73d93e62
|
Add param for limiting repo Urls
|
2023-10-03 14:39:08 +03:00 |
Claudio Atzori
|
f344ad76d0
|
Merge pull request 'extended existing code to import of POCI from open citation' (#340) from importpoci into beta
Reviewed-on: D-Net/dnet-hadoop#340
|
2023-10-03 10:52:11 +02:00 |
Claudio Atzori
|
5919e488dd
|
Merge branch 'beta' into importpoci
|
2023-10-03 10:43:53 +02:00 |
Serafeim Chatzopoulos
|
839a8524e7
|
Add action for creating actionsets
|
2023-10-02 23:50:38 +03:00 |
Miriam Baglioni
|
d7fccdc64b
|
fixed paths in wf to match the req of the pathname
|
2023-10-02 14:10:57 +02:00 |
Miriam Baglioni
|
9898470b0e
|
Addressing comments in D-Net/dnet-hadoop#340\#issuecomment-10592
|
2023-10-02 12:54:16 +02:00 |
Giambattista Bloisi
|
c412dc162b
|
Fix bug in conversion from dedup json model to Spark Dataset of Rows: list of strings contained the json escaped representation of the value instead of the plain value, this caused instanceTypeMatch failures because of the leading and trailing double quotes
|
2023-10-02 11:34:51 +02:00 |
Claudio Atzori
|
5d09b7db8b
|
Merge pull request 'SparkPropagateRelation relations do not propagate deletedByInference and invisible' (#333) from consistency_keep_mergerels into beta
Reviewed-on: D-Net/dnet-hadoop#333
|
2023-10-02 11:27:57 +02:00 |
Claudio Atzori
|
7b403a920f
|
Merge branch 'beta' into consistency_keep_mergerels
|
2023-10-02 11:26:00 +02:00 |
Claudio Atzori
|
dc86018a5f
|
Merge branch 'merge_entities_job' into beta
|
2023-10-02 11:24:48 +02:00 |
Giambattista Bloisi
|
3c47920c78
|
Use asScala to convert java List to Scala Sequence
|
2023-10-02 11:04:47 +02:00 |
Claudio Atzori
|
7f244d9a7a
|
code formatting
|
2023-10-02 11:04:36 +02:00 |
Giambattista Bloisi
|
e239b81740
|
Fix defect #8997: GenerateEventsJob is generating huge amounts of logs because broker entity similarity calculation consistently failed
|
2023-10-02 11:04:18 +02:00 |
Miriam Baglioni
|
e84f5b5e64
|
extended existing codo to accomodate import of POCI from open citation
|
2023-10-02 09:25:16 +02:00 |
Serafeim Chatzopoulos
|
ab0d70691c
|
Add step for archiving repoUrls to SWH
|
2023-09-28 20:56:18 +03:00 |
Serafeim Chatzopoulos
|
ed9c81a0b7
|
Add steps to collect last visit data && archive not found repository URLs
|
2023-09-27 19:00:54 +03:00 |
Alessia Bardi
|
0935d7757c
|
Use v5 of the UNIBI Gold ISSN list in test
|
2023-09-20 15:41:35 +02:00 |
Alessia Bardi
|
cc7204a089
|
tests for d4science catalog
|
2023-09-20 15:38:32 +02:00 |
Sandro La Bruzzo
|
76476cdfb6
|
Added maven repo for dependencies that are not in maven central
|
2023-09-20 10:33:14 +02:00 |
Serafeim Chatzopoulos
|
9d44418d38
|
Add collecting software code repository URLs
|
2023-09-14 18:43:25 +03:00 |
Serafeim Chatzopoulos
|
395a4af020
|
Run CC and RAM sequentieally in dhp-impact-indicators WF
|
2023-09-13 08:59:40 +02:00 |
Claudio Atzori
|
8a6892cc63
|
[graph dedup] consistency wf should not remove the relations while dispatching the entities
|
2023-09-12 21:27:05 +02:00 |
Claudio Atzori
|
4786aa0e09
|
added Archive ouverte UNIGE (ETHZ.UNIGENF, opendoar____::1400) to the Datacite hostedBy_map
|
2023-09-07 11:21:07 +02:00 |
Claudio Atzori
|
9f5d16624c
|
Merge pull request '[graph raw] datainfo.invisible set as true only for entities' (#336) from invisible_relations into beta
Reviewed-on: D-Net/dnet-hadoop#336
|
2023-09-04 16:14:47 +02:00 |
Claudio Atzori
|
adec6692ca
|
Merge branch 'beta' into invisible_relations
|
2023-09-04 16:13:06 +02:00 |
Claudio Atzori
|
15666e86a8
|
added collectedfrom to the affiliation relations imported from Crossref
|
2023-09-04 15:56:06 +02:00 |
Claudio Atzori
|
7d6bd4f20b
|
Merge pull request 'Fix import of affiliations relations from Crossref' (#335) from 8876_fix_crossref_affiliation_relations_import into beta
Reviewed-on: D-Net/dnet-hadoop#335
|
2023-09-04 15:19:58 +02:00 |
Claudio Atzori
|
5b06c9d06f
|
[graph raw] datainfo.invisible set as true only for entities
|
2023-09-04 15:15:24 +02:00 |
Serafeim Chatzopoulos
|
7de0164c26
|
Fix import of affiliations relations from Crossref
|
2023-09-04 16:04:41 +03:00 |
Giambattista Bloisi
|
2caaaec42d
|
Include SparkCleanRelation logic in SparkPropagateRelation
SparkPropagateRelation includes merge relations
Revised tests for SparkPropagateRelation
|
2023-09-04 11:33:20 +02:00 |
Giambattista Bloisi
|
6cc7d8ca7b
|
GroupEntities and DispatchEntites are now merged in GroupEntitiesSparkJob
|
2023-08-30 10:43:31 +02:00 |
Claudio Atzori
|
488d9a1cea
|
Merge pull request 'Add sparkExecutorMemoryOverhead workflow config to set off-heap memory for Spark actions. If not explicitly set it is defaulted to 1Gb' (#331) from consistencywf_memoryoverhead_conf into beta
Reviewed-on: D-Net/dnet-hadoop#331
|
2023-08-29 16:31:36 +02:00 |
Giambattista Bloisi
|
6b1c05d118
|
Add sparkExecutorMemoryOverhead workflow config to set off-heap memory for Spark actions. If not explicitly set it is defaulted to 1Gb
|
2023-08-29 16:04:19 +02:00 |
Claudio Atzori
|
bf35280ea6
|
code formatting
|
2023-08-29 11:11:00 +02:00 |
Claudio Atzori
|
0515d81c7c
|
Merge pull request 'Rewrite SparkPropagateRelation exploiting Dataframe API' (#330) from propagate_relation_rewrite into beta
Reviewed-on: D-Net/dnet-hadoop#330
|
2023-08-29 10:47:14 +02:00 |
Claudio Atzori
|
58665a246c
|
Merge branch 'beta' into propagate_relation_rewrite
|
2023-08-29 10:47:02 +02:00 |