[dedup] use common saveParquet and save methods to ensure outputs are compressed #349

Merged
claudio.atzori merged 1 commits from fix_dedup_not_compressed into beta 2023-10-16 11:56:18 +02:00

This PR uses the common dataset writing methods available in the eu.dnetlib.dhp.oa.dedup.AbstractSparkAction class to ensure that all the actions defined in the deduplication workflow produce a compressed output.

This PR uses the common dataset writing methods available in the `eu.dnetlib.dhp.oa.dedup.AbstractSparkAction` class to ensure that all the actions defined in the deduplication workflow produce a compressed output.
claudio.atzori added 1 commit 2023-10-16 10:58:32 +02:00
claudio.atzori requested review from giambattista.bloisi 2023-10-16 10:59:11 +02:00
claudio.atzori added this to the OpenAIRE project 2023-10-16 11:26:38 +02:00

Looks good to me

Looks good to me
claudio.atzori merged commit 389e3fcc59 into beta 2023-10-16 11:56:18 +02:00
claudio.atzori deleted branch fix_dedup_not_compressed 2023-10-16 11:56:19 +02:00
claudio.atzori modified the project from OpenAIRE to OpenAIRE - DNet 2023-10-26 10:00:04 +02:00
Sign in to join this conversation.
No description provided.