dnet-hadoop/dhp-workflows/dhp-dedup-openaire/src/test/java/eu/dnetlib/dhp/oa/dedup
Giambattista Bloisi 3c66e3bd7b Create dedup record for "merged" pivots
Do not create dedup records for groups that have more than 20 different acceptance dates
2024-01-10 22:59:52 +01:00
jpath Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
DatePickerTest.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
EntityMergerTest.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
IdGeneratorTest.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkDedupTest.java Create dedup record for "merged" pivots 2024-01-10 22:59:52 +01:00
SparkOpenorgsDedupTest.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
SparkOpenorgsProvisionTest.java Include SparkCleanRelation logic in SparkPropagateRelation 2023-09-04 11:33:20 +02:00
SparkPublicationRootsTest.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
SparkPublicationRootsTest2.java extended dedup testing 2022-11-17 16:13:43 +01:00
SparkStatsTest.java Fix SparkStatsTest bug where parquet tables were incorrectly read as text files leading to unpredictable count() values 2023-07-19 14:24:52 +02:00