dnet-hadoop/dhp-workflows/dhp-dedup-openaire/src/test/java/eu/dnetlib/dhp/oa/dedup
Giambattista Bloisi 43b454399f - Bug fix in matchOrderedTokenAndAbbreviations algorithms where tokens with same initial character were always considered equal
- AuthorsMatch exploits the new matching strategy used for ORCID enhancements in #PR398: split author names in tokens, order the tokens, then check for matches of ordered full tokens or abbreviations
2024-04-15 18:19:29 +02:00
..
jpath Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
DatePickerTest.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
EntityMergerTest.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
IdGeneratorTest.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkDedupTest.java - Bug fix in matchOrderedTokenAndAbbreviations algorithms where tokens with same initial character were always considered equal 2024-04-15 18:19:29 +02:00
SparkOpenorgsDedupTest.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
SparkOpenorgsProvisionTest.java Include SparkCleanRelation logic in SparkPropagateRelation 2023-09-04 11:33:20 +02:00
SparkPublicationRootsTest.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
SparkPublicationRootsTest2.java extended dedup testing 2022-11-17 16:13:43 +01:00
SparkStatsTest.java Fix SparkStatsTest bug where parquet tables were incorrectly read as text files leading to unpredictable count() values 2023-07-19 14:24:52 +02:00