dnet-hadoop/dhp-workflows/dhp-aggregation/src/main/java/eu/dnetlib/dhp/collection
Giambattista Bloisi e64c2854a3 Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface
JsonPath cache contention fixed by using a ConcurrentHashMap
Blacklist filtering performance improvement
Minor performance improvements when evaluating similarity
Sorting in clustered elements is deterministic (by ordering and identity field, instead of ordering field only)
2023-07-24 15:36:24 +02:00
..
plugin Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
CollectorWorker.java [metadata collection] updated collector plugin name 2022-07-29 13:54:00 +02:00
CollectorWorkerApplication.java GetCSV refactoring - refactoring due to movement of classes 2021-08-12 18:20:56 +02:00
GenerateNativeStoreSparkJob.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
UnknownCollectorPluginException.java classes related to the collection workflow moved into common package; implemented MongoDB collection plugins 2021-02-12 12:31:02 +01:00