dnet-hadoop/dhp-workflows/dhp-aggregation/src/main/java/eu/dnetlib/dhp/actionmanager/project/utils
Giambattista Bloisi e64c2854a3 Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface
JsonPath cache contention fixed by using a ConcurrentHashMap
Blacklist filtering performance improvement
Minor performance improvements when evaluating similarity
Sorting in clustered elements is deterministic (by ordering and identity field, instead of ordering field only)
2023-07-24 15:36:24 +02:00
..
model [ECclassification] new implementation for the H2020 classification 2023-03-02 11:14:03 +01:00
EXCELParser.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
ExtractFromZip.java [ECclassification] added new classes 2023-03-01 15:29:11 +01:00
ReadCSV.java [ECclassification] added new classes 2023-03-01 15:29:11 +01:00
ReadExcel.java GetCSV refactoring - refactoring due to movement 2021-08-12 18:03:14 +02:00
ReadProjects.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
ReadTopics.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00