dnet-hadoop/dhp-workflows/dhp-graph-mapper/src/main/scala/eu/dnetlib/dhp/sx/graph
Giambattista Bloisi e64c2854a3 Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface
JsonPath cache contention fixed by using a ConcurrentHashMap
Blacklist filtering performance improvement
Minor performance improvements when evaluating similarity
Sorting in clustered elements is deterministic (by ordering and identity field, instead of ordering field only)
2023-07-24 15:36:24 +02:00
..
SparkConvertDatasetToJsonRDD.scala scalafmt: code formatting 2022-01-11 16:57:48 +01:00
SparkConvertObjectToJson.scala implemented oozie workflow to generate scholix dump filtering relclass semantic 2022-07-20 16:34:32 +02:00
SparkConvertRDDtoDataset.scala Changed the way Scholexplorer filter the relationships, I found that filter all relation coming from openCitation is wrong, because we loose a lot of relation than intersect OpenCitation, but they don't come only from there 2022-10-24 12:53:47 +02:00
SparkCreateInputGraph.scala Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
SparkCreateScholix.scala [scholix] fixed OpenCitation dump procedure 2022-08-10 17:39:29 +02:00
SparkCreateSummaryObject.scala scalafmt: code formatting 2022-01-11 16:57:48 +01:00