dnet-hadoop/dhp-workflows/dhp-dedup-openaire/src/main/java/eu/dnetlib/dhp/oa/dedup
Giambattista Bloisi af49424b59 Add a "CleanRelation" action after the PropagateRelation to filter out all relations that have been deleyted by inference or that are pointing to dangling entities 2023-08-04 14:27:39 +02:00
..
graph Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
model Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
AbstractSparkAction.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
DatePicker.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
DedupRecordFactory.java Fix entityMerger that was excluding the authors of the first entity in the list to merge 2023-07-21 00:46:54 +02:00
DedupUtility.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
IdGenerator.java added IdentifierComparator 2022-11-09 14:20:59 +01:00
IdentifierComparator.java added IdentifierComparator 2022-11-09 14:20:59 +01:00
RelationAggregator.java applied intellij code cleanup 2021-05-14 10:58:12 +02:00
SparkBlockStats.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
SparkCleanRelation.scala Add a "CleanRelation" action after the PropagateRelation to filter out all relations that have been deleyted by inference or that are pointing to dangling entities 2023-08-04 14:27:39 +02:00
SparkCopyOpenorgsMergeRels.java core utilities in dhp-common moved in external module dhp-schemas 2021-04-27 15:44:01 +02:00
SparkCopyOpenorgsSimRels.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkCopyRelationsNoOpenorgs.java fixed filtering criteria applied in SparkCopyRelationsNoOpenorgs to keep the parent/child relations from OpenOrgs 2021-11-19 11:30:29 +01:00
SparkCreateDedupRecord.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkCreateMergeRels.java [dedup] tweaking num partitions, minor changes 2023-05-17 10:16:22 +02:00
SparkCreateOrgsDedupRecord.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkCreateSimRels.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
SparkPrepareNewOrgs.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkPrepareOrgRels.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkPropagateRelation.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkUpdateEntity.java suggestions from SonarLint 2021-08-11 12:13:22 +02:00
SparkWhitelistSimRels.java Refactor Dedup process to use Spark Dataframe API and intermediate representation with Row interface 2023-07-24 15:36:24 +02:00
UpdateOpenorgsJob.java set configuration property in openorgs duplicates wf 2021-10-07 15:39:55 +02:00