giambattista.bloisi
pushed to dedup-with-dataframe-spark34 at D-Net/dnet-hadoop
2023-07-14 16:05:54 +02:00
b6a8be813b
oozie.launcher.mapreduce.user.classpath.first property is required to avoid launch problems
giambattista.bloisi
deleted branch beta_with_pace_core from D-Net/dnet-hadoop
2023-07-11 14:03:16 +02:00
ef493681d9
Merge pull request 'Import dnet-pace-core module in this project and use it after renaming to dhp-pace-core' (#319) from beta_with_pace_core into beta
801da2fd4a
New sources formatted by maven plugin
bd3fcf869a
rename dnet-pace-core into dhp-pace-core module and use it as dependency in other modules
3b35db5fbd
Import dnet-pace-core module from dnet-dedup repository
6210f6ee48
Merge pull request 'Precompile blacklists patterns before evaluating clustering criteria' (#1) from optimized-clustering into master
Import dnet-pace-core module in this project and use it after renaming to dhp-pace-core
giambattista.bloisi
created branch dedup-with-dataframe-spark34 in D-Net/dnet-hadoop
2023-07-10 15:54:57 +02:00
giambattista.bloisi
pushed to dedup-with-dataframe-spark34 at D-Net/dnet-hadoop
2023-07-10 15:54:57 +02:00
d80f12da06
Build with spark 3.4 (dedup and dependencies only tested)
745e70e0d7
When generating similarities put as 'from' component the one with smaller lexicographic id
dcc08cc512
Use UDAF and Aggregation class for testing
df19548c56
small changes
Import dnet-pace-core module in this project and use it after renaming to dhp-pace-core
bd3fcf869a
rename dnet-pace-core into dhp-pace-core module and use it as dependency in other modules
giambattista.bloisi
created branch beta_with_pace_core in D-Net/dnet-hadoop
2023-07-05 22:25:13 +02:00
giambattista.bloisi
deleted branch import_dedup_project from D-Net/dnet-hadoop
2023-07-05 21:02:57 +02:00