Fix Sliding window: sliding window logic was not applied because a counter was not incremented
85546dfa2f
Fix Sliding window: sliding window logic was not applied because a counter was not incremented
[WIP] Refactor Dedup using Spark Dataframe API and Spark Row representation of data, misc optimizations
d041f6d2be
Move dnet-pace-core inside the project
467693bfcb
Move dnet-pace-core inside the project
b09aba91e3
Update copyDataToImpalaCluster.sh
317ae7b33a
Bug fixes
454ec4d8b0
[aggregator graph] added column alias when mapping organization PIDs from the OpenOrgs database
Remove duplicated code and ensure that load and initialization is done through "DedupConfig.load" method
5b6c361fe0
Remove duplicated code and ensure that load and initialization is done through "DedupConfig.load" method
758e662ab8
Revert "REmove duplicated code and ensure that load and initialization is done through "DedupConfig.load" method"
485f9d18cb
REmove duplicated code and ensure that load and initialization is done through "DedupConfig.load" method
Precompile blacklists patterns before evaluating clustering criteria