forked from D-Net/dnet-hadoop
Lampros Smyrnaios
d46b78b659
- Set Steps 2-7 and 9 to limit the number of files generated by Spark from 8000 down to 100, to improve file-transfer and querying performance. - Allow the workflow to run up to Step10. Step11 seems to have some issues even when using hive-action. |
||
---|---|---|
dhp-actionmanager | ||
dhp-aggregation | ||
dhp-blacklist | ||
dhp-broker-events | ||
dhp-dedup-openaire | ||
dhp-doiboost | ||
dhp-enrichment | ||
dhp-graph-mapper | ||
dhp-graph-provision | ||
dhp-impact-indicators | ||
dhp-stats-actionsets | ||
dhp-stats-hist-snaps | ||
dhp-stats-monitor-irish | ||
dhp-stats-monitor-update | ||
dhp-stats-promote | ||
dhp-stats-update | ||
dhp-swh | ||
dhp-usage-raw-data-update | ||
dhp-usage-stats-build | ||
dhp-workflow-profiles | ||
src/site | ||
pom.xml |