b6e9ffb7e3
computing stats after every table creation
2ed9c5504c
computing stats after every table creation
datasource table creation split in steps
619aa34a15
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
dbea7a4072
removed duplicate line
c9241dba0d
Merge pull request 'convert_hive_to_spark_actions' (#1) from convert_hive_to_spark_actions into beta
e9686365a2
Improve performance of creating the "result_fos" table by using a temp table to cache data that is requested multiple times.
ce0aee21cc
Improve performance of transferring the stats-DBs to another cluster and of querying the DBs' tables, by instructing Spark to create up to 100 files per table instead of thousands.
7b7dd32ad5
- Fix placement of some "set mapred.job.queue.name=analytics" statements and remove their unused "/*EOS*/" indicator.
7ce051d766
- Update the remaining hive-actions to spark-actions.
convert_hive_to_spark_actions