sourcePath
the source path
hive_db_name
the target hive database name
sparkDriverMemory
memory for driver process
sparkExecutorMemory
memory for individual executor
sparkExecutorCores
number of cores used by single executor
Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]
${jobTracker}
${nameNode}
yarn-cluster
cluster
MapGraphIntoDataFrame
eu.dnetlib.dhp.graph.SparkGraphImporterJob
dhp-graph-mapper-${projectVersion}.jar
--executor-memory ${sparkExecutorMemory}
--executor-cores ${sparkExecutorCores}
--driver-memory=${sparkDriverMemory}
--conf spark.extraListeners="com.cloudera.spark.lineage.NavigatorAppListener"
--conf spark.sql.queryExecutionListeners="com.cloudera.spark.lineage.NavigatorQueryListener"
--conf spark.sql.warehouse.dir="/user/hive/warehouse"
-mt yarn-cluster
--sourcePath${sourcePath}
--hive_db_name${hive_db_name}
--hive_metastore_uris${hive_metastore_uris}
${jobTracker}
${nameNode}
hive_db_name=${hive_db_name}