sourcePath
the source path
whitelist
the white list
allowedtypes
the allowed types
sparkDriverMemory
memory for driver process
sparkExecutorMemory
memory for individual executor
sparkExecutorCores
number of cores used by single executor
sparkExecutorNumber
number of executors used
writeUpdate
writes the information found for the update. No double check done if the information is already present
saveGraph
writes new version of the graph after the propagation step
Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]
${jobTracker}
${nameNode}
yarn-cluster
cluster
CountryPropagation
eu.dnetlib.dhp.countrypropagation.SparkCountryPropagationJob
dhp-propagation-${projectVersion}.jar
--num-executors=${sparkExecutorNumber}
--executor-memory=${sparkExecutorMemory}
--executor-cores=${sparkExecutorCores}
--driver-memory=${sparkDriverMemory}
--conf spark.extraListeners=${spark2ExtraListeners}
--conf spark.sql.queryExecutionListeners=${spark2SqlQueryExecutionListeners}
--conf spark.yarn.historyServer.address=${spark2YarnHistoryServerAddress}
--conf spark.eventLog.dir=${nameNode}${spark2EventLogDir}
--conf spark.dynamicAllocation.enabled=true
--conf spark.dynamicAllocation.maxExecutors=${spark2MaxExecutors}
-mt yarn-cluster
--sourcePath${sourcePath}
--whitelist${whitelist}
--allowedtypes${allowedtypes}
--hive_metastore_uris${hive_metastore_uris}
--writeUpdate${writeUpdate}
--saveGraph${saveGraph}