orcidPath
the path of the orcid Table generated by the dump
graphPath
the path of the graph we want to apply enrichment
targetPath
the output path of the graph enriched
spark2ExtraListeners
com.cloudera.spark.lineage.NavigatorAppListener
spark 2.* extra listeners classname
spark2SqlQueryExecutionListeners
com.cloudera.spark.lineage.NavigatorQueryListener
spark 2.* sql query execution listeners classname
Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]
yarn
cluster
Enrich Graph with ORCID
eu.dnetlib.dhp.enrich.orcid.SparkEnrichGraphWithOrcidAuthors
dhp-graph-mapper-${projectVersion}.jar
--executor-memory=${sparkExecutorMemory}
--executor-cores=${sparkExecutorCores}
--driver-memory=${sparkDriverMemory}
--conf spark.executor.memoryOverhead=${sparkExecutorMemory}
--conf spark.sql.shuffle.partitions=5000
--conf spark.extraListeners=${spark2ExtraListeners}
--conf spark.sql.queryExecutionListeners=${spark2SqlQueryExecutionListeners}
--conf spark.yarn.historyServer.address=${spark2YarnHistoryServerAddress}
--conf spark.eventLog.dir=${nameNode}${spark2EventLogDir}
--orcidPath${orcidPath}
--targetPath${targetPath}
--graphPath${graphPath}
--masteryarn
${nameNode}/${graphPath}/datasource
${nameNode}/${targetPath}/datasource
${nameNode}/${graphPath}/organization
${nameNode}/${targetPath}/organization
${nameNode}/${graphPath}/project
${nameNode}/${targetPath}/project
${nameNode}/${graphPath}/relation
${nameNode}/${targetPath}/relation