graphBasePath
the path of the graph
unresolvedPath
the path of the unresolved Entities
targetPath
the target path after resolution
shouldResolveEntities
true
allows to activate/deactivate the resolution process over the entities
Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]
yarn
cluster
Resolve Relations in raw graph
eu.dnetlib.dhp.oa.graph.resolution.SparkResolveRelation
dhp-graph-mapper-${projectVersion}.jar
--executor-memory=${sparkExecutorMemory}
--executor-cores=${sparkExecutorCores}
--driver-memory=${sparkDriverMemory}
--conf spark.extraListeners=${spark2ExtraListeners}
--conf spark.sql.shuffle.partitions=15000
--conf spark.sql.queryExecutionListeners=${spark2SqlQueryExecutionListeners}
--conf spark.yarn.historyServer.address=${spark2YarnHistoryServerAddress}
--conf spark.eventLog.dir=${nameNode}${spark2EventLogDir}
--masteryarn
--graphBasePath${graphBasePath}
--workingPath${workingDir}
--targetPath${targetPath}
${wf:conf('shouldResolveEntities') eq false}
${wf:conf('shouldResolveEntities') eq true}
yarn
cluster
Resolve Entities in raw graph
eu.dnetlib.dhp.oa.graph.resolution.SparkResolveEntities
dhp-graph-mapper-${projectVersion}.jar
--executor-memory=${sparkExecutorMemory}
--executor-cores=${sparkExecutorCores}
--driver-memory=${sparkDriverMemory}
--conf spark.extraListeners=${spark2ExtraListeners}
--conf spark.sql.shuffle.partitions=10000
--conf spark.sql.queryExecutionListeners=${spark2SqlQueryExecutionListeners}
--conf spark.yarn.historyServer.address=${spark2YarnHistoryServerAddress}
--conf spark.eventLog.dir=${nameNode}${spark2EventLogDir}
--masteryarn
--graphBasePath${graphBasePath}
--unresolvedPath${unresolvedPath}
--workingPath${workingDir}
--targetPath${targetPath}
${nameNode}/${graphBasePath}/publication
${nameNode}/${targetPath}/publication
${nameNode}/${graphBasePath}/dataset
${nameNode}/${targetPath}/dataset
${nameNode}/${graphBasePath}/otherresearchproduct
${nameNode}/${targetPath}/otherresearchproduct
${nameNode}/${graphBasePath}/software
${nameNode}/${targetPath}/software
${nameNode}/${graphBasePath}/organization
${nameNode}/${targetPath}/organization
${nameNode}/${graphBasePath}/project
${nameNode}/${targetPath}/project
${nameNode}/${graphBasePath}/datasource
${nameNode}/${targetPath}/datasource