dnet-hadoop/dhp-workflows/dhp-doiboost/src/main/java/eu/dnetlib/doiboost/orcid
Enrico Ottonello efe4c2a9c5 authors and works are now updated in two separate spark actions of the wf 2020-12-12 02:06:21 +01:00
json original orcid xml data are stored in a field of the class that models orcid data 2020-12-09 09:45:19 +01:00
model action to convert lambda file into seq file; spark action to download updated authors 2020-11-23 09:49:22 +01:00
xml original orcid xml data are stored in a field of the class that models orcid data 2020-12-09 09:45:19 +01:00
ActivitiesDecompressor.java added wf to extract authors and works xml data from orcid dump to hdfs; added wf to download the lambda file (containing last orcid update information) from orcid to hdfs 2020-11-17 18:23:12 +01:00
ExtractXMLActivitiesData.java added wf to extract authors and works xml data from orcid dump to hdfs; added wf to download the lambda file (containing last orcid update information) from orcid to hdfs 2020-11-17 18:23:12 +01:00
ExtractXMLSummariesData.java added wf to extract authors and works xml data from orcid dump to hdfs; added wf to download the lambda file (containing last orcid update information) from orcid to hdfs 2020-11-17 18:23:12 +01:00
ORCIDToOAF.scala fixed log classes to make the ORCID test run 2020-06-09 18:07:14 +02:00
OrcidAuthorsDOIsDataGen.java separate workflow to parse orcid summaries, activities and generate dataset with no doi publications; test 2020-07-03 23:30:31 +02:00
OrcidDSManager.java first version of dataset successfully generated from orcid dump 2020 2020-11-06 13:47:50 +01:00
OrcidDownloader.java action to convert lambda file into seq file; spark action to download updated authors 2020-11-23 09:49:22 +01:00
SparkConvertORCIDToOAF.scala changed mapping ORCIDToOAF 2020-05-29 09:32:04 +02:00
SparkDownloadOrcidAuthors.java max concurrent executors set to 10, according to ORCID Director of Technology mail request 2020-11-24 17:49:32 +01:00
SparkDownloadOrcidWorks.java completed download function to wf; added accumulators 2020-12-04 21:13:49 +01:00
SparkGenLastModifiedSeq.java action to convert lambda file into seq file; spark action to download updated authors 2020-11-23 09:49:22 +01:00
SparkGenerateDoiAuthorList.java wf doi_authors generates one json data for each row 2020-12-07 15:28:10 +01:00
SparkUpdateOrcidAuthors.java authors and works are now updated in two separate spark actions of the wf 2020-12-12 02:06:21 +01:00
SparkUpdateOrcidDatasets.java authors and works are now updated in two separate spark actions of the wf 2020-12-12 02:06:21 +01:00
SparkUpdateOrcidWorks.java authors and works are now updated in two separate spark actions of the wf 2020-12-12 02:06:21 +01:00
SummariesDecompressor.java added wf to extract authors and works xml data from orcid dump to hdfs; added wf to download the lambda file (containing last orcid update information) from orcid to hdfs 2020-11-17 18:23:12 +01:00