dnet-hadoop/dhp-workflows/dhp-doiboost/src/main/java/eu/dnetlib/doiboost/orcid
Enrico Ottonello 465ce39f75 job execution now based on file last_update.txt on hdfs 2021-02-04 10:44:04 +01:00
..
json original orcid xml data are stored in a field of the class that models orcid data 2020-12-09 09:45:19 +01:00
model action to convert lambda file in seq file; spark action to download updated authors 2020-11-23 09:49:22 +01:00
util job execution now based on file last_update.txt on hdfs 2021-02-04 10:44:04 +01:00
xml original orcid xml data are stored in a field of the class that models orcid data 2020-12-09 09:45:19 +01:00
ActivitiesDecompressor.java added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs 2020-11-17 18:23:12 +01:00
ExtractXMLActivitiesData.java added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs 2020-11-17 18:23:12 +01:00
ExtractXMLSummariesData.java added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs 2020-11-17 18:23:12 +01:00
ORCIDToOAF.scala fixed log classes to make the ORCID test run 2020-06-09 18:07:14 +02:00
OrcidAuthorsDOIsDataGen.java separate workflow to parse orcid summaries, activities and generate dataset with no doi publications; test 2020-07-03 23:30:31 +02:00
OrcidDSManager.java first version of dataset successful generated from orcid dump 2020 2020-11-06 13:47:50 +01:00
SparkConvertORCIDToOAF.scala changed mapping ORCIDToOAF 2020-05-29 09:32:04 +02:00
SparkDownloadOrcidAuthors.java job execution now based on file last_update.txt on hdfs 2021-02-04 10:44:04 +01:00
SparkDownloadOrcidWorks.java job execution now based on file last_update.txt on hdfs 2021-02-04 10:44:04 +01:00
SparkGenLastModifiedSeq.java job execution now based on file last_update.txt on hdfs 2021-02-04 10:44:04 +01:00
SparkGenerateDoiAuthorList.java wf doi_authors generates one json data foreach row 2020-12-07 15:28:10 +01:00
SparkUpdateOrcidAuthors.java job execution now based on file last_update.txt on hdfs 2021-02-04 10:44:04 +01:00
SparkUpdateOrcidWorks.java job execution now based on file last_update.txt on hdfs 2021-02-04 10:44:04 +01:00
SummariesDecompressor.java added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs 2020-11-17 18:23:12 +01:00