.. |
json
|
original orcid xml data are stored in a field of the class that models orcid data
|
2020-12-09 09:45:19 +01:00 |
model
|
action to convert lambda file in seq file; spark action to download updated authors
|
2020-11-23 09:49:22 +01:00 |
util
|
job execution now based on file last_update.txt on hdfs
|
2021-02-04 10:44:04 +01:00 |
xml
|
original orcid xml data are stored in a field of the class that models orcid data
|
2020-12-09 09:45:19 +01:00 |
ActivitiesDecompressor.java
|
added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs
|
2020-11-17 18:23:12 +01:00 |
ExtractXMLActivitiesData.java
|
added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs
|
2020-11-17 18:23:12 +01:00 |
ExtractXMLSummariesData.java
|
added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs
|
2020-11-17 18:23:12 +01:00 |
ORCIDToOAF.scala
|
fixed log classes to make the ORCID test run
|
2020-06-09 18:07:14 +02:00 |
OrcidAuthorsDOIsDataGen.java
|
separate workflow to parse orcid summaries, activities and generate dataset with no doi publications; test
|
2020-07-03 23:30:31 +02:00 |
OrcidDSManager.java
|
first version of dataset successful generated from orcid dump 2020
|
2020-11-06 13:47:50 +01:00 |
SparkConvertORCIDToOAF.scala
|
changed mapping ORCIDToOAF
|
2020-05-29 09:32:04 +02:00 |
SparkDownloadOrcidAuthors.java
|
job execution now based on file last_update.txt on hdfs
|
2021-02-04 10:44:04 +01:00 |
SparkDownloadOrcidWorks.java
|
job execution now based on file last_update.txt on hdfs
|
2021-02-04 10:44:04 +01:00 |
SparkGenLastModifiedSeq.java
|
job execution now based on file last_update.txt on hdfs
|
2021-02-04 10:44:04 +01:00 |
SparkGenerateDoiAuthorList.java
|
wf doi_authors generates one json data foreach row
|
2020-12-07 15:28:10 +01:00 |
SparkUpdateOrcidAuthors.java
|
job execution now based on file last_update.txt on hdfs
|
2021-02-04 10:44:04 +01:00 |
SparkUpdateOrcidWorks.java
|
job execution now based on file last_update.txt on hdfs
|
2021-02-04 10:44:04 +01:00 |
SummariesDecompressor.java
|
added wf to extracting authors and works xml data from orcid dump to hdfs; added wf to download the lamda file (containing last orcid update informations) from orcid to hdfs
|
2020-11-17 18:23:12 +01:00 |