Alessia Bardi
|
4c1474e693
|
Dealing with #6859#note-2: we have to decode URLs to avoid & and other chars encoded becasue of the original XML representation of data
|
2021-08-20 17:03:30 +02:00 |
Miriam Baglioni
|
5f8ccbc365
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-08-20 11:13:47 +02:00 |
Miriam Baglioni
|
882abb40e4
|
CrossrefDump -
|
2021-08-20 11:12:53 +02:00 |
Miriam Baglioni
|
45c62609af
|
CrossrefDump - modified because parameter file was moved
|
2021-08-20 11:12:31 +02:00 |
Miriam Baglioni
|
35880c0e7b
|
CrossrefDump - changed the wf to be able to resume from one of the steps
|
2021-08-20 11:11:35 +02:00 |
Miriam Baglioni
|
f3b6c392c1
|
CrossrefDump - moving parameter file under folder crossref_dump_reader
|
2021-08-20 11:10:58 +02:00 |
Miriam Baglioni
|
65822400ce
|
CrossrefDump - added new parameter file that was missing
|
2021-08-20 11:10:35 +02:00 |
Alessia Bardi
|
a053e1513c
|
different funders in blacklist from BETA and PROD aggregator
|
2021-08-19 11:32:27 +02:00 |
Alessia Bardi
|
812bd54c57
|
different funders in blacklist from BETA and PROD aggregator
|
2021-08-19 11:30:14 +02:00 |
Miriam Baglioni
|
a65d3caaea
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2021-08-19 10:29:10 +02:00 |
Miriam Baglioni
|
e5cf11d088
|
change open access route to result matching hbm to gold
|
2021-08-19 10:29:04 +02:00 |
Claudio Atzori
|
7c0c67bdd6
|
added mock pom
|
2021-08-13 17:45:53 +02:00 |
Claudio Atzori
|
82086f3422
|
fixed directory name
|
2021-08-13 17:42:14 +02:00 |
Claudio Atzori
|
bc7068106c
|
added crossref download oozie workflow
|
2021-08-13 17:19:44 +02:00 |
Claudio Atzori
|
2c0a05f11a
|
manually merged PR#139
|
2021-08-13 17:15:53 +02:00 |
Claudio Atzori
|
d43667d857
|
Merge pull request 'Automatic download of Crossref' (#138) from crossref_dw_wf into beta
Reviewed-on: D-Net/dnet-hadoop#138
|
2021-08-13 17:10:10 +02:00 |
Miriam Baglioni
|
5856ca8a7b
|
merging with branch beta - resolved conflicts
|
2021-08-13 16:45:45 +02:00 |
Miriam Baglioni
|
6fec71e8d2
|
removed the specific of the infra we are running the wf from the wf name
|
2021-08-13 16:39:02 +02:00 |
Miriam Baglioni
|
ed7e28490a
|
change in sh
|
2021-08-13 16:19:01 +02:00 |
Claudio Atzori
|
7743d0f919
|
consolidated dnet wf profiles into the same submodule
|
2021-08-13 16:14:54 +02:00 |
Miriam Baglioni
|
6eb7508995
|
mergin with branch beta
|
2021-08-13 16:07:04 +02:00 |
Claudio Atzori
|
f74adc4752
|
added DownloadCSV2 as alternative implementation of the same download procedure
|
2021-08-13 15:52:15 +02:00 |
Claudio Atzori
|
5f0903d50d
|
fixed CSV downloader & tests
|
2021-08-13 14:17:54 +02:00 |
Claudio Atzori
|
17cefe6a97
|
[HBM] removed stale replace option
|
2021-08-13 12:43:59 +02:00 |
Claudio Atzori
|
7ee2757fcd
|
fixed DownloadCSV parameters spec; workflow patching the hostedby replaces the graph content (publication, datasource) rather than creating a copy
|
2021-08-13 12:41:01 +02:00 |
Claudio Atzori
|
c3ad4ab701
|
minor fixes
|
2021-08-13 12:23:15 +02:00 |
Claudio Atzori
|
baed5e3337
|
test classes moved in specific components
|
2021-08-13 12:14:47 +02:00 |
Claudio Atzori
|
3359f73fcf
|
cleanup & best practices
|
2021-08-13 12:00:42 +02:00 |
Miriam Baglioni
|
f4ec81c92c
|
mergin with branch beta
|
2021-08-13 10:31:35 +02:00 |
Miriam Baglioni
|
dc8b05b39e
|
Hosted By Map - changed the association with the datasource id for the hostedby element: there is no more the need to compute it. With the new HBM it is already the id in the graph
|
2021-08-13 10:18:25 +02:00 |
Miriam Baglioni
|
32fd75691f
|
refactoring
|
2021-08-13 10:15:42 +02:00 |
Miriam Baglioni
|
01db1f8bc4
|
GetCSV refactoring - removed not needed import
|
2021-08-13 10:14:17 +02:00 |
Miriam Baglioni
|
964a46ca21
|
GetCSV refactoring - modified due to movement of classes
|
2021-08-13 10:11:18 +02:00 |
Miriam Baglioni
|
eaf077fc34
|
GetCSV refactoring - removed not needed dependency
|
2021-08-13 10:08:58 +02:00 |
Miriam Baglioni
|
5f674efb0c
|
moved dependency version in external pom
|
2021-08-13 10:07:53 +02:00 |
Miriam Baglioni
|
5cd5714530
|
GetCSV refactoring - added ignore annotation for fields not in input csv
|
2021-08-13 10:06:49 +02:00 |
Miriam Baglioni
|
ed183d878e
|
GetCSV refactoring - modified test classes due to change in the model of projects and programme
|
2021-08-13 09:28:51 +02:00 |
Miriam Baglioni
|
8769dd8eef
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:20:56 +02:00 |
Miriam Baglioni
|
6b9e1bf2e3
|
GetCSV refactoring - removing not needed dependency
|
2021-08-12 18:17:50 +02:00 |
Miriam Baglioni
|
d57b2bb927
|
GetCSV refactoring - removing not needed dependency
|
2021-08-12 18:12:51 +02:00 |
Miriam Baglioni
|
9da74b544a
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:12:15 +02:00 |
Miriam Baglioni
|
ab8abd61bb
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:11:07 +02:00 |
Miriam Baglioni
|
335a824e34
|
GetCSV refactoring - fixed issue
|
2021-08-12 18:10:10 +02:00 |
Miriam Baglioni
|
f0845e9865
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:04:58 +02:00 |
Miriam Baglioni
|
7a789423aa
|
GetCSV refactoring - refactoring due to movement of classes
|
2021-08-12 18:04:27 +02:00 |
Miriam Baglioni
|
e9fc3ef3bc
|
GetCSV refactoring - changed to use the new class to get and write the csv file
|
2021-08-12 18:03:41 +02:00 |
Miriam Baglioni
|
4317211a2b
|
GetCSV refactoring - refactoring due to movement
|
2021-08-12 18:03:14 +02:00 |
Miriam Baglioni
|
b62cd656a7
|
GetCSV refactoring - changed the model to store only the information needed
|
2021-08-12 18:01:10 +02:00 |
Miriam Baglioni
|
d36e925277
|
GetCSV refactoring - moved under model package
|
2021-08-12 18:00:21 +02:00 |
Miriam Baglioni
|
6e84b3951f
|
GetCSV refactoring - moving classes to dhp-common that have dependency with GetCSV class (that was located in graph-mapper)
|
2021-08-12 17:57:41 +02:00 |