Giambattista Bloisi | 15ba3cf202 | Provide paths as dag configuration parameters | 2024-10-22 10:18:03 +02:00
Giambattista Bloisi | 73e78d6877 | Add workflow with all graph construction steps | 2024-10-21 21:36:20 +02:00
Giambattista Bloisi | aae37058f7 | Increase memory | 2024-10-21 20:30:31 +02:00
Giambattista Bloisi | 131f6e5592 | enable dynamic allocation | 2024-10-21 20:23:31 +02:00
Giambattista Bloisi | df46c8c65f | Added ORCID enrichment workflows | 2024-10-21 18:55:41 +02:00
Giambattista Bloisi | 034a01542a | Implement consistency workflow | 2024-10-21 15:33:55 +02:00
Giambattista Bloisi | c6fbfd3f0a | Remove numpartitions argument where not needed | 2024-10-21 14:30:40 +02:00
Giambattista Bloisi | ae89274ce4 | implement whole scan pipeline | 2024-10-21 14:10:28 +02:00
Giambattista Bloisi | 0a2956d81f | reduce executor cores | 2024-10-19 17:59:35 +02:00
Giambattista Bloisi | 48f688cda9 | add deps jar | 2024-10-19 11:13:21 +02:00
Giambattista Bloisi | c5f4263061 | update spark-version | 2024-10-19 11:09:24 +02:00
Giambattista Bloisi | ba3f351736 | print existing files | 2024-10-19 10:26:18 +02:00
Giambattista Bloisi | 448bb924ab | add test dedup task | 2024-10-19 00:18:00 +02:00
Giambattista Bloisi | bf7c9e2dce | revert some changes | 2024-10-18 17:16:37 +02:00
Giambattista Bloisi | 8da265f018 | add utils in the parent folder | 2024-10-18 17:00:51 +02:00
Giambattista Bloisi | 0fcabed2ae | change dag name | 2024-10-18 16:58:42 +02:00
Giambattista Bloisi | c3ba29e4c5 | Add dagutils | 2024-10-18 16:53:14 +02:00
Giambattista Bloisi | 412e008df7 | Add untar task | 2024-10-18 16:42:54 +02:00
Sandro La Bruzzo | df6e23666e | fix | 2024-10-16 16:35:01 +02:00
Sandro La Bruzzo | d1afcd4395 | fixed import | 2024-10-16 14:08:00 +02:00
Sandro La Bruzzo | dcd2efd3b4 | added workflow test | 2024-10-16 13:56:50 +02:00
Sandro La Bruzzo | 6b555b8f6e | added workflow test | 2024-10-16 13:56:36 +02:00
Sandro La Bruzzo | b8bf21f8e5 | fixed import | 2024-10-16 13:51:49 +02:00
Sandro La Bruzzo | 07ce192207 | added workflow test | 2024-10-16 13:38:26 +02:00