Claudio Atzori
|
5f1ed61c1f
|
merging from bulkTag branch
|
2023-11-03 12:51:37 +01:00 |
Miriam Baglioni
|
f206ff42d6
|
modified code to use the the API. Removing not needed parameters. Rewritten the code to exploit the parallel stream on the entity types
|
2023-10-20 15:49:41 +02:00 |
Claudio Atzori
|
b0fed1725e
|
avoid NPEs
|
2023-10-19 12:13:45 +02:00 |
Claudio Atzori
|
f3a85e224b
|
merged from branch beta the bulk tagging (single step, negative constraints), the cleanig worflow (single step, pid type based cleaning), instance level fulltext
|
2023-06-28 13:33:57 +02:00 |
Miriam Baglioni
|
efc4f6a658
|
[bulkTag] refactor to enrich each result single step
|
2023-04-18 17:39:31 +02:00 |
Claudio Atzori
|
0cb1c70788
|
code formatting
|
2022-07-01 10:44:08 +02:00 |
Miriam Baglioni
|
c5a863132c
|
[BulkTagging] revert it
|
2022-04-14 14:14:13 +02:00 |
Miriam Baglioni
|
8e8933d41a
|
[BulkTagging] added fix if result.dataInfo is null
|
2022-04-14 09:04:24 +02:00 |
Claudio Atzori
|
23b8883ab1
|
applied intellij code cleanup
|
2021-05-14 10:58:12 +02:00 |
Claudio Atzori
|
55595d7235
|
HACK: patch NULL values with defaults found in result.datainfo.deletedbyinference and result.context
|
2020-05-26 10:28:35 +02:00 |
Claudio Atzori
|
ab37953332
|
added global properties in wf definitions to avoid repeating name-node and job-tracker in the (many) distcp actions; reintroduced output directory removal at the beginning of each spark action
|
2020-05-14 10:25:41 +02:00 |
Claudio Atzori
|
c6b028f2af
|
code formatting
|
2020-05-11 17:38:08 +02:00 |
Claudio Atzori
|
6d0b11252e
|
bulktagging wfs moved into common dhp-enrichment module
|
2020-05-11 17:32:06 +02:00 |