Claudio Atzori
|
1726f49790
|
code formatting
|
2023-12-15 10:37:02 +01:00 |
Miriam Baglioni
|
c0cde53bf6
|
[bulktagging] setting first step of bulktaggin as the copy of the entities and relations not involved in the tagging'
|
2023-12-07 10:08:35 +01:00 |
Claudio Atzori
|
3c3bdb8318
|
[bulktagging] fixed workflow parameters
|
2023-12-05 09:08:48 +01:00 |
Miriam Baglioni
|
5c5a195e97
|
refactoring and fixing issue on property name
|
2023-10-23 11:26:17 +02:00 |
Miriam Baglioni
|
34358afe75
|
modified resource file, workflow anf default-config. Add 3g of memory Overhead and specified the shuffle partition in the wf confiduration. Removed the multiple instantiation in the wf because of different implementation of the spark job
|
2023-10-20 15:48:27 +02:00 |
Miriam Baglioni
|
89184d5b4f
|
used the API instead of the IS for bulktagging and propagation for community through organization. Added a new propagation step for communities through projects. Still using the API and not the IS
|
2023-10-11 18:17:35 +02:00 |
Claudio Atzori
|
f3a85e224b
|
merged from branch beta the bulk tagging (single step, negative constraints), the cleanig worflow (single step, pid type based cleaning), instance level fulltext
|
2023-06-28 13:33:57 +02:00 |
Miriam Baglioni
|
efc4f6a658
|
[bulkTag] refactor to enrich each result single step
|
2023-04-18 17:39:31 +02:00 |
Miriam Baglioni
|
932d07d2dd
|
[bulkTag] added filtering for datasources in eosctag
|
2023-04-06 15:08:27 +02:00 |
Miriam Baglioni
|
b25b401065
|
added test to verify the advconstraints to dth community. inserted some additional logs.
|
2023-04-05 12:18:39 +02:00 |
Claudio Atzori
|
d05ca53a14
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2023-01-31 14:39:53 +01:00 |
Claudio Atzori
|
505867bce9
|
[bulk tagging] better node naming
|
2023-01-20 16:13:16 +01:00 |
Claudio Atzori
|
1b37516578
|
[bulk tagging] better node naming
|
2023-01-20 16:11:26 +01:00 |
Miriam Baglioni
|
ecd398fe51
|
refactoring
|
2023-01-20 14:23:45 +01:00 |
Miriam Baglioni
|
840465958b
|
[EOSC BulkTag] filtering aout the datasources registered in the eosc with compatibility different from 3.0, 4.0 for literature, data and CRIS to add the context eosc to the results
|
2022-09-20 10:30:41 +02:00 |
Miriam Baglioni
|
1c82acb168
|
[EOSC Context Tagging] refactoring: moved EOSC IF tagging in package eosc under bulkTag
|
2022-07-25 14:26:39 +02:00 |
Miriam Baglioni
|
627332526b
|
[EOSC context TAG] workflow start from reset_outputpath action
|
2022-07-22 14:55:11 +02:00 |
Miriam Baglioni
|
7a1c1b6f53
|
[EOSC context TAG] Add test class and resourcesK
|
2022-07-22 14:36:02 +02:00 |
Miriam Baglioni
|
8a72de4011
|
[EOSCTag] modified workflow to execute all the steps and not only the last one
|
2022-05-04 10:10:56 +02:00 |
Miriam Baglioni
|
3aeedd931a
|
[EOSCTag] fixed issue in case description is null. Modified test resources and classes
|
2022-05-04 10:06:38 +02:00 |
Miriam Baglioni
|
7cb7066472
|
[EoscTag] first "rough" implementation
|
2022-04-22 10:44:17 +02:00 |
Claudio Atzori
|
ab37953332
|
added global properties in wf definitions to avoid repeating name-node and job-tracker in the (many) distcp actions; reintroduced output directory removal at the beginning of each spark action
|
2020-05-14 10:25:41 +02:00 |
Claudio Atzori
|
ec0782e582
|
renamed jar containing the bulktagging and propagation workflows from dhp-[bulktagging|propagation] to dhp-enrichment; adjusted xml formatting
|
2020-05-12 15:49:28 +02:00 |
Claudio Atzori
|
6d0b11252e
|
bulktagging wfs moved into common dhp-enrichment module
|
2020-05-11 17:32:06 +02:00 |