Claudio Atzori
|
1726f49790
|
code formatting
|
2023-12-15 10:37:02 +01:00 |
Claudio Atzori
|
c381bacee0
|
[enrichment] passing the community API base URL
|
2023-12-07 14:07:11 +01:00 |
Miriam Baglioni
|
336fb31d87
|
[community_result_propagation] adjusting starting poit of workflow
|
2023-12-07 10:27:25 +01:00 |
Miriam Baglioni
|
c0cde53bf6
|
[bulktagging] setting first step of bulktaggin as the copy of the entities and relations not involved in the tagging'
|
2023-12-07 10:08:35 +01:00 |
Claudio Atzori
|
c5b7253130
|
[community_organization propagation] fixed workflow parameters
|
2023-12-05 09:13:33 +01:00 |
Claudio Atzori
|
3c3bdb8318
|
[bulktagging] fixed workflow parameters
|
2023-12-05 09:08:48 +01:00 |
Miriam Baglioni
|
48e0427a23
|
changed the parameter from production to baseURL. Fixed issue in tagging configuration
|
2023-11-27 15:10:27 +01:00 |
Miriam Baglioni
|
b177cd5a0a
|
Project propagation via communityAPI instead of using IS via IIS
|
2023-11-14 16:25:09 +01:00 |
Miriam Baglioni
|
5c5a195e97
|
refactoring and fixing issue on property name
|
2023-10-23 11:26:17 +02:00 |
Miriam Baglioni
|
34358afe75
|
modified resource file, workflow anf default-config. Add 3g of memory Overhead and specified the shuffle partition in the wf confiduration. Removed the multiple instantiation in the wf because of different implementation of the spark job
|
2023-10-20 15:48:27 +02:00 |
Miriam Baglioni
|
a4214ced1e
|
fixing issue on propagation organization. added --config to workflow definition. added oozie_app to communtiy project
|
2023-10-20 10:14:20 +02:00 |
Sandro La Bruzzo
|
a5a89a702f
|
new spark parrameter updated
|
2023-10-16 11:46:12 +02:00 |
Miriam Baglioni
|
159388f9c2
|
testing and fix some issues
|
2023-10-16 11:26:07 +02:00 |
Miriam Baglioni
|
89184d5b4f
|
used the API instead of the IS for bulktagging and propagation for community through organization. Added a new propagation step for communities through projects. Still using the API and not the IS
|
2023-10-11 18:17:35 +02:00 |
Miriam Baglioni
|
3d6be20989
|
changes to use the API instead of the IS the get the information for the communities to be used during bulktagging and context propagation
|
2023-10-09 14:26:33 +02:00 |
Claudio Atzori
|
da0e9828f7
|
resolved conflicts for PR#337
|
2023-09-06 11:28:46 +02:00 |
Claudio Atzori
|
f3a85e224b
|
merged from branch beta the bulk tagging (single step, negative constraints), the cleanig worflow (single step, pid type based cleaning), instance level fulltext
|
2023-06-28 13:33:57 +02:00 |
Claudio Atzori
|
50d7dc0078
|
[graph enrichment] fixed projectOrganizationPath not being passed to the apply_resulttoorganization_propagation node
|
2023-06-19 15:42:44 +02:00 |
Claudio Atzori
|
fbd9bf704e
|
indent
|
2023-06-19 15:41:22 +02:00 |
Claudio Atzori
|
55f002f1e9
|
Merge branch 'beta' into propagationProjectThroughParentChils
|
2023-06-12 09:56:53 +02:00 |
Miriam Baglioni
|
0389b57ca7
|
added propagation for project to organization
|
2023-05-31 11:06:58 +02:00 |
Miriam Baglioni
|
34172455d1
|
[BulkTag] Adding remove constraints to specify when a community must not appear in the context of a result.
|
2023-05-24 09:56:23 +02:00 |
Miriam Baglioni
|
efc4f6a658
|
[bulkTag] refactor to enrich each result single step
|
2023-04-18 17:39:31 +02:00 |
Miriam Baglioni
|
932d07d2dd
|
[bulkTag] added filtering for datasources in eosctag
|
2023-04-06 15:08:27 +02:00 |
Miriam Baglioni
|
b25b401065
|
added test to verify the advconstraints to dth community. inserted some additional logs.
|
2023-04-05 12:18:39 +02:00 |
Claudio Atzori
|
d05ca53a14
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2023-01-31 14:39:53 +01:00 |
Miriam Baglioni
|
e82e009b46
|
added missing close tag for XML produced by the xquery to get information for the community from the IS
|
2023-01-31 10:19:34 +01:00 |
Claudio Atzori
|
505867bce9
|
[bulk tagging] better node naming
|
2023-01-20 16:13:16 +01:00 |
Claudio Atzori
|
1b37516578
|
[bulk tagging] better node naming
|
2023-01-20 16:11:26 +01:00 |
Miriam Baglioni
|
ecd398fe51
|
refactoring
|
2023-01-20 14:23:45 +01:00 |
Miriam Baglioni
|
6674cccb94
|
[BulkTag] description of parameters more comprehensive for those who do not implement it
|
2022-12-16 15:33:20 +01:00 |
Miriam Baglioni
|
f37113a941
|
[BulkTag] moving xquery to get community configuration in dedicated file
|
2022-12-16 15:32:26 +01:00 |
Miriam Baglioni
|
3d99b78d94
|
[Cleaning] fixed error in parameter (workingPath to workingDir)
|
2022-12-08 10:25:02 +01:00 |
Claudio Atzori
|
ed64618235
|
increased spark.sql.shuffle.partitions in the last join phase of the result (publication) to community through semantic relation propagation
|
2022-11-18 16:06:51 +01:00 |
Claudio Atzori
|
8742934843
|
added spark.sql.shuffle.partitions in the last join phase of the result to community through semantic relation propagation
|
2022-11-18 11:32:22 +01:00 |
Miriam Baglioni
|
840465958b
|
[EOSC BulkTag] filtering aout the datasources registered in the eosc with compatibility different from 3.0, 4.0 for literature, data and CRIS to add the context eosc to the results
|
2022-09-20 10:30:41 +02:00 |
Miriam Baglioni
|
1c82acb168
|
[EOSC Context Tagging] refactoring: moved EOSC IF tagging in package eosc under bulkTag
|
2022-07-25 14:26:39 +02:00 |
Miriam Baglioni
|
627332526b
|
[EOSC context TAG] workflow start from reset_outputpath action
|
2022-07-22 14:55:11 +02:00 |
Miriam Baglioni
|
7a1c1b6f53
|
[EOSC context TAG] Add test class and resourcesK
|
2022-07-22 14:36:02 +02:00 |
Miriam Baglioni
|
8a72de4011
|
[EOSCTag] modified workflow to execute all the steps and not only the last one
|
2022-05-04 10:10:56 +02:00 |
Miriam Baglioni
|
3aeedd931a
|
[EOSCTag] fixed issue in case description is null. Modified test resources and classes
|
2022-05-04 10:06:38 +02:00 |
Miriam Baglioni
|
7cb7066472
|
[EoscTag] first "rough" implementation
|
2022-04-22 10:44:17 +02:00 |
Claudio Atzori
|
48b580b45c
|
[graph enrichment] fixed country_propagation oozie workflow definition, parameter saveGraph is not needed anymore by the SparkCountryPropagationJob
|
2022-04-11 08:52:36 +02:00 |
Miriam Baglioni
|
7b8f85692e
|
[Enrichment country] fixed issues with parameters and workflow args
|
2022-03-23 17:20:23 +01:00 |
Claudio Atzori
|
f10066547b
|
increased spark.sql.shuffle.partitions in affiliation_from_semrel_propagation
|
2022-03-23 12:22:26 +01:00 |
Miriam Baglioni
|
2b643059fa
|
[Country Propagation] changed the logic to get the collectedfrom at the result level. To fix issue when no instance is created for a result that should have the country associated. Change the code to use spark instead of hive to prepare the data needed for the propagation step. Added new tests for the intermediate steps and new verification for the propagation itself
|
2022-03-11 13:56:48 +01:00 |
Miriam Baglioni
|
064f9bbd87
|
[AFFPropSR] added new paprameter for the number of iterations and new code for just one iteration
|
2022-01-07 18:58:51 +01:00 |
Miriam Baglioni
|
28ea532ece
|
[Affilaition Propagation] moved the selection of graph relation as a preparation step
|
2021-11-16 15:24:19 +01:00 |
Miriam Baglioni
|
b9d124bb7c
|
[Enrichment: Propagation through parent-child relationships] Added counters, and changed constraint to verify if filtering out the relation (from classname = harvested to classid != propagation)
|
2021-11-03 13:55:37 +01:00 |
Miriam Baglioni
|
09f36cffb8
|
[Enrichment: Propagation through parent-child relationships] First implementation, testing, and wf for propagation of result to organization through semantic relation
|
2021-10-29 11:20:03 +02:00 |