Commit Graph

61 Commits

Author SHA1 Message Date
Enrico Ottonello db2ad3f97e multithreads http client not needed on indexjobnode, maybe avoid 404 response after a few minutes, added log on each indexing operation 2020-08-12 15:47:55 +02:00
Enrico Ottonello 5c3ef5f25b retrieved body from server response 2020-07-24 18:32:28 +02:00
Enrico Ottonello e189283059 retrieved report on index results 2020-07-24 17:59:23 +02:00
Alessia Bardi 74d0f440e3 Merge branch 'master' of https://code-repo.d4science.org/D-Net/AriadnePlus 2020-06-23 10:32:11 +02:00
Alessia Bardi e5d1f845a3 updated policy 2020-06-23 10:32:04 +02:00
Enrico Ottonello db9b70feb0 new node and workflow to index on ES 2020-06-16 02:36:16 +02:00
Alessia Bardi 4d68ecad58 the job fails if the endpoint returns client errors 4xx 2020-06-05 16:19:54 +02:00
Alessia Bardi 30410e8cca need to enclose policy in CDATA, otherwise we get an error from the X3ML engine 'only whitespace content allowed before start tag and not C' 2020-06-05 16:02:31 +02:00
Alessia Bardi fa3707a56f updated policy 2020-06-05 15:47:25 +02:00
Alessia Bardi e5cfbd01af ADS plugin now able to process one single remote file 2020-06-05 15:20:21 +02:00
Alessia Bardi b220b9de4e fixed tests so they do not fail 2020-06-05 15:10:17 +02:00
Enrico Ottonello 819ef88520 new workflow to import periodo from url 2020-06-01 12:29:48 +02:00
Enrico Ottonello 30b3fa2140 New JobNode and workflow to enrich content on GraphDB 2020-05-29 16:19:55 +02:00
Enrico Ottonello 996150a315 modified access protocol parameter name (mandatory for aat and periodo), it is the name of the root node, inside the metadata record, that is generated converting original json record 2020-04-03 11:58:04 +02:00
Enrico Ottonello 7d6aa5de5e aat json collector that works on a folder 2020-02-19 16:22:22 +01:00
Enrico Ottonello 226cdde77d all records related to an api are published into the same graph 2020-02-19 14:33:54 +01:00
Enrico Ottonello a6b5a80984 modified generated xml according to new mapping 2020-02-14 16:56:34 +01:00
Enrico Ottonello c5945c386b plugin to collect one json file 2020-02-14 13:00:09 +01:00
Alessia Bardi 66360e6d20 browse by Ariadne subject 2020-02-06 12:14:41 +01:00
Enrico Ottonello 17552bafa0 foreach published api, the datasource name is stored into graph datasourceApis 2020-01-24 10:52:04 +01:00
Enrico Ottonello d3b4e6c864 added ads apis 2020-01-24 10:37:00 +01:00
Enrico Ottonello 8530e488d0 all records related to a datasource api are now published on graphdb into n named graphs because of out of memory issue 2020-01-14 16:55:45 +01:00
Enrico Ottonello e748d3d802 added progress status data on publishToGraphDB workflow node ui 2019-12-18 15:55:53 +01:00
Enrico Ottonello 7980449ac0 CloseableHttpClient instance has to be closed after all post has been executed, same handling for PoolingHttpClientConnectionManager instance shutdown 2019-12-17 15:35:36 +01:00
Alessia Bardi 11e3992ab4 slow down the task producer to avoid OOM errors. See here for more: https://stackoverflow.com/questions/42108351/executorservice-giving-out-of-memory-error 2019-12-16 18:52:08 +01:00
Alessia Bardi c770fc40e0 cleaning up: let's remove the classes we do not need anymore, like Virtuoso-related classes and specific plugins that we have created for Parthenos 2019-12-16 18:34:08 +01:00
Enrico Ottonello 5087750f96 rdf handling move away from this node 2019-12-16 15:32:20 +01:00
Enrico Ottonello 98b452c59a new node to elastic search index 2019-12-16 15:25:55 +01:00
Enrico Ottonello 0f6f2e7b75 after records publishing the datasourceApi informations are saved on GraphDB 2019-12-13 14:55:51 +01:00
Enrico Ottonello 2f5fb6fcb5 new module for graphdb publishing 2019-12-12 12:58:30 +01:00
Alessia Bardi 69778c19c9 INFO logging 2019-11-15 11:11:31 +01:00
Alessia Bardi 2ed3a388ac cleaned up pom and testing Ariadne mapping via URL 2019-11-15 10:44:10 +01:00
Enrico Ottonello d78122c64d removed unused wf parameter 2019-10-31 15:18:46 +01:00
Enrico Ottonello b3a8f05c26 x3m mapping is now only taken from url; ads policy aligned to aggregator instance; removed wf parameter unused 2019-10-31 14:08:37 +01:00
Miriam Baglioni 1dab575b58 Merge branch 'master' of https://code-repo.d4science.org/D-Net/AriadnePlus 2019-10-23 16:15:13 +03:00
Miriam Baglioni eaa3992227 changed the schema for the date element. added year as possible provided date. Now date fields have type year || date_optional_time 2019-10-23 16:14:46 +03:00
Enrico Ottonello 45f5b9abba removed unused folder, removed unused parameters from ads repository profile, modified ads xpath identifier, modified default base url on ads access protocol 2019-10-22 15:51:09 +02:00
Enrico Ottonello e968c5faa5 mapping url downloaded from remote url set in workflow parameters 2019-10-22 13:45:17 +02:00
Enrico Ottonello 4047d32587 all files under Access Protocol > Base URL folder are collected together 2019-10-21 15:43:29 +02:00
Miriam Baglioni fc43b5a85d Merge branch 'master' of https://code-repo.d4science.org/D-Net/AriadnePlus 2019-10-18 17:54:48 +02:00
Miriam Baglioni c7289ad497 added new schema for elasticsearch 7 2019-10-18 17:52:44 +02:00
Enrico Ottonello d5a631a138 ads policy updated 2019-10-08 16:10:54 +02:00
Enrico Ottonello 9e54f9b3a6 renamed policy and mapping files for ads 2019-10-08 16:08:02 +02:00
Enrico Ottonello 1280702100 added mapping and policy for ads, added ads schema for solr, added ads datasource; removed not used profiles 2019-09-25 15:47:30 +02:00
Alessia Bardi 07759bdd6b no need to show errors and minimal metadata for now 2019-09-22 18:23:43 +02:00
Alessia Bardi 90b6a4c1f3 fixed content checker config 2019-09-22 17:39:36 +02:00
Enrico Ottonello 329a636090 schema to ads indexing 2019-09-20 20:11:31 +02:00
Enrico Ottonello 75fb3752dc deleted publish/unpublish wf 2019-09-20 20:04:14 +02:00
Enrico Ottonello 1aa345559d added namespace 2019-09-20 20:03:45 +02:00
Enrico Ottonello b5ed4396c6 added ads repository profile 2019-09-20 12:07:02 +02:00