Commit Graph

94 Commits

Author SHA1 Message Date
Enrico Ottonello 290259fbc2 new collector plugin (xmlsFolder) based on ads one, adding a new parameter which represents the list of namespaces that has to be added to the generated xml record during collection step 2021-01-15 13:33:23 +01:00
Enrico Ottonello 35b81aeee4 Merge branch 'master' of https://code-repo.d4science.org/D-Net/AriadnePlus 2021-01-09 00:53:43 +01:00
Enrico Ottonello a14c66f36e added default values for has_temporal_coverage and has_spatial_coverage 2021-01-09 00:53:33 +01:00
Miriam Baglioni 472ee2754c Aggiornare 'dnet-ariadneplus/src/main/resources/es/port.py' 2021-01-08 16:52:26 +01:00
Miriam Baglioni 060f7f8673 Aggiornare 'dnet-ariadneplus/src/main/resources/es/port.py' 2021-01-08 16:51:44 +01:00
Miriam Baglioni 538be9e035 Aggiungi 'dnet-ariadneplus/src/main/resources/es/port.py' 2021-01-08 16:51:07 +01:00
Enrico Ottonello b8d0e3f741 enrichment queries set is now splitted on workflow node side, because of timeout error on very large collection 2021-01-07 23:40:47 +01:00
Enrico Ottonello 4673c143ce curl command to create catalog index with mapping for elastic search on production 2021-01-07 11:48:00 +01:00
Enrico Ottonello 1ca29a2ebf alignment with current elastic-test instance mapping 2021-01-07 11:35:33 +01:00
Enrico Ottonello a05d8e97da replicated with a new id for having different execution dates with aggregation_to_graphdb_wf.xml 2020-12-18 17:39:15 +01:00
Enrico Ottonello 3e68138b81 new wf to publish a collection on staging graphdb and then on public one 2020-12-18 16:20:38 +01:00
Enrico Ottonello dc6bd40d1e let isStart true only for collection node, avoiding parallelism on multiple nodes 2020-12-18 16:19:43 +01:00
Alessia Bardi 5f695ee538 fixed worng copy&paste, probably from an st file...again 2020-12-17 20:15:51 +01:00
Alessia Bardi 0e30eccbae fixed worng copy&paste, probably from an st file 2020-12-17 17:56:13 +01:00
Alessia Bardi 6c150be5b2 added missing param 2020-12-17 13:53:26 +01:00
Alessia Bardi e5eefef411 fixed param name 2020-12-16 16:09:40 +01:00
Alessia Bardi fa3592b1b3 added missing wf node 2020-12-15 17:06:52 +01:00
Alessia Bardi 3e28f8fd57 increasing mid version number 2020-12-14 18:57:36 +01:00
Alessia Bardi 4edfc4af05 new_agg_wf.xml annotated with comments to guide the (manual) migration of old workflows to the new one 2020-12-14 18:56:49 +01:00
Alessia Bardi 687d5ab9ec only one start node 2020-12-14 18:27:23 +01:00
Alessia Bardi be64ce5dcf changed workflow name ad description 2020-12-14 18:22:13 +01:00
Alessia Bardi 8a9782cb1e re-organising workflows and templates for staging/public instances of graphdb and elasticsearch 2020-12-14 17:49:53 +01:00
Enrico Ottonello 5d29077a43 added counter on wf progress information 2020-10-20 19:34:40 +02:00
Enrico Ottonello 8073ac7131 the node retrieves resources identifier list for collection and record, then loops on the 2 identifiers lists and request indexing to rest module 2020-10-20 17:43:27 +02:00
Enrico Ottonello 1398c2495e test for new indexing methods on rest module 2020-10-20 16:42:44 +02:00
Enrico Ottonello 0e96774895 changed http request method for indexing 2020-10-15 17:09:35 +02:00
Enrico Ottonello 25de647504 set KeepAlive true on used socket; replaced SimpleJobNode with AsyncJobNode 2020-10-13 15:34:43 +02:00
Enrico Ottonello 2e732a3b5b sparql insert for ads 1;replaced AsynchJobNode with SimpleJobNode to 404 problem 2020-10-10 18:05:02 +02:00
Enrico Ottonello 8b3f8896a5 removed progress bar handler attached to resultset client (not used here) 2020-10-10 16:53:37 +02:00
Enrico Ottonello db2ad3f97e multithreads http client not needed on indexjobnode, maybe avoid 404 response after a few minutes, added log on each indexing operation 2020-08-12 15:47:55 +02:00
Enrico Ottonello 5c3ef5f25b retrieved body from server response 2020-07-24 18:32:28 +02:00
Enrico Ottonello e189283059 retrieved report on index results 2020-07-24 17:59:23 +02:00
Alessia Bardi 74d0f440e3 Merge branch 'master' of https://code-repo.d4science.org/D-Net/AriadnePlus 2020-06-23 10:32:11 +02:00
Alessia Bardi e5d1f845a3 updated policy 2020-06-23 10:32:04 +02:00
Enrico Ottonello db9b70feb0 new node and workflow to index on ES 2020-06-16 02:36:16 +02:00
Alessia Bardi 4d68ecad58 the job fails if the endpoint returns client errors 4xx 2020-06-05 16:19:54 +02:00
Alessia Bardi 30410e8cca need to enclose policy in CDATA, otherwise we get an error from the X3ML engine 'only whitespace content allowed before start tag and not C' 2020-06-05 16:02:31 +02:00
Alessia Bardi fa3707a56f updated policy 2020-06-05 15:47:25 +02:00
Alessia Bardi e5cfbd01af ADS plugin now able to process one single remote file 2020-06-05 15:20:21 +02:00
Alessia Bardi b220b9de4e fixed tests so they do not fail 2020-06-05 15:10:17 +02:00
Enrico Ottonello 819ef88520 new workflow to import periodo from url 2020-06-01 12:29:48 +02:00
Enrico Ottonello 30b3fa2140 New JobNode and workflow to enrich content on GraphDB 2020-05-29 16:19:55 +02:00
Enrico Ottonello 996150a315 modified access protocol parameter name (mandatory for aat and periodo), it is the name of the root node, inside the metadata record, that is generated converting original json record 2020-04-03 11:58:04 +02:00
Enrico Ottonello 7d6aa5de5e aat json collector that works on a folder 2020-02-19 16:22:22 +01:00
Enrico Ottonello 226cdde77d all records related to an api are published into the same graph 2020-02-19 14:33:54 +01:00
Enrico Ottonello a6b5a80984 modified generated xml according to new mapping 2020-02-14 16:56:34 +01:00
Enrico Ottonello c5945c386b plugin to collect one json file 2020-02-14 13:00:09 +01:00
Alessia Bardi 66360e6d20 browse by Ariadne subject 2020-02-06 12:14:41 +01:00
Enrico Ottonello 17552bafa0 foreach published api, the datasource name is stored into graph datasourceApis 2020-01-24 10:52:04 +01:00
Enrico Ottonello d3b4e6c864 added ads apis 2020-01-24 10:37:00 +01:00