Commit Graph

82 Commits

Author SHA1 Message Date
Enrico Ottonello c7673f7ba1 lowercase to period name on native and temporal fields 2021-10-26 17:49:22 +02:00
Enrico Ottonello 0010458a23 modified log level for a few message 2021-10-22 12:32:04 +02:00
Enrico Ottonello 4a4da01d6b added titlecase on temporal and native period name 2021-10-22 11:38:19 +02:00
Enrico Ottonello 0f7e7de918 handling for title as elastic search object with text and language fields, same handling for description 2021-10-07 11:55:57 +02:00
Enrico Ottonello 0693645c68 added precision information to spatial data 2021-10-04 15:13:16 +02:00
Enrico Ottonello 3510d7d1be Merge branch 'new_es_mapping'
# Conflicts:
#	dnet-ariadneplus-graphdb-publisher/src/main/resources/eu/dnetlib/ariadneplus/sparql/read_collection_data_template.sparql
#	dnet-ariadneplus-graphdb-publisher/src/main/resources/eu/dnetlib/ariadneplus/sparql/read_record_data_template.sparql
#	dnet-ariadneplus-graphdb-publisher/test/java/eu/dnetlib/ariadneplus/GraphDbReaderAndESIndexTest.java
2021-09-08 15:56:12 +02:00
Alessia Bardi d169e1fad8 Merge branch 'master' of https://code-repo.d4science.org/D-Net/AriadnePlus 2021-08-26 17:59:35 +02:00
Alessia Bardi ea3b749792 test with read-only mode connection 2021-08-26 17:59:32 +02:00
Enrico Ottonello 42eb9604a5 new class to handle publish on graphdb operation, transaction throws OOM so currently one write operation at time is executed without transaction 2021-07-27 14:58:00 +02:00
Enrico Ottonello 9e17c18c2f updated graphdb-free-runtime to version 9.8.0, updated publisher version 2021-07-20 12:43:09 +02:00
Enrico Ottonello 8944977a17 added digitalImage support 2021-06-28 12:50:39 +02:00
Enrico Ottonello 6c8c9fcba3 added fields has_type and is_about 2021-06-23 14:56:25 +02:00
Enrico Ottonello 4882e38d64 added new fields from es mapping: agent (homepage,institution,agentIdentifier) and wasCreated 2021-06-23 11:26:56 +02:00
Enrico Ottonello 89f309feaa added mapping for polygon data when aocat:has_polygonal_representation is in wkt format (snd::zip) 2021-06-22 11:06:56 +02:00
Enrico Ottonello 41896d312b fix for from and until values on temporal and native period fields 2021-06-21 15:20:39 +02:00
Enrico Ottonello 54b76c79a0 added polygon support; the correct order of the geopoints describing the polygon is needed 2021-06-18 19:32:12 +02:00
Enrico Ottonello 8ed4b8b08d added spatial boundingbox support using a 4 sides polygon wkt format 2021-06-17 17:59:51 +02:00
Enrico Ottonello e91d82f32c model classes according to new es mapping; creation and indexing of a record with geopoint data 2021-06-15 23:34:59 +02:00
Enrico Ottonello 1545f3900f get specific error for elastic search record parsing error 2021-05-31 16:43:21 +02:00
Enrico Ottonello 5682cc4bd2 added multiple descriptions for arup 2021-04-02 12:46:07 +02:00
Enrico Ottonello f2425ffa2d removed spatial with empty data; removed duplicates on spatial list 2021-03-04 10:59:54 +01:00
Enrico Ottonello 84993ffaba matching periodo name (if any) overwrites native period name 2021-02-16 18:19:30 +01:00
Enrico Ottonello e5b833169b fix multiple has_spatial_coverage issue 2021-02-16 10:51:14 +01:00
Enrico Ottonello a7a1a466b8 fix year date parsing 2021-01-26 16:39:47 +01:00
Enrico Ottonello 9565265f3d added Rock Art mapping; fixed Site/monument mapping 2021-01-08 15:32:46 +01:00
Enrico Ottonello 8baf2d7941 added BST date format handling parsing, before creating es record 2020-11-30 13:27:28 +01:00
Enrico Ottonello 075490f4da removed duplicated log 2020-10-30 17:29:18 +01:00
Enrico Ottonello 529151828f when IRI created from periodo url is wrong, wf has to fail 2020-10-30 17:08:46 +01:00
Enrico Ottonello 665a717ca0 added check on identifier, before indexing 2020-10-23 12:18:58 +02:00
Enrico Ottonello 240be91c5e added rest method to index single record by its identifier 2020-10-19 12:20:56 +02:00
Enrico Ottonello 9174f4df14 add graphdb connection setup 2020-10-16 19:56:14 +02:00
Enrico Ottonello f88231ba18 added new methods to retrieve resource identifiers and to start indexing by resource identifiers 2020-10-16 19:37:21 +02:00
Enrico Ottonello 0e96774895 changed http request method for indexing 2020-10-15 17:09:35 +02:00
Enrico Ottonello 263c1beb7e fix date parsing on es record creation 2020-10-13 00:31:17 +02:00
Alessia Bardi a452133ac3 use the new values statically inserted by the construct query 2020-10-12 18:32:56 +02:00
Alessia Bardi 83ecd94c6f uncommented logs and set them in debug mode 2020-10-12 18:15:15 +02:00
Enrico Ottonello 2aafbc8506 fix loop condition 2020-10-10 01:03:41 +02:00
Enrico Ottonello fe614ce2ed added retry loop on HTTPQueryEvaluationException (Heap space almost full) 2020-10-10 00:55:26 +02:00
Enrico Ottonello 142a296166 changed exception type, if collection indexing failes 2020-09-24 12:53:08 +02:00
Enrico Ottonello 814c908663 collection is now indexed before records, if there is an error on collection indexing, wf failed immediately; enriched query for 397 collection 2020-09-24 12:35:59 +02:00
Alessia Bardi 12c80713d7 indexing result per record logged in debug mode instead of info 2020-09-07 15:45:26 +02:00
Alessia Bardi 832b95f064 parse must succeed if the date is already in the expected format 2020-09-07 15:37:40 +02:00
Alessia Bardi 19ebb63d88 fixed log 2020-09-07 15:00:09 +02:00
Alessia Bardi f1e3de1b07 avoid null pointer exception 2020-09-07 14:59:54 +02:00
Alessia Bardi 4fb5bb2941 it is legal to have no individual resources or no collections: let's index what we have 2020-09-04 16:57:38 +02:00
Enrico Ottonello db2ad3f97e multithreads http client not needed on indexjobnode, maybe avoid 404 response after a few minutes, added log on each indexing operation 2020-08-12 15:47:55 +02:00
Alessia Bardi 8329f6e9dc useless if removed 2020-08-07 16:40:21 +02:00
Alessia Bardi ca4ad7ea3b Need to split the multiple inserts and perform separate transactions in GraphDB Free edition. See https://stackoverflow.com/questions/54428161/graphdb-read-check-and-update-in-a-transaction 2020-08-07 14:54:15 +02:00
Alessia Bardi 69ca7eaade removed redundand lines of code 2020-08-07 12:44:36 +02:00
Enrico Ottonello 220c6be6db report returned to wf node 2020-07-24 13:15:27 +02:00