Claudio Atzori
|
14539f9c8b
|
[graph provision] publicFormat workflow parameter defined as optional
|
2024-06-28 14:55:18 +02:00 |
Claudio Atzori
|
1bc8c5d173
|
[graph provision] fixed serialization of the instancetypes
|
2024-06-28 14:54:28 +02:00 |
Claudio Atzori
|
1ccf01cdb8
|
Using the updated Solr JSON payload model classes
|
2024-06-28 12:38:07 +02:00 |
Claudio Atzori
|
1c30eacac2
|
updated index feeding procedure to exploit the collection aliases
|
2024-06-25 15:27:38 +02:00 |
Claudio Atzori
|
6055212f77
|
merged from the json_payload branch
|
2024-06-25 12:39:02 +02:00 |
Serafeim Chatzopoulos
|
9f6e16a03c
|
Add support to create/update solr collection aliases
|
2024-06-20 16:03:15 +03:00 |
Claudio Atzori
|
f70dc76b61
|
minor
|
2024-06-06 10:43:10 +02:00 |
Claudio Atzori
|
da5c1e73a4
|
Merge pull request 'Irish oaipmh exporter' (#443) from irish-oaipmh-exporter into beta
Reviewed-on: D-Net/dnet-hadoop#443
|
2024-06-05 10:55:09 +02:00 |
Claudio Atzori
|
81090ad593
|
[IE OAIPMH] added oozie workflow, minor changes, code formatting
|
2024-06-05 10:03:33 +02:00 |
Claudio Atzori
|
0d5bdb2db0
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into beta
|
2024-05-27 11:59:02 +02:00 |
Sandro La Bruzzo
|
66c1ffc866
|
merged again from beta (I hope for the last time)
|
2024-05-22 11:02:46 +02:00 |
Claudio Atzori
|
834461ba26
|
[graph provision] fixed wf definition, revised serialization of the usage counts measures
|
2024-05-21 13:48:06 +02:00 |
Claudio Atzori
|
92f018d196
|
[graph provision] fixed path pointing to an intermediate data store in the working directory
|
2024-05-15 15:39:18 +02:00 |
Claudio Atzori
|
0611c81a2f
|
[graph provision] using Qualifier.classNames to populate the corresponding fields in the JSON payload
|
2024-05-15 15:33:10 +02:00 |
Michele Artini
|
2b3b5fe9a1
|
oai finalization and test
|
2024-05-15 14:13:16 +02:00 |
Claudio Atzori
|
1efe7f7e39
|
[graph provision] upgrade to dhp-schema:6.1.2, included project.oamandatepublications in the JSON payload mapping, fixed serialisation of the usageCounts measures
|
2024-05-14 12:39:31 +02:00 |
Claudio Atzori
|
55f39f7850
|
[graph provision] adds the possibility to validate the XML records before storing them via the validateXML parameter
|
2024-05-09 14:06:04 +02:00 |
Claudio Atzori
|
39a2afe8b5
|
[graph provision] fixed XML serialization of the usage counts measures, renamed workflow actions to better reflect their role
|
2024-05-09 13:54:42 +02:00 |
Claudio Atzori
|
18aa323ee9
|
cleanup unused classes, adjustments in the oozie wf definition
|
2024-05-08 11:36:46 +02:00 |
Michele Artini
|
c9a327bc50
|
refactoring of gzip method
|
2024-05-08 11:34:08 +02:00 |
Michele Artini
|
e234848af8
|
oaf record: xpath for root
|
2024-05-08 10:00:53 +02:00 |
Claudio Atzori
|
b4e3389432
|
fixed property mapping creating the RelatedEntity transient objects. spark cores & memory adjustments. Code formatting
|
2024-05-07 16:25:17 +02:00 |
Giambattista Bloisi
|
711048ceed
|
PrepareRelationsJob rewritten to use Spark Dataframe API and Windowing functions
|
2024-05-07 15:44:33 +02:00 |
Michele Artini
|
70bf6ac415
|
oai exporter tests
|
2024-05-07 09:36:26 +02:00 |
Michele Artini
|
aa40e53c19
|
oai exporter parameters
|
2024-05-07 08:01:19 +02:00 |
Michele Artini
|
ed052a3476
|
job for the population of the oai database
|
2024-05-06 16:08:33 +02:00 |
Sandro La Bruzzo
|
0d628cd62b
|
merged again from beta
|
2024-04-23 17:34:55 +02:00 |
Claudio Atzori
|
3a027e97a7
|
[graph indexing] sets spark memoryOverhead in the join operations to the same value used for the memory executor
|
2024-04-19 16:59:58 +02:00 |
Sandro La Bruzzo
|
b84ad0c06e
|
merged beta
|
2024-04-19 14:39:59 +02:00 |
Claudio Atzori
|
ef52128c55
|
included new stats* workflows in parent pom list of modules, code formatting
|
2024-03-26 10:42:10 +01:00 |
Claudio Atzori
|
bfba71a95c
|
further follow up changes from integrating the mergeutils branch
|
2024-03-26 09:01:18 +01:00 |
Claudio Atzori
|
078169b922
|
cleanup
|
2024-03-15 09:56:04 +01:00 |
Claudio Atzori
|
af154d4456
|
implemented changes from #9497: sort abstracts by string length, included author fullnames in the related results, expanded instance details within each children/result XML element
|
2024-03-14 16:21:23 +01:00 |
Claudio Atzori
|
7863c92466
|
expanded paper abstract in the result/children XML element (ticket #9497)
|
2024-03-13 16:25:31 +01:00 |
Claudio Atzori
|
eb5887cb9a
|
including related organization url in the XML record serialization (ticket #9498)
|
2024-03-13 14:46:00 +01:00 |
Claudio Atzori
|
db66555ebb
|
WIP: updated provision workflow to create a JSON based representation of the payload
|
2024-03-12 09:56:09 +01:00 |
Claudio Atzori
|
d4871b31e8
|
WIP: extended provision workflow to create the JSON based payload
|
2024-03-08 11:43:20 +01:00 |
Claudio Atzori
|
6fcf872daa
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into index_records
|
2024-02-28 10:27:28 +01:00 |
Claudio Atzori
|
3f07390a58
|
WIP
|
2024-02-28 10:10:10 +01:00 |
Sandro La Bruzzo
|
7d806a434c
|
formatted code
|
2024-02-28 09:31:58 +01:00 |
Alessia Bardi
|
f2a08d8cc2
|
test for Italian records from IRS repositories
|
2024-01-30 19:20:14 +01:00 |
Claudio Atzori
|
9b13c22e5d
|
[graph provision] retrieve all the context information by adding all=true to the requests issued to the API
|
2024-01-23 15:36:08 +01:00 |
Claudio Atzori
|
f87f3a6483
|
[graph provision] updated param specification for the XML converter job
|
2024-01-23 08:54:37 +01:00 |
Claudio Atzori
|
1c6db320f4
|
[graph provision] obtain context info from the context API instead of from the ISLookUp service
|
2024-01-22 15:53:17 +01:00 |
Miriam Baglioni
|
5011c4d11a
|
refactoring after compilation
|
2023-12-20 15:57:26 +01:00 |
Claudio Atzori
|
ff924215b8
|
[graph provision] added tests for new peerreviewed field
|
2023-12-12 11:21:30 +01:00 |
Claudio Atzori
|
7e8eff40c1
|
[graph provision] added tests for the new model fields
|
2023-12-12 08:54:15 +01:00 |
Giambattista Bloisi
|
613ec5ffce
|
Add profiles for different spark versions: spark-24, spark-34, spark-35
|
2023-12-05 19:11:06 +01:00 |
Giambattista Bloisi
|
2fa78f6071
|
Changes required to build and run tests with Java 17
|
2023-12-05 19:11:06 +01:00 |
Giambattista Bloisi
|
326c9dc08c
|
Changes in maven poms to build and test the project using Spark 3.4.x and scala 2.12
|
2023-12-05 19:11:06 +01:00 |