Antonis Lempesis
|
459167ac2f
|
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
|
2024-03-21 12:44:58 +02:00 |
Antonis Lempesis
|
07f634a46d
|
code cleanup
|
2024-03-21 12:44:30 +02:00 |
Antonis Lempesis
|
9521625a07
|
code cleanup
|
2024-03-21 11:45:08 +02:00 |
Sandro La Bruzzo
|
58dbe71d39
|
update crossref mapping to be runnable separately as a single datasource outside doiboost
|
2024-03-20 17:04:52 +01:00 |
Antonis Lempesis
|
67a5aa0a38
|
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
|
2024-03-19 11:24:54 +02:00 |
dimitrispie
|
a3a570e9a0
|
Commit monitor-updates-wf
|
2024-03-19 09:42:21 +02:00 |
Giambattista Bloisi
|
664a381d31
|
Unify merge logic of entities in MergeUtils.class
|
2024-03-18 16:04:49 +01:00 |
Michele Artini
|
cb29b9773c
|
xslt rules
|
2024-03-18 15:31:34 +01:00 |
Michele Artini
|
85b844d57e
|
updated BASE filter param
|
2024-03-15 15:03:27 +01:00 |
Michele Artini
|
455f2e1e07
|
apply commits from master
|
2024-03-15 14:56:39 +01:00 |
Michele Artini
|
30167aa882
|
mapped oaf:country from results
|
2024-03-15 11:24:16 +01:00 |
Michele Artini
|
88fef367b9
|
new plugin to collect from a dump of BASE
|
2024-03-15 10:47:52 +01:00 |
Claudio Atzori
|
078169b922
|
cleanup
|
2024-03-15 09:56:04 +01:00 |
Claudio Atzori
|
af154d4456
|
implemented changes from #9497: sort abstracts by string length, included author fullnames in the related results, expanded instance details within each children/result XML element
|
2024-03-14 16:21:23 +01:00 |
Claudio Atzori
|
7863c92466
|
expanded paper abstract in the result/children XML element (ticket #9497)
|
2024-03-13 16:25:31 +01:00 |
Claudio Atzori
|
eb5887cb9a
|
including related organization url in the XML record serialization (ticket #9498)
|
2024-03-13 14:46:00 +01:00 |
Michele Artini
|
a99942f7cf
|
filter by base types
|
2024-03-13 12:12:42 +01:00 |
Michele Artini
|
7f7083f53e
|
updated sql query for filtering BASE records
|
2024-03-13 11:57:26 +01:00 |
Sandro La Bruzzo
|
5281f010a5
|
applied cherry pick
|
2024-03-13 09:59:20 +01:00 |
Sandro La Bruzzo
|
ee1fcb672b
|
code refactor
|
2024-03-13 09:46:31 +01:00 |
Miriam Baglioni
|
5a32bb9578
|
[OC New] last fix
|
2024-03-13 09:36:18 +01:00 |
Sandro La Bruzzo
|
c532831718
|
Moved Crossref mapping to dhp-aggregations,
refactored code to avoid using the utilities defined in DOIBoostMappingUtils for creating parts of the OAF, using the utilities in OafMappingUtils instead
|
2024-03-13 06:56:10 +01:00 |
Miriam Baglioni
|
48c052215c
|
[OC New] last fix
|
2024-03-12 23:12:32 +01:00 |
Michele Artini
|
d9b23a76c5
|
comments
|
2024-03-12 14:53:34 +01:00 |
Michele Artini
|
841ca92246
|
Merge pull request 'new plugin to collect from a dump of BASE' (#400) from base-collector-plugin into master
Reviewed-on: D-Net/dnet-hadoop#400
|
2024-03-12 12:22:42 +01:00 |
Michele Artini
|
3bcfc40293
|
new plugin to collect from a dump of BASE
|
2024-03-12 12:17:58 +01:00 |
Claudio Atzori
|
db66555ebb
|
WIP: updated provision workflow to create a JSON based representation of the payload
|
2024-03-12 09:56:09 +01:00 |
Antonis Lempesis
|
f74c7e8689
|
selecting distinct peer_reviewed
|
2024-03-12 02:13:04 +02:00 |
Giambattista Bloisi
|
9092075760
|
Enrich authors with ORCID info using new matching algorithm
|
2024-03-11 13:23:59 +01:00 |
Sandro La Bruzzo
|
cbd4e5e4bb
|
update mag mapping
|
2024-03-08 16:31:40 +01:00 |
Claudio Atzori
|
d4871b31e8
|
WIP: extended provision workflow to create the JSON based payload
|
2024-03-08 11:43:20 +01:00 |
Antonis Lempesis
|
3c79720342
|
fixed the irish result subset
|
2024-03-07 14:08:57 +02:00 |
Antonis Lempesis
|
5ae4b4286c
|
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
|
2024-03-07 12:15:19 +02:00 |
Miriam Baglioni
|
5180b6ec8a
|
[FOSNEW] removed test class
|
2024-03-07 10:47:13 +01:00 |
Miriam Baglioni
|
7827a2d66b
|
[OCNEW] added creation of the actionset for the results classified with FoS based on the OpenAIRE identifier
|
2024-03-07 10:36:30 +01:00 |
Antonis Lempesis
|
316d585c8a
|
using distinct apcs per publication to avoid huge sums
|
2024-03-07 02:07:59 +02:00 |
Miriam Baglioni
|
fd34372c40
|
[OCNEW] first implementation
|
2024-03-06 13:42:00 +01:00 |
Sandro La Bruzzo
|
d34cef3f8d
|
Merge remote-tracking branch 'origin/beta' into doidoost_dismiss
|
2024-03-05 11:45:31 +01:00 |
Sandro La Bruzzo
|
3b837d38ce
|
added oozie workflow
|
2024-03-05 11:44:59 +01:00 |
Sandro La Bruzzo
|
f417515e43
|
Implemented class that generates a normalized table of MAG, which is the starting point for the creation of the mag source
|
2024-03-04 17:15:13 +01:00 |
Giambattista Bloisi
|
3067ea390d
|
Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf
|
2024-03-04 11:13:34 +01:00 |
Sandro La Bruzzo
|
ad0e9aa80c
|
added first part of the refactoring of the code generating MAG,
making it more readable using spark sql queries
|
2024-02-29 18:16:15 +01:00 |
Sandro La Bruzzo
|
9d94648f3b
|
code formatted
|
2024-02-29 18:15:20 +01:00 |
Giambattista Bloisi
|
3cd5590f3b
|
When converting json to XML, remove characters that are not allowed in the XML 1.0 specs, as they will cause xpath failures even if escaped
|
2024-02-28 15:14:18 +01:00 |
Giambattista Bloisi
|
56dd05f85c
|
Merge pull request 'Revised procedure when converting json data into xml' (#395) from restiterator_xmlcleanup into beta
Reviewed-on: D-Net/dnet-hadoop#395
|
2024-02-28 10:38:54 +01:00 |
Claudio Atzori
|
6fcf872daa
|
Merge branch 'beta' of https://code-repo.d4science.org/D-Net/dnet-hadoop into index_records
|
2024-02-28 10:27:28 +01:00 |
Claudio Atzori
|
3f07390a58
|
WIP
|
2024-02-28 10:10:10 +01:00 |
Miriam Baglioni
|
c94d94035c
|
[BulkTagging] added check to verify if field is present in the pathMap
|
2024-02-28 09:41:42 +01:00 |
Sandro La Bruzzo
|
7d806a434c
|
formatted code
|
2024-02-28 09:31:58 +01:00 |
Sandro La Bruzzo
|
e468e99100
|
Merge pull request 'Orcid Update Procedure' (#394) from orcid_update into beta
Reviewed-on: D-Net/dnet-hadoop#394
|
2024-02-28 09:17:30 +01:00 |