Claudio Atzori
|
753c2a72bd
|
Merge pull request 'fix import of ORPs' (#390) from import_orps_fix into beta
Reviewed-on: D-Net/dnet-hadoop#390
|
2024-02-15 15:02:08 +01:00 |
Claudio Atzori
|
a63b091bae
|
Merge branch 'beta' into import_orps_fix
|
2024-02-15 15:01:56 +01:00 |
Giambattista Bloisi
|
85aeff72f1
|
Merge pull request 'Revised instance type comparisons in dedup phase' (#393) from revisedInstanceType into beta
Reviewed-on: D-Net/dnet-hadoop#393
|
2024-02-15 12:15:37 +01:00 |
Giambattista Bloisi
|
d65285da7f
|
Promote "Research" to a jolly instanceType in dedup comparisons
Compare "Journal" and "Part of book or chapter of book" with "Article"
|
2024-02-15 12:11:04 +01:00 |
Giambattista Bloisi
|
29194472a7
|
Promote "Research" to a jolly instanceType in dedup comparisons
Compare Part of book or chapter of book with Article
|
2024-02-15 11:53:46 +01:00 |
Miriam Baglioni
|
8dae10b442
|
-
|
2024-02-14 14:57:08 +01:00 |
Miriam Baglioni
|
83bb97be83
|
[Tagging Projects and Datasource] added test to check datasource tagging. Fixed issue
|
2024-02-14 11:23:47 +01:00 |
Miriam Baglioni
|
6e1f383e4a
|
[Tagging Projects and Datasource] first extention of bulktagging to add the context to projects and datasource
|
2024-02-13 16:37:14 +01:00 |
Miriam Baglioni
|
3f7d262a4e
|
mergin with branch beta
|
2024-02-13 14:05:58 +01:00 |
Miriam Baglioni
|
eca021f4d6
|
[Transformative Agreement] add results with information abount the agreement and the country of the organization paid for it
|
2024-02-13 12:21:07 +01:00 |
Miriam Baglioni
|
bdb6bbb365
|
mergin with branch beta
|
2024-02-12 15:50:43 +01:00 |
Claudio Atzori
|
d85d2df6ad
|
[graph raw] fixed mapping of the original resource type from the Datacite format
|
2024-02-09 10:20:20 +01:00 |
Giambattista Bloisi
|
b19643f6eb
|
Dedup aliases, created when a dedup in a previous build has been merged in a new dedup, need to be marked as "deletedbyinference", since they are "merged" in the new dedup
|
2024-02-08 15:34:59 +01:00 |
Claudio Atzori
|
e6bdee86d1
|
Merge pull request 'Support for the PromoteAction strategy' (#389) from promote_actions_join_type into beta
Reviewed-on: D-Net/dnet-hadoop#389
|
2024-02-08 15:08:05 +01:00 |
Antonis Lempesis
|
dd4c27f4f3
|
added 2 new institutions in monitor
|
2024-02-08 12:57:57 +02:00 |
Claudio Atzori
|
38c9001147
|
fixed import of ORPs stored on HDFS in the internal graph format (e.g. Datacite)
|
2024-02-07 17:02:05 +01:00 |
Claudio Atzori
|
fd17c1f17c
|
[actiosets] fixed join type
|
2024-02-05 16:55:36 +02:00 |
Claudio Atzori
|
009dcf6aea
|
[actiosets] introduced support for the PromoteAction strategy
|
2024-02-05 16:43:40 +02:00 |
Claudio Atzori
|
bb82052c40
|
[graph cleaning] rule out datasources without an officialname
|
2024-02-05 14:59:27 +02:00 |
Claudio Atzori
|
42f5506306
|
[orcid enrichment] fixed directory cleanup before distcp
|
2024-02-05 09:45:36 +02:00 |
Alessia Bardi
|
f2a08d8cc2
|
test for Italian records from IRS repositories
|
2024-01-30 19:20:14 +01:00 |
Antonis Lempesis
|
a512ead447
|
changed orcid ids to all capital
|
2024-01-30 16:54:47 +02:00 |
Miriam Baglioni
|
07a373a0bd
|
[bulkTagging] removing checks while performing the substring action so that it will fire an Exception if the paramneters are wrongly set
|
2024-01-30 13:51:11 +01:00 |
Miriam Baglioni
|
ead08b0dd4
|
mergin with branch beta
|
2024-01-30 12:19:10 +01:00 |
Antonis Lempesis
|
bb10a22290
|
merged changes from dnet-hadoop
|
2024-01-29 21:51:47 +02:00 |
Miriam Baglioni
|
a5995ab557
|
[orcid-enrichment] change the value of parameters.
|
2024-01-29 18:19:48 +01:00 |
Miriam Baglioni
|
a418dacb47
|
[UsageCount] code extention to include also the name of the datasource
|
2024-01-29 18:12:33 +01:00 |
Miriam Baglioni
|
e9131f4e4a
|
mergin with branch beta
|
2024-01-29 16:27:18 +01:00 |
Sandro La Bruzzo
|
9aebca77a0
|
Added exception throwing in Hadoop transformation when TR is not syntactically valid
|
2024-01-29 14:41:02 +01:00 |
Claudio Atzori
|
f804c58bc7
|
Merge pull request 'Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf' (#386) from stats_with_spark_sql into beta
Reviewed-on: D-Net/dnet-hadoop#386
|
2024-01-29 09:11:59 +01:00 |
Claudio Atzori
|
926903b06b
|
Merge branch 'beta' into stats_with_spark_sql
|
2024-01-29 09:11:45 +01:00 |
Giambattista Bloisi
|
078df0b4d1
|
Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf
|
2024-01-26 21:56:55 +01:00 |
Claudio Atzori
|
bf99c424fa
|
Merge pull request 'Fixed problem on missing author in crossref Mapping' (#383) from crossref_missing_author_fix into beta
Reviewed-on: D-Net/dnet-hadoop#383
|
2024-01-26 15:57:23 +01:00 |
Claudio Atzori
|
ce3200263e
|
Merge branch 'beta' into crossref_missing_author_fix
|
2024-01-26 15:57:04 +01:00 |
Sandro La Bruzzo
|
e889808daa
|
Fixed problem on missing author in crossref Mapping
|
2024-01-26 12:19:04 +01:00 |
Claudio Atzori
|
9e8fc6aa88
|
[collection] increased logging from the oai-pmh metadata collection process
|
2024-01-26 09:17:20 +01:00 |
Antonis Lempesis
|
c548796463
|
Changed step16-createIndicatorsTables to use a spark oozie action instead of hive
|
2024-01-26 02:04:48 +02:00 |
Sandro La Bruzzo
|
0386f36385
|
Added workflow to update ORCID and replaced some parsing, because the update works and employments xml differs from the dump one.
|
2024-01-25 19:40:59 +01:00 |
Antonis Lempesis
|
a7115cfa9e
|
max mem of joins (hive.mapjoin.followby.gby.localtask.max.memory.usage) now 80%, up from 55%.
|
2024-01-25 15:13:16 +01:00 |
Antonis Lempesis
|
fd43b0e84a
|
max mem of joins (hive.mapjoin.followby.gby.localtask.max.memory.usage) now 80%, up from 55%.
|
2024-01-25 15:06:34 +01:00 |
Claudio Atzori
|
2838a9b630
|
Update 'CONTRIBUTING.md'
|
2024-01-24 16:07:05 +01:00 |
Claudio Atzori
|
da944a5c55
|
Merge pull request 'code of conduct and contributing' (#382) from contributing into beta
Reviewed-on: D-Net/dnet-hadoop#382
|
2024-01-24 15:40:26 +01:00 |
Claudio Atzori
|
0c97a3a81a
|
minor
|
2024-01-24 10:56:33 +01:00 |
Claudio Atzori
|
2c1e6849f0
|
added code of conduct and contributing files
|
2024-01-24 10:36:41 +01:00 |
Claudio Atzori
|
9b13c22e5d
|
[graph provision] retrieve all the context information by adding all=true to the requests issued to thr API
|
2024-01-23 15:36:08 +01:00 |
Claudio Atzori
|
3e96777cc4
|
[collection] increased logging from the oai-pmh metadata collection process
|
2024-01-23 15:21:03 +01:00 |
Sandro La Bruzzo
|
43e0bba7ed
|
logg added during download
|
2024-01-23 15:04:49 +01:00 |
Miriam Baglioni
|
f7d06dc661
|
compilation after merging
|
2024-01-23 11:43:08 +01:00 |
Miriam Baglioni
|
6e58d79623
|
mergin with branch beta
|
2024-01-23 11:36:47 +01:00 |
Miriam Baglioni
|
e0ec800d7e
|
[BulkTagging] extend the definition of the pathMap to include also actions that should be performed of the value extracted from the result befor applying the constraint
|
2024-01-23 11:34:53 +01:00 |