Claudio Atzori
|
5add51f38c
|
Merge pull request 'fixed the result_country definition and updated the stats DB copy procedure' (#412) from antonis.lempesis/dnet-hadoop:beta into master
Reviewed-on: D-Net/dnet-hadoop#412
|
2024-04-03 12:34:17 +02:00 |
Lampros Smyrnaios
|
b7c8acc563
|
- Update the code that acquires the "IMPALA_HDFS_NODE" to test the "tmp"-dir instead of the base-dir, and introduce retries to overcome potential file-system failures. This change was suggested by "Sebastian Tymkow" and "Grzegorz Bakalarski".
- Fix typos.
|
2024-04-03 13:15:37 +03:00 |
Antonis Lempesis
|
df6e3bda04
|
added new orgs in monitor
|
2024-04-01 22:45:29 +03:00 |
Antonis Lempesis
|
573b081f1d
|
added new orgs in monitor
|
2024-04-01 22:24:46 +03:00 |
Antonis Lempesis
|
0bf2a7a359
|
fixed the result_country definition
|
2024-04-01 15:23:22 +03:00 |
Claudio Atzori
|
f01390702e
|
Merge pull request 'fixed typo in indicator query' (#410) from antonis.lempesis/dnet-hadoop:beta into master
Reviewed-on: D-Net/dnet-hadoop#410
|
2024-03-27 13:42:07 +01:00 |
Antonis Lempesis
|
9ff44eed96
|
fixed typo in indicator query
added more institutions
|
2024-03-27 14:39:01 +02:00 |
Claudio Atzori
|
5592ccc37a
|
Merge pull request 'added missing EOS, Generate tables with parquet-files, instead of csv in the contexts.sh script' (#408) from antonis.lempesis/dnet-hadoop:beta into master
Reviewed-on: D-Net/dnet-hadoop#408
|
2024-03-27 12:02:57 +01:00 |
Antonis Lempesis
|
1fee4124e0
|
added missing EOS
|
2024-03-27 12:58:25 +02:00 |
Claudio Atzori
|
d16c15da8d
|
adjusted pom files
|
2024-03-26 14:00:44 +01:00 |
Lampros Smyrnaios
|
036ba03fcd
|
Generate tables with parquet-files, instead of csv, in "dhp-stats-update/.../contexts.sh" script.
|
2024-03-26 13:29:04 +02:00 |
Claudio Atzori
|
09a6d17059
|
Merge pull request '[Stats wf] #372, #405 to production' (#406) from antonis.lempesis/dnet-hadoop:beta into master
Reviewed-on: D-Net/dnet-hadoop#406
|
2024-03-26 12:18:26 +01:00 |
Claudio Atzori
|
d70793847d
|
resolving conflicts on step16-createIndicatorsTables.sql
|
2024-03-26 12:17:52 +01:00 |
Lampros Smyrnaios
|
bc8c97182d
|
Automatically select the ACTIVE HDFS NODE for Impala cluster, in all "copyDataToImpalaCluster.sh" scripts.
|
2024-03-26 13:01:12 +02:00 |
Lampros Smyrnaios
|
92cc27e7eb
|
Use the ACTIVE HDFS NODE for Impala cluster, in "copyDataToImpalaCluster.sh" script.
|
2024-03-26 12:34:11 +02:00 |
Michele De Bonis
|
f6601ea7d1
|
default parameters for openorgs updated
|
2024-03-25 13:07:04 +01:00 |
Michele De Bonis
|
cd4c3c934d
|
openorgs wf updated
|
2024-03-22 15:42:37 +01:00 |
Antonis Lempesis
|
4c40c96e30
|
code cleanup
|
2024-03-22 10:16:49 +02:00 |
Antonis Lempesis
|
459167ac2f
|
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
|
2024-03-21 12:44:58 +02:00 |
Antonis Lempesis
|
07f634a46d
|
code cleanup
|
2024-03-21 12:44:30 +02:00 |
Antonis Lempesis
|
9521625a07
|
code cleanup
|
2024-03-21 11:45:08 +02:00 |
Antonis Lempesis
|
67a5aa0a38
|
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
|
2024-03-19 11:24:54 +02:00 |
dimitrispie
|
a3a570e9a0
|
Commit monitor-updates-wf
|
2024-03-19 09:42:21 +02:00 |
Michele Artini
|
a99942f7cf
|
filter by base types
|
2024-03-13 12:12:42 +01:00 |
Michele Artini
|
7f7083f53e
|
updated sql query for filtering BASE records
|
2024-03-13 11:57:26 +01:00 |
Michele Artini
|
d9b23a76c5
|
comments
|
2024-03-12 14:53:34 +01:00 |
Michele Artini
|
841ca92246
|
Merge pull request 'new plugin to collect from a dump of BASE' (#400) from base-collector-plugin into master
Reviewed-on: D-Net/dnet-hadoop#400
|
2024-03-12 12:22:42 +01:00 |
Michele Artini
|
3bcfc40293
|
new plugin to collect from a dump of BASE
|
2024-03-12 12:17:58 +01:00 |
Antonis Lempesis
|
f74c7e8689
|
selecting distinct peer_reviewed
|
2024-03-12 02:13:04 +02:00 |
Antonis Lempesis
|
3c79720342
|
fixed the irish result subset
|
2024-03-07 14:08:57 +02:00 |
Antonis Lempesis
|
5ae4b4286c
|
Merge branch 'beta' of https://code-repo.d4science.org/antonis.lempesis/dnet-hadoop into beta
|
2024-03-07 12:15:19 +02:00 |
Antonis Lempesis
|
316d585c8a
|
using distinct apcs per publication to avoid huge sums
|
2024-03-07 02:07:59 +02:00 |
Giambattista Bloisi
|
3067ea390d
|
Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf
|
2024-03-04 11:13:34 +01:00 |
Miriam Baglioni
|
c94d94035c
|
[BulkTagging] added check to verify if field is present in the pathMap
|
2024-02-28 09:41:42 +01:00 |
Michele Artini
|
4374d7449e
|
mapping of project PIDs
|
2024-02-22 14:44:35 +01:00 |
Claudio Atzori
|
07d009007b
|
Merge pull request 'Fixed problem on missing author in crossref Mapping' (#384) from crossref_missing_author_fix_master into master
Reviewed-on: D-Net/dnet-hadoop#384
|
2024-02-15 15:06:17 +01:00 |
Claudio Atzori
|
071d044971
|
Merge branch 'master' into crossref_missing_author_fix_master
|
2024-02-15 15:04:19 +01:00 |
Claudio Atzori
|
b3ddbaed58
|
fixed import of ORPs stored on HDFS in the internal graph format (e.g. Datacite)
|
2024-02-15 15:02:48 +01:00 |
Claudio Atzori
|
1416f16b35
|
[graph raw] fixed mapping of the original resource type from the Datacite format
|
2024-02-09 10:19:53 +01:00 |
Giambattista Bloisi
|
ba1a0e7b4f
|
Merge pull request 'Set deletedbyinference = true to dedup aliases, created when a dedup in a previous build has been merged in a new dedup' (#392) from fix_dedupaliases_deletedbyinference into master
Reviewed-on: D-Net/dnet-hadoop#392
|
2024-02-08 15:29:29 +01:00 |
Giambattista Bloisi
|
079085286c
|
Merge branch 'master' into fix_dedupaliases_deletedbyinference
|
2024-02-08 15:29:13 +01:00 |
Giambattista Bloisi
|
8dd666aedd
|
Dedup aliases, created when a dedup in a previous build has been merged in a new dedup, need to be marked as "deletedbyinference", since they are "merged" in the new dedup
|
2024-02-08 15:27:57 +01:00 |
Claudio Atzori
|
f21133229a
|
Merge pull request 'Support for the PromoteAction strategy [master]' (#391) from promote_actions_join_type_master into master
Reviewed-on: D-Net/dnet-hadoop#391
|
2024-02-08 15:12:16 +01:00 |
Claudio Atzori
|
d86b909db2
|
[actionsets] fixed join type
|
2024-02-08 15:10:55 +01:00 |
Claudio Atzori
|
08162902ab
|
[actionsets] introduced support for the PromoteAction strategy
|
2024-02-08 15:10:40 +01:00 |
Antonis Lempesis
|
dd4c27f4f3
|
added 2 new institutions in monitor
|
2024-02-08 12:57:57 +02:00 |
Claudio Atzori
|
e8630a6d03
|
[graph cleaning] rule out datasources without an officialname
|
2024-02-05 14:59:06 +02:00 |
Claudio Atzori
|
f28c63d5ef
|
[orcid enrichment] fixed directory cleanup before distcp
|
2024-02-05 09:44:56 +02:00 |
Antonis Lempesis
|
a512ead447
|
changed orcid ids to all capital
|
2024-01-30 16:54:47 +02:00 |
Claudio Atzori
|
1a8b609ed2
|
code formatting
|
2024-01-30 11:34:16 +01:00 |