2023-05-08T04:40:20Z - 2024-05-08T04:40:20Z

Overview

136 Active Pull Requests
1 Active Issue
Excluding merges, 8 authors have pushed 28 commits to master and 660 commits to all branches. On master, 50 files have changed and there have been 1746 additions and 1343 deletions.

1 Release published by 1 user

Published september-2023 September 2023 2023-09-11 16:07:49 +02:00

129 Pull requests merged by 8 users

Merged #433 preparations for dhp-common beta release 1.2.5 2024-05-02 11:28:20 +02:00

Merged #429 Miscellaneous related to changes in MergeUtils 2024-04-24 08:55:38 +02:00

Merged #428 [WebCrawl] adding affiliation relations from web information 2024-04-23 09:36:16 +02:00

Merged #426 [graph indexing] sets spark memoryOverhead in the join operations to the same value used for the memory executor 2024-04-19 16:59:46 +02:00

Merged #424 Indicator fixes 2024-04-18 11:39:04 +02:00

Merged #425 Various fixes for the stats DB update workflow, step16-createIndicatorsTables.sql 2024-04-18 11:25:24 +02:00

Merged #418 doidoost_dismiss 2024-04-17 12:01:14 +02:00

Merged #422 Refinements to PR #404: refactoring the Oaf records merge utilities into dhp-common 2024-04-17 11:58:27 +02:00

Merged #421 [BETA] Improvements to copying data from ocean to impala 2024-04-16 14:22:33 +02:00

Merged #420 Improvements to copying data from ocean to impala 2024-04-16 14:17:48 +02:00

Merged #419 Enhance Dedup authors matching with algorithms used for ORCID enhancements (task 9690) 2024-04-16 10:24:12 +02:00

Merged #417 Extend Crossref-funders mapping and datacite hostedbymap 2024-04-09 10:30:54 +02:00

Merged #416 [BETA] fixed the result_country definition and updated the stats DB copy procedure 2024-04-03 12:36:04 +02:00

Merged #412 fixed the result_country definition and updated the stats DB copy procedure 2024-04-03 12:34:18 +02:00

Merged #413 Add action set creation for Datacite affiliations 2024-04-02 17:33:39 +02:00

Merged #415 [UsageCount] fixed error 2024-04-02 17:06:12 +02:00

Merged #414 [UsageCount] add check in case the datasource is not matched against those present in the graph 2024-04-02 16:30:40 +02:00

Merged #318 [UsageCount] Usage count per result split by datasource 2024-04-02 10:21:40 +02:00

Merged #411 [BETA] fixed typo in indicator query 2024-03-27 13:56:43 +01:00

Merged #410 [PROD] fixed typo in indicator query 2024-03-27 13:42:07 +01:00

Merged #409 [BETA] added missing EOS, Generate tables with parquet-files, instead of csv in the contexts.sh script 2024-03-27 12:04:05 +01:00

Merged #408 [PROD] added missing EOS, Generate tables with parquet-files, instead of csv in the contexts.sh script 2024-03-27 12:02:58 +01:00

Merged #407 adding context information to projects and datasources 2024-03-26 14:53:39 +01:00

Merged #406 [Stats wf] #372, #405 to production 2024-03-26 12:18:27 +01:00

Merged #405 correctly selecting the active hdfs node for the impala cluster 2024-03-26 12:07:47 +01:00

Merged #372 Changes to indicators and funders definition 2024-03-26 08:46:21 +01:00

Merged #404 refactoring the Oaf records merge utilities into dhp-common 2024-03-25 16:16:08 +01:00

Merged #403 mapped oaf:country from results 2024-03-25 16:13:32 +01:00

Merged #399 Solr JSON payload 2024-03-25 16:13:00 +01:00

Merged #401 Open Citation integration 2024-03-25 16:10:41 +01:00

Merged #397 FOS ActionSet for the classification of results without a doi 2024-03-25 16:07:48 +01:00

Merged #387 Added exception throwing in Hadoop transformation when TR is not syntactically valid 2024-03-25 16:05:44 +01:00

Merged #381 bulkTaggingPathMapExtention 2024-03-25 16:02:02 +01:00

Merged #371 Extract Information from Transformative Agreement 2024-03-25 15:42:37 +01:00

Merged #398 Enrich authors with ORCID info using new matching algorithm 2024-03-22 17:29:20 +01:00

Merged #370 Unify merge logic of entities in MergeUtils.class 2024-03-22 10:53:15 +01:00

Merged #400 new plugin to collect from a dump of BASE 2024-03-12 12:22:43 +01:00

Merged #395 Revised procedure when converting json data into xml 2024-02-28 10:38:55 +01:00

Merged #394 Orcid Update Procedure 2024-02-28 09:17:30 +01:00

Merged #384 Fixed problem on missing author in crossref Mapping 2024-02-15 15:06:18 +01:00

Merged #390 fix import of ORPs 2024-02-15 15:02:08 +01:00

Merged #393 Revised instance type comparisons in dedup phase 2024-02-15 12:15:38 +01:00

Merged #392 Set deletedbyinference=true to dedup aliases, created when a dedup in a previous build has been merged in a new dedup 2024-02-08 15:29:30 +01:00

Merged #391 Support for the PromoteAction strategy [master] 2024-02-08 15:12:17 +01:00

Merged #389 Support for the PromoteAction strategy 2024-02-08 15:08:05 +01:00

Merged #386 Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf 2024-01-29 09:12:00 +01:00

Merged #385 Master branch updates from beta January 2024 2024-01-26 16:09:14 +01:00

Merged #383 Fixed problem on missing author in crossref Mapping 2024-01-26 15:57:24 +01:00

Merged #382 code of conduct and contributing 2024-01-24 15:40:27 +01:00

Merged #380 [graph provision] updated param specification for the XML converter job 2024-01-23 08:55:59 +01:00

Merged #376 Implements pivots table update oozie workflow 2024-01-22 16:37:30 +01:00

Merged #379 Context API update 2024-01-22 15:55:33 +01:00

Merged #378 [enrichment single step] 2024-01-18 09:41:10 +01:00

Merged #375 [FoS integration] fix issue on FoS integration. Removing the null values from FoS 2024-01-12 10:27:28 +01:00

Merged #374 refined mapping for the extraction of the original resource type 2024-01-11 16:29:48 +01:00

Merged #367 Improvements and refactoring in Dedup 2024-01-11 11:24:07 +01:00

Merged #373 enrichmentSingleStep 2024-01-10 16:58:50 +01:00

Merged #369 Master branch updates from beta December 2023 2023-12-15 11:18:31 +01:00

Merged #368 9078_xml_records_irish_tender 2023-12-12 12:34:43 +01:00

Merged #366 [graph cleaning] added cleaning for result.publisher and result.instance.license 2023-12-08 16:58:38 +01:00

Merged #365 [graph provision] added serialization for the new fields imported from the stats DB 2023-12-05 16:39:44 +01:00

Merged #364 ORCID Enrichment and Download 2023-12-01 15:05:45 +01:00

Merged #363 Changes for tables and creation of the new indicator indi_is_result_accessible 2023-12-01 15:05:24 +01:00

Merged #355 StatsDB workflow to export actionsets about OA routes, diamond, and publicly-funded 2023-12-01 15:03:58 +01:00

Merged #359 [ENRICHMENT][BETA] Use of community API in enrichment process AND addition to tagging result for communities through projects 2023-11-30 14:20:34 +01:00

Merged #350 COAR based resource types & Irish tender 2023-11-29 14:38:08 +01:00

Merged #360 Clear working dir in bipranker workflow 2023-11-22 14:10:40 +01:00

Merged #356 graph cleaning, suggestions from ticket 8898 - round 2 2023-11-22 14:00:38 +01:00

Merged #353 Add Pubmed affiliations (inferred by BIP) as actionsets 2023-11-22 13:53:07 +01:00

Merged #352 URL Validator to accept double slashes 2023-11-22 13:52:09 +01:00

Merged #362 Project propagation via communityAPI instead of using IS via IIS 2023-11-14 16:37:54 +01:00

Merged #354 BulkTag via Community APIs 2023-11-03 12:52:15 +01:00

Merged #358 Master branch updates from beta October 2023 2023-11-03 12:09:45 +01:00

Merged #357 9117_pubmed_affiliations_prod 2023-11-03 11:45:35 +01:00

Merged #351 FIX: GroupEntitiesSparkJob deletes whole graph outputPath instead of its temporary folder 2023-10-17 08:40:24 +02:00

Merged #349 [dedup] use common saveParquet and save methods to ensure outputs are compressed 2023-10-16 11:56:18 +02:00

Merged #347 [ActionManagerFramework] documentation 2023-10-12 10:07:26 +02:00

Merged #346 [UnresolvedEntities] changing in the creation of the unresolved entities 2023-10-10 15:10:22 +02:00

Merged #332 Beta stats wf updated 2023-10-10 09:35:33 +02:00

Merged #344 implemented relation to irish funder from a Json list 2023-10-06 14:26:55 +02:00

Merged #342 Extending the coverage of the peer non-unknown refereed instances 2023-10-06 14:22:13 +02:00

Merged #345 Fix cleaning of Pmid where parsing of numbers stopped at first not leading 0 (zero) character 2023-10-06 14:19:49 +02:00

Merged #343 SWH_integration 2023-10-06 14:15:56 +02:00

Merged #340 extended existing code to import of POCI from open citation 2023-10-03 10:52:12 +02:00

Merged #339 Fix bug in conversion from dedup json model to Spark Dataset of Rows (instanceTypeMatch no longer working) 2023-10-02 11:34:20 +02:00

Merged #333 SparkPropagateRelation relations do not propagate deletedByInference and invisible 2023-10-02 11:27:58 +02:00

Merged #334 GroupEntities and DispatchEntities are now merged in GroupEntitiesSparkJob 2023-10-02 11:25:28 +02:00

Merged #341 fixed dedup configuration management in the Broker workflow 2023-10-02 11:03:51 +02:00

Merged #338 Run CC and RAM sequentially in dhp-impact-indicators WF 2023-09-13 08:52:54 +02:00

Merged #337 Master branch updates from beta September 2023 2023-09-06 11:31:09 +02:00

Merged #336 [graph raw] datainfo.invisible set as true only for entities 2023-09-04 16:14:48 +02:00

Merged #335 Fix import of affiliations relations from Crossref 2023-09-04 15:19:58 +02:00

Merged #331 Add sparkExecutorMemoryOverhead workflow config to set off-heap memory for Spark actions. If not explicitly set it is defaulted to 1Gb 2023-08-29 16:31:37 +02:00

Merged #330 Rewrite SparkPropagateRelation exploiting Dataframe API 2023-08-29 10:47:15 +02:00

Merged #284 8172_impact_indicators_workflow 2023-08-14 15:50:48 +02:00

Merged #329 DispatchEntitiesSparkJob: manage all entity types together, support filtering by dataInfo.invisible flag 2023-08-10 12:56:19 +02:00

Merged #325 graph cleaning, suggestions from ticket 8898 2023-08-08 11:14:20 +02:00

Merged #328 Add a "CleanRelation" action after the PropagateRelation to filter out all relations that have been deleted by inference or that are pointing to dangling entities 2023-08-08 09:49:13 +02:00

Merged #321 Updates Promotion DBs 2023-08-07 12:09:17 +02:00

Merged #320 Import affiliation relations from Crossref 2023-08-07 10:45:31 +02:00

Merged #326 [graph indexing] expand the instance level fulltext in the XML records 2023-07-27 15:02:08 +02:00

Merged #324 Refactor Dedup using Spark Dataframe API, initial support for scala 2.12 and Spark 3.4 2023-07-25 10:17:18 +02:00

Merged #315 [graph cleaning] fixed regex behaviour for cleaning ROR and GRID identifiers, added tests 2023-07-24 10:49:44 +02:00

Merged #323 fix_beta_tests 2023-07-24 10:47:36 +02:00

Merged #317 Master branch updates from beta July 2023 2023-07-18 18:22:05 +02:00

Merged #322 promotion-prod-only 2023-07-13 15:04:53 +02:00

Merged #319 Import dnet-pace-core module in this project and use it after renaming to dhp-pace-core 2023-07-11 14:03:15 +02:00

Merged #301 update sql query to return distinct pids 2023-06-27 12:24:48 +02:00

Merged #314 Update step15_5.sql 2023-06-21 10:33:23 +02:00

Merged #313 Update step15_5.sql 2023-06-21 10:26:17 +02:00

Merged #312 Update step16-createIndicatorsTables.sql 2023-06-21 09:52:33 +02:00

Merged #311 Update step15.sql 2023-06-21 09:20:02 +02:00

Merged #309 Update step20-createMonitorDB_institutions.sql 2023-06-20 15:07:10 +02:00

Merged #308 [stats wf] Bug fixes 2023-06-14 21:57:04 +02:00

Merged #307 [graph cleaning] pid cleaning 2023-06-12 13:32:30 +02:00

Merged #306 update sql query to return distinct pids [beta] 2023-06-12 09:59:01 +02:00

Merged #299 propagation of projects through parent-child relations 2023-06-12 09:57:21 +02:00

Merged #298 [aggregator graph] validation for URLs from oaf:fulltext 2023-06-12 09:55:36 +02:00

Merged #297 removeTaggingCondition 2023-06-12 09:53:06 +02:00

Merged #305 [stats wf] Added memory to hive 2023-06-08 08:58:49 +02:00

Merged #304 [stats wf] Bug fix on indicators step 2023-06-07 16:49:10 +02:00

Merged #303 [stats wf] Bug fix 2023-06-07 14:41:45 +02:00

Merged #300 Changes to beta stats wf 2023-06-06 11:41:38 +02:00

Merged #296 [UsageCount] addition of usagecount for Projects and datasources 2023-05-22 16:13:25 +02:00

Merged #295 Updates to steps related to transfer data to impala cluster 2023-05-18 08:46:17 +02:00

Merged #294 fix APC affiliation links 2023-05-15 15:47:58 +02:00

Merged #293 Update copyDataToImpalaCluster.sh 2023-05-15 12:05:55 +02:00

Merged #292 removed the inverse of the Citing relation 2023-05-15 11:37:40 +02:00

Merged #291 Update copyDataToImpalaCluster.sh 2023-05-12 11:36:35 +02:00

7 Pull requests proposed by 6 users

Proposed #327 Changes in maven poms to build and test the project using Spark 3.4.x and scala 2.12 2023-08-02 18:12:11 +02:00

Proposed #388 Continuous Validation Workflow 2024-02-05 11:07:44 +01:00

Proposed #396 WIP: Fix SWH integration WF 2024-02-27 15:59:43 +01:00

Proposed #430 Various fixes in the stats wf 2024-04-24 11:18:07 +02:00

Proposed #431 WIP: playing with dependencies to compile also with macos arm64 and openjdk 11 or 17 2024-04-30 14:13:25 +02:00

Proposed #432 rest-collector-plugin-with-retry 2024-05-02 10:12:55 +02:00

Proposed #434 Fixes in Graph Provision 2024-05-07 16:47:35 +02:00

1 Issue created by 1 user

Opened #377 SWH integration produced no data 2024-01-16 12:43:32 +01:00

4 Unresolved Conversations

Open #269 WIP: subjectPropagation 2023-10-26 09:59:28 +02:00

Open #287 WIP: Graph footprint optimisation 2023-10-26 09:59:22 +02:00

Open #263 [bulk tagging] Factor out long xqueries 2023-08-29 16:41:45 +02:00

Open #270 Upgrade POM version 2023-05-24 14:50:46 +02:00