2023-05-08T04:40:20Z - 2024-05-08T04:40:20Z
Overview
1 Release published by 1 user
Published
september-2023
September 2023
129 Pull requests merged by 8 users
Merged
#433 preparations for dhp-common beta release 1.2.5
Merged
#429 Miscellaneous related to changes in MergeUtils
Merged
#428 [WebCrawl] adding affiliation relations from web information
Merged
#426 [graph indexing] sets spark memoryOverhead in the join operations to the same value used for the memory executor
Merged
#424 Indicator fixes
Merged
#425 Various fixes for the stats DB update workflow, step16-createIndicatorsTables.sql
Merged
#418 doidoost_dismiss
Merged
#422 Refinements to PR #404: refactoring the Oaf records merge utilities into dhp-common
Merged
#421 [BETA] Improvements to copying data from ocean to impala
Merged
#420 Improvements to copying data from ocean to impala
Merged
#419 Enhance Dedup authors matching with algorithms used for ORCID enhancements (task 9690)
Merged
#417 Extend Crossref-funders mapping and datacite hostedbymap
Merged
#416 [BETA] fixed the result_country definition and updated the stats DB copy procedure
Merged
#412 fixed the result_country definition and updated the stats DB copy procedure
Merged
#413 Add action set creation for Datacite affiliations
Merged
#415 [UsageCount] fixed error
Merged
#414 [UsageCount] add check in case the datasource is not matched against those present in the graph
Merged
#318 [UsageCount] Usage count per result split by datasource
Merged
#411 [BETA] fixed typo in indicator query
Merged
#410 [PROD] fixed typo in indicator query
Merged
#409 [BETA] added missing EOS, Generate tables with parquet-files, instead of csv in the contexts.sh script
Merged
#408 [PROD] added missing EOS, Generate tables with parquet-files, instead of csv in the contexts.sh script
Merged
#407 adding context information to projects and datasources
Merged
#406 [Stats wf] #372, #405 to production
Merged
#405 correctly selecting the active hdfs node for the impala cluster
Merged
#372 Changes to indicators and funders definition
Merged
#404 refactoring the Oaf records merge utilities into dhp-common
Merged
#403 mapped oaf:country from results
Merged
#399 Solr JSON payload
Merged
#401 Open Citation integration
Merged
#397 FOS ActionSet for the classification of results without a doi
Merged
#387 Added exception throwing in Hadoop transformation when TR is not syntactically valid
Merged
#381 bulkTaggingPathMapExtention
Merged
#371 Extract Information from Transformative Agreement
Merged
#398 Enrich authors with ORCID info using new matching algorithm
Merged
#370 Unify merge logic of entities in MergeUtils.class
Merged
#400 new plugin to collect from a dump of BASE
Merged
#395 Revised procedure when converting json data into xml
Merged
#394 Orcid Update Procedure
Merged
#384 Fixed problem on missing author in crossref Mapping
Merged
#390 fix import of ORPs
Merged
#393 Revised instance type comparisons in dedup phase
Merged
#392 Set deletedbyinference =true to dedup aliases, created when a dedup in a previous build has been merged in a new dedup
Merged
#391 Support for the PromoteAction strategy [master]
Merged
#389 Support for the PromoteAction strategy
Merged
#386 Use SparkSQL in place of Hive for executing step16-createIndicatorsTables.sql of stats update wf
Merged
#385 Master branch updates from beta January 2024
Merged
#383 Fixed problem on missing author in crossref Mapping
Merged
#382 code of conduct and contributing
Merged
#380 [graph provision] updated param specification for the XML converter job
Merged
#376 Implements pivots table update oozie workflow
Merged
#379 Context API update
Merged
#378 [enrichment single step]
Merged
#375 [FoS integration]fix issue on FoS integration. Removing the null values from FoS
Merged
#374 refined mapping for the extraction of the original resource type
Merged
#367 Improvements and refactoring in Dedup
Merged
#373 enrichmentSingleStep
Merged
#369 Master branch updates from beta December 2023
Merged
#368 9078_xml_records_irish_tender
Merged
#366 [graph cleaning] added cleaning for result.publisher and result.instance.license
Merged
#365 [graph provision] added serialization for the new fields imported from the stats DB
Merged
#364 ORCID Enrichment and Download
Merged
#363 Changes for tables and creation of the new indicator indi_is_result_accessible
Merged
#355 StatsDB workflow to export actionsets about OA routes, diamond, and publicly-funded
Merged
#359 [ENRICHMENT][BETA] Use of community API in enrichment process AND addition to tagging result for communities through projects
Merged
#350 COAR based resource types & Irish tender
Merged
#360 Clear working dir in bipranker workflow
Merged
#356 graph cleaning, suggestions from ticket 8898 - round 2
Merged
#353 Add Pubmed affiliations (inferred by BIP) as actionsets
Merged
#352 URL Validator to accept double slashes
Merged
#362 Project propagation via communityAPI instead of using IS via IIS
Merged
#354 BulkTag via Community APIs
Merged
#358 Master branch updates from beta October 2023
Merged
#357 9117_pubmed_affiliations_prod
Merged
#351 FIX: GroupEntitiesSparkJob deletes whole graph outputPath instead of its temporary folder
Merged
#349 [dedup] use common saveParquet
and save
methods to ensure outputs are compressed
Merged
#347 [ActionManagerFramework] documentation
Merged
#346 [UnresolvedEntities] changing in the creation of the unresolved entities
Merged
#332 Beta stats wf updated
Merged
#344 implemented relation to irish funder from a Json list
Merged
#342 Extending the coverage of the peer non-unknown refereed instances
Merged
#345 Fix cleaning of Pmid where parsing of numbers stopped at first not leading 0 (zero) character
Merged
#343 SWH_integration
Merged
#340 extended existing code to import of POCI from open citation
Merged
#339 Fix bug in conversion from dedup json model to Spark Dataset of Rows (instanceTypeMatch no longer working)
Merged
#333 SparkPropagateRelation relations do not propagate deletedByInference and invisible
Merged
#334 GroupEntities and DispatchEntites are now merged in GroupEntitiesSparkJob
Merged
#341 fixed dedup configuration management in the Broker workflow
Merged
#338 Run CC and RAM sequentieally in dhp-impact-indicators WF
Merged
#337 Master branch updates from beta September 2023
Merged
#336 [graph raw] datainfo.invisible set as true only for entities
Merged
#335 Fix import of affiliations relations from Crossref
Merged
#331 Add sparkExecutorMemoryOverhead workflow config to set off-heap memory for Spark actions. If not explicitly set it is defaulted to 1Gb
Merged
#330 Rewrite SparkPropagateRelation exploiting Dataframe API
Merged
#284 8172_impact_indicators_workflow
Merged
#329 DispatchEntitiesSparkJob: manage all entity types together, support filtering by dataInfo.invisible flag
Merged
#325 graph cleaning, suggestions from ticket 8898
Merged
#328 Add a "CleanRelation" action after the PropagateRelation to filter out all relations that have been deleted by inference or that are pointing to dangling entities
Merged
#321 Updates Promotion DBs
Merged
#320 Import affiliation relations from Crossref
Merged
#326 [graph indexing] expand the instance level fulltext in the XML records
Merged
#324 Refactor Dedup using Spark Dataframe API, initial support for scala 2.12 and Spark 3.4
Merged
#315 [graph cleaning] fixed regex behaviour for cleaning ROR and GRID identifiers, added tests
Merged
#323 fix_beta_tests
Merged
#317 Master branch updates from beta July 2023
Merged
#322 promotion-prod-only
Merged
#319 Import dnet-pace-core module in this project and use it after renaming to dhp-pace-core
Merged
#301 update sql query to return distinct pids
Merged
#314 Update step15_5.sql
Merged
#313 Update step15_5.sql
Merged
#312 Update step16-createIndicatorsTables.sql
Merged
#311 Update step15.sql
Merged
#309 Update step20-createMonitorDB_institutions.sql
Merged
#308 [stats wf] Bug fixes
Merged
#307 [graph cleaning] pid cleaning
Merged
#306 update sql query to return distinct pids [beta]
Merged
#299 propagation of projects through parent-child relations
Merged
#298 [aggregator graph] validation for URLs from oaf:fulltext
Merged
#297 removeTaggingCondition
Merged
#305 [stats wf] Added memory to hive
Merged
#304 [stats wf] Bug fix on indicators step
Merged
#303 [ stats wf] Bug fix
Merged
#300 Changes to beta stats wf
Merged
#296 [UsageCount] addition of usagecount for Projects and datasources
Merged
#295 Updates to steps related to transfer data to impala cluster
Merged
#294 fix APC affiliation links
Merged
#293 Update copyDataToImpalaCluster.sh
Merged
#292 removed the inverse of the Citing relation
Merged
#291 Update copyDataToImpalaCluster.sh
7 Pull requests proposed by 6 users
Proposed
#327 Changes in maven poms to build and test the project using Spark 3.4.x and scala 2.12
Proposed
#388 Continuous Validation Workflow
Proposed
#396 WIP: Fix SWH integration WF
Proposed
#430 Various fixes in the stats wf
Proposed
#431 WIP: playing with dependencies to compile also with macos arm64 and openjdk 11 or 17
Proposed
#432 rest-collector-plugin-with-retry
Proposed
#434 Fixes in Graph Provision
1 Issue created by 1 user
Opened
#377 SWH integration produced no data
4 Unresolved Conversations
Open
#269
WIP: subjectPropagation
Open
#287
WIP: Graph footprint optimisation
Open
#263
[bulk tagging] Factor out long xqueries
Open
#270
Upgrade POM version