Commit Graph

  • b195da3a83 Added utility to write time logs during the deduplication phase Sandro La Bruzzo 2023-06-28 11:20:09 +0200
  • 288ec0b7d6 [doiboost] merged workflow from branch beta Claudio Atzori 2023-06-28 09:15:37 +0200
  • 5f32edd9bf adopting dhp-schema:3.17.1 Claudio Atzori 2023-06-27 16:57:17 +0200
  • e10ce92fe5 [stats wf] merged workflows from branch beta Claudio Atzori 2023-06-27 14:32:48 +0200
  • b93e1541aa Merge pull request 'update sql query to return distinct pids' (#301) from distinct_pids_from_openorgs into master Claudio Atzori 2023-06-27 12:24:47 +0200
  • d029bf0b94 Merge branch 'master' into distinct_pids_from_openorgs Claudio Atzori 2023-06-27 12:24:35 +0200
  • 0f5a819f44 [graph cleaning] fixed regex behaviour for cleaning ROR and GRID identifiers, added tests Claudio Atzori 2023-06-23 16:10:49 +0200
  • 60f25b780d Minor fixes in workflow.xml and job.properties Serafeim Chatzopoulos 2023-06-23 12:51:50 +0300
  • 88a1cbc37d fixed a datasource id Michele Artini 2023-06-22 07:56:33 +0200
  • 009d7f312f fixed a datasource Id Michele Artini 2023-06-21 16:17:34 +0200
  • e4b27182d0 [master] refactoring Miriam Baglioni 2023-06-21 11:15:53 +0200
  • b0ebf56367 Merge pull request 'Update step15_5.sql' (#314) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-21 10:33:22 +0200
  • 2b6370eaee Update step15_5.sql dimitrispie 2023-06-21 11:31:10 +0300
  • 35e42a86ed Merge pull request 'Update step15_5.sql' (#313) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-21 10:26:16 +0200
  • 74cb060bfe Update step15_5.sql dimitrispie 2023-06-21 11:24:06 +0300
  • 85e016df17 Merge pull request 'Update step16-createIndicatorsTables.sql' (#312) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-21 09:52:33 +0200
  • a475cfcb7b Update step16-createIndicatorsTables.sql dimitrispie 2023-06-21 10:42:02 +0300
  • 979cf9cd87 Merge pull request 'Update step15.sql' (#311) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-21 09:20:01 +0200
  • 4648cd88d4 Update step15.sql dimitrispie 2023-06-21 10:02:19 +0300
  • 94d2573c77 Update step15.sql dimitrispie 2023-06-21 09:22:39 +0300
  • 0561362de2 Merge pull request 'Update step20-createMonitorDB_institutions.sql' (#309) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-20 15:07:09 +0200
  • 50d7dc0078 [graph enrichment] fixed projectOrganizationPath not being passed to the apply_resulttoorganization_propagation node Claudio Atzori 2023-06-19 15:42:44 +0200
  • fbd9bf704e indent Claudio Atzori 2023-06-19 15:41:22 +0200
  • 758e662ab8 Revert "REmove duplicated code and ensure that load and initialization is done through "DedupConfig.load" method" Giambattista Bloisi 2023-06-19 13:08:10 +0200
  • 485f9d18cb REmove duplicated code and ensure that load and initialization is done through "DedupConfig.load" method Giambattista Bloisi 2023-06-19 13:00:02 +0200
  • 6210f6ee48 Merge pull request 'Precompile blacklists patterns before evaluating clustering criteria' (#1) from optimized-clustering into master Claudio Atzori 2023-06-19 12:43:49 +0200
  • be2caedb04 Update step20-createMonitorDB_institutions.sql dimitrispie 2023-06-19 12:12:17 +0300
  • 36e0a8fec4 Changes to Promotion Stats WF dimitrispie 2023-06-19 09:44:34 +0300
  • b0ade43608 Precompile blacklists patterns before evaluating clustering criteria; Enable Junit 5 tests in maven builds; Make path comparisons platform-independent; Read String resource files assuming they are encoded in UTF-8; Fix a few test conditions Giambattista Bloisi 2023-06-16 09:41:11 +0200
  • 4c770a5e29 Update finalizeImpalaCluster.sh dimitrispie 2023-06-15 13:25:37 +0300
  • e06d962a6a Update step15.sql dimitrispie 2023-06-15 12:20:35 +0300
  • afcad08396 Update step20-createMonitorDB_institutions.sql dimitrispie 2023-06-15 10:28:49 +0300
  • b9748763e2 Merge pull request '[stats wf] Bug fixes' (#308) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-14 21:57:03 +0200
  • 42b8ce2ba4 Update copyDataToImpalaCluster.sh dimitrispie 2023-06-14 19:23:42 +0300
  • 2032b0df40 Bug fixes dimitrispie 2023-06-14 19:09:09 +0300
  • a92206dab5 re-added the name of a column (pid) Michele Artini 2023-06-13 11:43:10 +0200
  • b76a47b103 [aggregator graph] added column alias when mapping organization PIDs from the OpenOrgs database Claudio Atzori 2023-06-13 11:38:10 +0200
  • 744a61a030 depending on dhp-schema:3.17.1 Claudio Atzori 2023-06-12 13:49:44 +0200
  • 2e4616a251 Merge pull request '[graph cleaning] pid cleaning' (#307) from pid_cleaning into beta Claudio Atzori 2023-06-12 13:32:29 +0200
  • d6a8b24711 Merge branch 'beta' into pid_cleaning Claudio Atzori 2023-06-12 13:32:22 +0200
  • fdbfb25614 Merge pull request 'update sql query to return distinct pids [beta]' (#306) from distinct_pids_from_openorgs_beta into beta Claudio Atzori 2023-06-12 09:59:00 +0200
  • ad04f14b81 Merge branch 'beta' into distinct_pids_from_openorgs_beta Claudio Atzori 2023-06-12 09:58:21 +0200
  • a98e6591e2 Merge pull request 'propagation of projects through parent-child relations' (#299) from propagationProjectThroughParentChils into beta Claudio Atzori 2023-06-12 09:57:20 +0200
  • 55f002f1e9 Merge branch 'beta' into propagationProjectThroughParentChils Claudio Atzori 2023-06-12 09:56:53 +0200
  • daa21ddbb5 Merge pull request '[aggregator graph] validation for URLs from oaf:fulltext' (#298) from fulltext_url_validation into beta Claudio Atzori 2023-06-12 09:55:35 +0200
  • 4b00a76271 Merge branch 'beta' into fulltext_url_validation Claudio Atzori 2023-06-12 09:55:25 +0200
  • eb2fa8556b Merge pull request 'removeTaggingCondition' (#297) from removeTaggingCondition into beta Claudio Atzori 2023-06-12 09:53:05 +0200
  • de225c71cd Merge branch 'beta' into removeTaggingCondition Claudio Atzori 2023-06-12 09:50:40 +0200
  • e1409ffe80 update sql query to return distinct pids Claudio Atzori 2023-06-12 09:47:45 +0200
  • 1d33074fd1 WIP: pid cleaning Claudio Atzori 2023-06-09 16:47:25 +0200
  • d9506035e4 [ZenodoApi] gone back to okhttp3 to send the payload. Miriam Baglioni 2023-06-09 12:05:02 +0200
  • da7b66c542 Merge pull request '[stats wf] Added memory to hive' (#305) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-08 08:58:48 +0200
  • c5f42c7f5b Added memory to hive dimitrispie 2023-06-07 18:18:23 +0300
  • afb76ebf0f Merge pull request '[stats wf] Bug fix on indicators step' (#304) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-07 16:49:09 +0200
  • fa24e2e18f Bug fix on indicators step dimitrispie 2023-06-07 17:43:37 +0300
  • 01c67e697d Merge pull request '[ stats wf] Bug fix' (#303) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-07 14:41:44 +0200
  • 28272c1b0e Bug fix dimitrispie 2023-06-07 15:34:01 +0300
  • d5be6a13e9 Updated officialnmae of pangaea in hostedbymap for Datacite to avoid duplicate entries in the source filter of the portal Alessia Bardi 2023-06-06 14:43:32 +0200
  • 118e72d7db Updated officialnmae of pangaea in hostedbymap for Datacite to avoid duplicate entries in the source filter of the portal Alessia Bardi 2023-06-06 14:39:12 +0200
  • 5befd93d7d test records for Solr indexing Alessia Bardi 2023-06-06 14:34:33 +0200
  • cae92cf811 update sql query to return distinct pids Michele Artini 2023-06-06 14:06:06 +0200
  • 8f651f1225 Merge pull request 'Changes to beta stats wf' (#300) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-06-06 11:41:36 +0200
  • ad07fbf053 Add names to organizations for collaboration indicators dimitrispie 2023-06-02 14:13:10 +0300
  • 2324670714 Split Monitor DBs-Interdisciplinarity indicators dimitrispie 2023-06-02 13:34:16 +0300
  • daf4d7971b refactoring Miriam Baglioni 2023-05-31 18:56:58 +0200
  • 97d72d41c3 finalization of implementation and testing Miriam Baglioni 2023-05-31 18:53:22 +0200
  • 0389b57ca7 added propagation for project to organization Miriam Baglioni 2023-05-31 11:06:58 +0200
  • e45777e7e1 [aggregator graph] added validation for URLs mapped from oaf:fulltext Claudio Atzori 2023-05-26 11:33:42 +0200
  • ebe586b1d1 Impact indicators/Unpaywall dimitrispie 2023-05-26 10:25:28 +0300
  • d6102dd576 Update step16-createIndicatorsTables.sql dimitrispie 2023-05-25 14:52:34 +0300
  • 9097e71853 Added assertion in test Miriam Baglioni 2023-05-24 16:30:53 +0200
  • 9567c13bc3 refactoring Miriam Baglioni 2023-05-24 16:20:05 +0200
  • b64a5eb4a5 Merge branch 'master' of https://code-repo.d4science.org/D-Net/dnet-hadoop Miriam Baglioni 2023-05-24 15:21:58 +0200
  • 34172455d1 [BulkTag] Adding remove constraints to specify when a community must not appear in the context of a result. Miriam Baglioni 2023-05-24 09:56:23 +0200
  • a1b9187039 Fix syntax error on workflow.xml Ilias Kanellos 2023-05-23 17:17:12 +0300
  • 6a7e370a21 Remove unnecessary counts in graph creation Ilias Kanellos 2023-05-23 16:48:58 +0300
  • ec4e010687 End after rankings | Create graph debugged Ilias Kanellos 2023-05-23 16:44:04 +0300
  • 654ffcba60 Merge pull request '[UsageCount] addition of usagecount for Projects and datasources' (#296) from master_datasource_project_usagecounts into master Claudio Atzori 2023-05-22 16:13:24 +0200
  • db625e548d [UsageCount] addition of usagecount for Projects and datasources Claudio Atzori 2023-05-22 15:00:46 +0200
  • 04141fe259 tests for records from D4Science catalogues Alessia Bardi 2023-05-19 14:28:24 +0200
  • a235d2a24a Merge pull request 'Updates to steps related to transfer data to impala cluster' (#295) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-05-18 08:46:15 +0200
  • 86f4f63daf Updates to steps related to transfer data to impala cluster dimitrispie 2023-05-18 09:33:05 +0300
  • 909729a2fc [dedup] tweaking num partitions, minor changes Claudio Atzori 2023-05-17 10:16:22 +0200
  • 38020e242a Merge branch '8172_impact_indicators_workflow' of https://code-repo.d4science.org/D-Net/dnet-hadoop into 8172_impact_indicators_workflow Ilias Kanellos 2023-05-16 17:34:53 +0300
  • 3d69f33c84 Fix selection of columns in graph creation Ilias Kanellos 2023-05-16 17:34:42 +0300
  • 3c38f7ba6f Fix selection of columns in graph creation Ilias Kanellos 2023-05-16 17:32:53 +0300
  • 8ef718c363 Fix workflow application path Serafeim Chatzopoulos 2023-05-16 16:28:48 +0300
  • 26328e2a0d Move job.properties Serafeim Chatzopoulos 2023-05-16 14:39:38 +0300
  • 4eec3e7052 Add jobTracker, nameNode && spark2Lib as global params in oozie wf Serafeim Chatzopoulos 2023-05-15 22:28:48 +0300
  • b83135c252 Add missing kill nodes in workflow.xml Serafeim Chatzopoulos 2023-05-15 19:55:35 +0300
  • 45f2aa0867 Move end node ... at the end in workflow.xml Serafeim Chatzopoulos 2023-05-15 17:52:20 +0300
  • e309688711 Merge pull request 'fix APC affiliation links' (#294) from apc_affiliation into beta Claudio Atzori 2023-05-15 15:47:57 +0200
  • 8acad52a0c Merge branch 'beta' into apc_affiliation Claudio Atzori 2023-05-15 15:47:33 +0200
  • 8a463cc3e8 fixed organization id created when mapping APC affiliations. Factored out ROR constants in dhp-common Claudio Atzori 2023-05-15 15:44:46 +0200
  • 12a57e1f58 Resolve conflicts Serafeim Chatzopoulos 2023-05-15 15:59:51 +0300
  • 82e2a96f51 Resolve conflicts Serafeim Chatzopoulos 2023-05-15 15:53:12 +0300
  • b8e8c959fe Update workflow.xml && job.properties Serafeim Chatzopoulos 2023-05-15 15:50:23 +0300
  • 4a905932a3 Spark properties from job.properties Ilias Kanellos 2023-05-15 15:24:22 +0300
  • 0c314d5e09 Merge pull request 'Update copyDataToImpalaCluster.sh' (#293) from antonis.lempesis/dnet-hadoop:beta into beta Claudio Atzori 2023-05-15 12:05:54 +0200
  • 07818131ef Update documentation Serafeim Chatzopoulos 2023-05-15 13:04:44 +0300