Commit Graph

165 Commits

Author SHA1 Message Date
Miriam Baglioni e9aca6b702 refactoring 2023-08-04 19:32:16 +02:00
Miriam Baglioni 5fb58362c5 moved parameter file. Added 40| as prefix on projects for computing the delta 2023-08-04 17:18:15 +02:00
Miriam Baglioni 097905171a adding master duplicate to avoid join of relation. Changed the model for the indicators 2023-08-04 16:22:23 +02:00
Miriam Baglioni 6b113961c1 - 2023-07-28 10:26:22 +02:00
Miriam Baglioni a175ac2c7f [dump] refactoring 2023-07-19 09:40:48 +02:00
Miriam Baglioni 2566b97138 [dumpCSV] remove not needed code 2023-07-17 16:28:46 +02:00
Miriam Baglioni 0482648131 merg 2023-07-17 16:24:57 +02:00
Miriam Baglioni 5ff50d115a [dumpCSV] ading double quotes enclosing all the fileds 2023-07-17 16:21:20 +02:00
Miriam Baglioni 81b55dc20b merging with master 2023-07-15 11:15:24 +02:00
Miriam Baglioni 7ccd4e7866 Merge pull request 'Dump per Country' (#3) from dumpSubset into master
Reviewed-on: #3
2023-07-15 11:13:28 +02:00
Miriam Baglioni 25be584028 [dumpSubset] aligned with master 2023-07-15 11:12:27 +02:00
Miriam Baglioni 21a521b97c changed the API to consider the upload only of an already open version 2023-07-15 10:36:54 +02:00
Miriam Baglioni b74d6f1c23 resolved conflict 2023-07-13 18:20:31 +02:00
Miriam Baglioni 787d4d0b4a changed the pom reference to the dho schema 2023-07-13 18:19:12 +02:00
Miriam Baglioni b01573e201 [dumpCSV] removed output directory before starting the jobs 2023-07-12 07:38:53 +02:00
Miriam Baglioni baef25560a [dumpCSV] align pom version with master for graph 2023-07-11 13:47:28 +02:00
Miriam Baglioni 95125d704a [dump] removed usage stats info from the datasource and project 2023-07-11 12:40:21 +02:00
Miriam Baglioni abc30756e4 - 2023-07-07 18:42:41 +02:00
Miriam Baglioni ab791fe424 [master] update reference to ZenodoAPI 2023-07-07 18:12:17 +02:00
Miriam Baglioni 3bfac8bc6e [dumpCSV] addressing the issues fointed out by the Dare Lab people. Repeated relations from author to result due to the author repeated in the data. Repeated relations from result to result due to the same pid present in more that one result. Author table not properly formatted due to the bad formatting of the input data 2023-07-07 18:01:26 +02:00
Miriam Baglioni 9d1b708a89 [dumpCSV] addressing the issues fointed out by the Dare Lab people. Repeated relations from author to result due to the author repeated in the data. Repeated relations from result to result due to the same pid present in more that one result. Author table not properly formatted due to the bad formatting of the input data 2023-07-07 17:44:19 +02:00
Miriam Baglioni 8a44653dbe [DumpCSV] fixing issues 2023-07-05 09:58:55 +02:00
Miriam Baglioni b26fb92838 changed the pom dependency of a different schema 2023-07-01 12:38:18 +02:00
Miriam Baglioni 29b81bef26 refactoring 2023-07-01 11:54:48 +02:00
Miriam Baglioni d53c6850aa Merge pull request 'dump_zenodo_2' (#2) from dump_zenodo_2 into master
Reviewed-on: #2
2023-07-01 11:09:19 +02:00
Miriam Baglioni 3fba247c38 refactoring 2023-07-01 11:07:41 +02:00
Miriam Baglioni 2ac5c4a9ab moved also the model and other linked classes to the dump project 2023-07-01 11:06:41 +02:00
Miriam Baglioni 766288d1c9 Merge branch 'dump_zenodo_2' of https://code-repo.d4science.org/D-Net/dhp-graph-dump into dump_zenodo_2 2023-07-01 10:39:37 +02:00
Miriam Baglioni b9d4d67c72 - 2023-06-30 19:06:15 +02:00
Sandro La Bruzzo d746390b9f new implementatiton with okhttp 2023-06-23 15:15:09 +02:00
Miriam Baglioni 72ead1bd85 added okhttp3 again 2023-06-23 14:16:15 +02:00
Sandro La Bruzzo 6ace388cff fixed method 2023-06-23 14:16:10 +02:00
Sandro La Bruzzo d472050ad4 Added new implementation upload huge file 2023-06-22 17:43:53 +02:00
Sandro La Bruzzo 5d0d14528f Added new implementation upload huge file 2023-06-22 16:54:17 +02:00
Miriam Baglioni e87b790a60 - 2023-06-22 16:54:13 +02:00
Miriam Baglioni 8661bc0c90 aligned with the last version of pom for production 2023-06-02 16:13:18 +02:00
Miriam Baglioni 2e8639f22d added test to verify the dump for indicators at the level of project and datasource. Fixed issue on identifier with the prefix 2023-06-01 15:10:00 +02:00
Miriam Baglioni 32983e90d1 change to the model of the Relation -> flatten: remove the node and add source, sourceType, target, targetType. Adding indicators at the level of Projects and Datasources. Removing the prefix from the identifier of the entities 2023-06-01 12:58:56 +02:00
Miriam Baglioni 2e0999a1df First implementation of the csv dump 2023-05-29 10:16:47 +02:00
Miriam Baglioni f79b9d5c0d [DUMP CSV] sligth modification 2023-05-17 16:58:04 +02:00
Miriam Baglioni 21599598ae [DUMP CSV] test and resources for the result dumps 2023-05-17 16:57:25 +02:00
Miriam Baglioni 66873c1744 [DUMP CSV] Dumping of the results, of the authors and the relationships between results and authors and results and pids 2023-05-17 16:56:28 +02:00
Miriam Baglioni 7563499740 [DUMP CSV] - 2023-05-16 14:29:31 +02:00
Miriam Baglioni f79c06209e [DUMP CSV] test and resources for the SelectResultAndDumpRelation job 2023-05-16 14:21:39 +02:00
Miriam Baglioni 2ed76d4662 [DUMP CSV] tested file to execute the dump of the relations with semantcis Cites from nodes belonging to a selected communities. It also dumps the relationships result_communities and prepare the ground for the dump of the results. 2023-05-16 14:20:45 +02:00
Miriam Baglioni 44a256fc90 [DUMP CSV] refactoring 2023-05-16 14:10:14 +02:00
Miriam Baglioni 636945a5c5 [DUMP CSV] refactoring 2023-05-16 14:09:21 +02:00
Miriam Baglioni b9076f9aa8 [DUMP CSV] model classes to mirror the attributes of the tables to be dumped 2023-05-16 14:06:25 +02:00
Miriam Baglioni acb3c691bc [DUMP CSV] added query and method to get the information to dump in the CSV regarding the selected communities 2023-05-16 14:04:44 +02:00
Miriam Baglioni d0f144d422 first implementation for the dump in csv of the subset of the graph related to specific communities. The only relations considered are the cites. the source must be within the set of communties, the target con be outside => we also have to map nodes not related to the communities of interest. These communities are given as parameter 2023-05-11 16:44:54 +02:00