63 Commits (c6e079d90ba9785dd1fe8b3efdd3ddd6a9e4d267)
 

Author SHA1 Message Date
Andrea Mannocci c6e079d90b renaming notebooks 2 years ago
Andrea Mannocci 9b7c2efc5d Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/data_registries_analysis 2 years ago
Andrea Mannocci f84f5ee093 rerun dedup partitions analysis 2 years ago
miconis efdbc76181 minor changes to fix relative paths 2 years ago
Andrea Mannocci 3353c08405 restructuring the project 2 years ago
Andrea Mannocci abfb626faa datasets updated. new dedup. new partitions 2 years ago
miconis 594ba0e1c7 script to create the csv file basing on the mergerels, generation of the mergerels and the deduplication csv 2 years ago
miconis b31e97f71e script to process ds dumps and create json full dump, ds dedup configuration added 2 years ago
Andrea Mannocci e537f30a32 rerunning notebooks 2 years ago
Miriam Baglioni 6c71bde5f8 OpenDaor, re3data and roar data updated 2 years ago
Andrea Mannocci 2df1b0ca2e new dump from fairsharing 2 years ago
Andrea Mannocci 722a9aa0cf rewiring subject and geo analysis 3 years ago
Andrea Mannocci dff032e2b3 regenerating datasets with proper column names 3 years ago
Andrea Mannocci a8b52c6931 regenerating datasets with proper column names 3 years ago
Andrea Mannocci dca072f654 rewiring notebook for single registry inspection 3 years ago
Andrea Mannocci 98075dbae9 counting duplicates within 3 years ago
Andrea Mannocci cc2c004b9e counting duplicates within 3 years ago
Andrea Mannocci a55db56e2e counting duplicates within 3 years ago
Miriam Baglioni 8f3175f792 last updates 3 years ago
Miriam Baglioni 021c9b4db3 Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis 3 years ago
Miriam Baglioni f8733ffc5f new OpenDOAR with missing field (sorry Andre) 3 years ago
Andrea Mannocci c072a0a90f recreating dataframes 3 years ago
Andrea Mannocci 9dfedb2a7b recreating dataframes 3 years ago
Andrea Mannocci da8f0818df Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/data_registries_analysis 3 years ago
Miriam Baglioni 490b69833a new OpenDOAR with missing field 3 years ago
Andrea Mannocci 02e7ed79a2 recreating dataframes 3 years ago
Miriam Baglioni c0892c676c new mapping with dictionary to list the field name of wrapper elements also for OpenDOAR 3 years ago
Miriam Baglioni cae8426ef7 new mapping with dictionary to list the field name of wrapper elements 3 years ago
Miriam Baglioni 0fefbfd2c8 new mapping 3 years ago
Andrea Mannocci 74bb9edd04 added simple checks for across registrations 3 years ago
Andrea Mannocci 264f527fcb partitioned dup groups 3 years ago
Andrea Mannocci 13f18b5d33 partitioned dup groups 3 years ago
miconis 84b32e5d33 addition of ds dedup with levenshtein distance and 0.9 threshold 3 years ago
Andrea Mannocci 76973593b6 removed old datasets 3 years ago
Andrea Mannocci b62be6dde8 downloaded FS again with metadata block 3 years ago
Andrea Mannocci c5411b0af0 downloaded FS again 3 years ago
Miriam Baglioni 10d08e4251 - 3 years ago
Miriam Baglioni 1c21796be8 - 3 years ago
Miriam Baglioni 56f871dc4d let's try again.... 3 years ago
Andrea Mannocci 7e3d933641 fixed re3data 3 years ago
Miriam Baglioni ca67c1c928 new version of .tsv 3 years ago
Andrea Mannocci b7226dfcc7 renaming fairsharing 3 years ago
Miriam Baglioni b94e693c57 Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis 3 years ago
Miriam Baglioni c4c13fd6f2 Using values instad of None for empty fields 3 years ago
Andrea Mannocci 778882aa64 renaming re3data 3 years ago
Andrea Mannocci 93986fcec6 adding old opendoar dataset which was working fine 3 years ago
Andrea Mannocci faa8fd69cf adding raw fairsharing extended 3 years ago
Andrea Mannocci fffd8be0e0 adding raw fairsharing 3 years ago
Andrea Mannocci 52f10ca94b Merge branch 'master' of https://code-repo.d3science.org/andrea.mannocci/data_registries_analysis 3 years ago
Andrea Mannocci ecf5bd9ad7 addidng raw fairsharing 3 years ago