63 Commits (master)
 

Author SHA1 Message Date
Andrea Mannocci c6e079d90b renaming notebooks 2 months ago
Andrea Mannocci 9b7c2efc5d Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/data_registries_analysis 3 months ago
Andrea Mannocci f84f5ee093 rerun dedup partitions analysis 3 months ago
miconis efdbc76181 minor changes to fix relative paths 3 months ago
Andrea Mannocci 3353c08405 restructuring the project 3 months ago
Andrea Mannocci abfb626faa datasets updated. new dedup. new partitions 3 months ago
miconis 594ba0e1c7 script to create the csv file basing on the mergerels, generation of the mergerels and the deduplication csv 3 months ago
miconis b31e97f71e script to process ds dumps and create json full dump, ds dedup configuration added 3 months ago
Andrea Mannocci e537f30a32 rerunning notebooks 3 months ago
Miriam Baglioni 6c71bde5f8 OpenDaor, re3data and roar data updated 3 months ago
Andrea Mannocci 2df1b0ca2e new dump from fairsharing 3 months ago
Andrea Mannocci 722a9aa0cf rewiring subject and geo analysis 7 months ago
Andrea Mannocci dff032e2b3 regenerating datasets with proper column names 7 months ago
Andrea Mannocci a8b52c6931 regenerating datasets with proper column names 7 months ago
Andrea Mannocci dca072f654 rewiring notebook for single registry inspection 7 months ago
Andrea Mannocci 98075dbae9 counting duplicates within 7 months ago
Andrea Mannocci cc2c004b9e counting duplicates within 7 months ago
Andrea Mannocci a55db56e2e counting duplicates within 7 months ago
Miriam Baglioni 8f3175f792 last updates 8 months ago
Miriam Baglioni 021c9b4db3 Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis 8 months ago
Miriam Baglioni f8733ffc5f new OpenDOAR with missing field (sorry Andre) 8 months ago
Andrea Mannocci c072a0a90f recreating dataframes 8 months ago
Andrea Mannocci 9dfedb2a7b recreating dataframes 8 months ago
Andrea Mannocci da8f0818df Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/data_registries_analysis 8 months ago
Miriam Baglioni 490b69833a new OpenDOAR with missing field 8 months ago
Andrea Mannocci 02e7ed79a2 recreating dataframes 8 months ago
Miriam Baglioni c0892c676c new mapping with dictionary to list the field name of wrapper elements also for OpenDOAR 8 months ago
Miriam Baglioni cae8426ef7 new mapping with dictionary to list the field name of wrapper elements 8 months ago
Miriam Baglioni 0fefbfd2c8 new mapping 8 months ago
Andrea Mannocci 74bb9edd04 added simple checks for across registrations 8 months ago
Andrea Mannocci 264f527fcb partitioned dup groups 8 months ago
Andrea Mannocci 13f18b5d33 partitioned dup groups 8 months ago
miconis 84b32e5d33 addition of ds dedup with levenshtein distance and 0.9 threshold 8 months ago
Andrea Mannocci 76973593b6 removed old datasets 8 months ago
Andrea Mannocci b62be6dde8 downloaded FS again with metadata block 8 months ago
Andrea Mannocci c5411b0af0 downloaded FS again 8 months ago
Miriam Baglioni 10d08e4251 - 8 months ago
Miriam Baglioni 1c21796be8 - 8 months ago
Miriam Baglioni 56f871dc4d let's try again.... 8 months ago
Andrea Mannocci 7e3d933641 fixed re3data 8 months ago
Miriam Baglioni ca67c1c928 new version of .tsv 8 months ago
Andrea Mannocci b7226dfcc7 renaming fairsharing 8 months ago
Miriam Baglioni b94e693c57 Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis 8 months ago
Miriam Baglioni c4c13fd6f2 Using values instad of None for empty fields 8 months ago
Andrea Mannocci 778882aa64 renaming re3data 8 months ago
Andrea Mannocci 93986fcec6 adding old opendoar dataset which was working fine 8 months ago
Andrea Mannocci faa8fd69cf adding raw fairsharing extended 8 months ago
Andrea Mannocci fffd8be0e0 adding raw fairsharing 8 months ago
Andrea Mannocci 52f10ca94b Merge branch 'master' of https://code-repo.d3science.org/andrea.mannocci/data_registries_analysis 8 months ago
Andrea Mannocci ecf5bd9ad7 addidng raw fairsharing 8 months ago