50 Commits (a8b52c69313bacabd39df3bf52442a1504c34a40)
 

Author SHA1 Message Date
Andrea Mannocci a8b52c6931 regenerating datasets with proper column names 3 years ago
Andrea Mannocci dca072f654 rewiring notebook for single registry inspection 3 years ago
Andrea Mannocci 98075dbae9 counting duplicates within 3 years ago
Andrea Mannocci cc2c004b9e counting duplicates within 3 years ago
Andrea Mannocci a55db56e2e counting duplicates within 3 years ago
Miriam Baglioni 8f3175f792 last updates 3 years ago
Miriam Baglioni 021c9b4db3 Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis 3 years ago
Miriam Baglioni f8733ffc5f new OpenDOAR with missing field (sorry Andre) 3 years ago
Andrea Mannocci c072a0a90f recreating dataframes 3 years ago
Andrea Mannocci 9dfedb2a7b recreating dataframes 3 years ago
Andrea Mannocci da8f0818df Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/data_registries_analysis 3 years ago
Miriam Baglioni 490b69833a new OpenDOAR with missing field 3 years ago
Andrea Mannocci 02e7ed79a2 recreating dataframes 3 years ago
Miriam Baglioni c0892c676c new mapping with dictionary to list the field name of wrapper elements also for OpenDOAR 3 years ago
Miriam Baglioni cae8426ef7 new mapping with dictionary to list the field name of wrapper elements 3 years ago
Miriam Baglioni 0fefbfd2c8 new mapping 3 years ago
Andrea Mannocci 74bb9edd04 added simple checks for across registrations 3 years ago
Andrea Mannocci 264f527fcb partitioned dup groups 3 years ago
Andrea Mannocci 13f18b5d33 partitioned dup groups 3 years ago
miconis 84b32e5d33 addition of ds dedup with levenshtein distance and 0.9 threshold 3 years ago
Andrea Mannocci 76973593b6 removed old datasets 3 years ago
Andrea Mannocci b62be6dde8 downloaded FS again with metadata block 3 years ago
Andrea Mannocci c5411b0af0 downloaded FS again 3 years ago
Miriam Baglioni 10d08e4251 - 3 years ago
Miriam Baglioni 1c21796be8 - 3 years ago
Miriam Baglioni 56f871dc4d let's try again.... 3 years ago
Andrea Mannocci 7e3d933641 fixed re3data 3 years ago
Miriam Baglioni ca67c1c928 new version of .tsv 3 years ago
Andrea Mannocci b7226dfcc7 renaming fairsharing 3 years ago
Miriam Baglioni b94e693c57 Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis 3 years ago
Miriam Baglioni c4c13fd6f2 Using values instad of None for empty fields 3 years ago
Andrea Mannocci 778882aa64 renaming re3data 3 years ago
Andrea Mannocci 93986fcec6 adding old opendoar dataset which was working fine 3 years ago
Andrea Mannocci faa8fd69cf adding raw fairsharing extended 3 years ago
Andrea Mannocci fffd8be0e0 adding raw fairsharing 3 years ago
Andrea Mannocci 52f10ca94b Merge branch 'master' of https://code-repo.d3science.org/andrea.mannocci/data_registries_analysis 3 years ago
Andrea Mannocci ecf5bd9ad7 addidng raw fairsharing 3 years ago
Miriam Baglioni 8d376a54f2 fixed openDoar.tsv 3 years ago
Miriam Baglioni 96fbaa553e new mapping from the last import in OpenAIRE 3 years ago
Andrea Mannocci 6abcd9b142 new dedup file 3 years ago
Andrea Mannocci 7ab83cbb10 starting to analyse overlap 3 years ago
Andrea Mannocci dd6b79e69f each registry has a basic analysis 3 years ago
Miriam Baglioni 434fe5ed20 Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis 3 years ago
Andrea Mannocci c2943c4818 each registry has a basic analysis 3 years ago
Andrea Mannocci be70e8cc74 added first dedup results 3 years ago
Miriam Baglioni 76ad6143ea added the mapping with hopefully nan values when empty elements in the xml 3 years ago
Andrea Mannocci 63661dfb32 new notebooks 3 years ago
Andrea Mannocci c6d01322c3 added datasets 3 years ago
Andrea Mannocci c052601c90 restructured analysis 3 years ago
Andrea Mannocci dcb8dbc4bd first import 3 years ago