Andrea Mannocci
|
a8b52c6931
|
regenerating datasets with proper column names
|
3 years ago |
Andrea Mannocci
|
dca072f654
|
rewiring notebook for single registry inspection
|
3 years ago |
Andrea Mannocci
|
98075dbae9
|
counting duplicates within
|
3 years ago |
Andrea Mannocci
|
cc2c004b9e
|
counting duplicates within
|
3 years ago |
Andrea Mannocci
|
a55db56e2e
|
counting duplicates within
|
3 years ago |
Miriam Baglioni
|
8f3175f792
|
last updates
|
3 years ago |
Miriam Baglioni
|
021c9b4db3
|
Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis
|
3 years ago |
Miriam Baglioni
|
f8733ffc5f
|
new OpenDOAR with missing field (sorry Andre)
|
3 years ago |
Andrea Mannocci
|
c072a0a90f
|
recreating dataframes
|
3 years ago |
Andrea Mannocci
|
9dfedb2a7b
|
recreating dataframes
|
3 years ago |
Andrea Mannocci
|
da8f0818df
|
Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/data_registries_analysis
|
3 years ago |
Miriam Baglioni
|
490b69833a
|
new OpenDOAR with missing field
|
3 years ago |
Andrea Mannocci
|
02e7ed79a2
|
recreating dataframes
|
3 years ago |
Miriam Baglioni
|
c0892c676c
|
new mapping with dictionary to list the field name of wrapper elements also for OpenDOAR
|
3 years ago |
Miriam Baglioni
|
cae8426ef7
|
new mapping with dictionary to list the field name of wrapper elements
|
3 years ago |
Miriam Baglioni
|
0fefbfd2c8
|
new mapping
|
3 years ago |
Andrea Mannocci
|
74bb9edd04
|
added simple checks for across registrations
|
3 years ago |
Andrea Mannocci
|
264f527fcb
|
partitioned dup groups
|
3 years ago |
Andrea Mannocci
|
13f18b5d33
|
partitioned dup groups
|
3 years ago |
miconis
|
84b32e5d33
|
addition of ds dedup with levenshtein distance and 0.9 threshold
|
3 years ago |
Andrea Mannocci
|
76973593b6
|
removed old datasets
|
3 years ago |
Andrea Mannocci
|
b62be6dde8
|
downloaded FS again with metadata block
|
3 years ago |
Andrea Mannocci
|
c5411b0af0
|
downloaded FS again
|
3 years ago |
Miriam Baglioni
|
10d08e4251
|
-
|
3 years ago |
Miriam Baglioni
|
1c21796be8
|
-
|
3 years ago |
Miriam Baglioni
|
56f871dc4d
|
let's try again....
|
3 years ago |
Andrea Mannocci
|
7e3d933641
|
fixed re3data
|
3 years ago |
Miriam Baglioni
|
ca67c1c928
|
new version of .tsv
|
3 years ago |
Andrea Mannocci
|
b7226dfcc7
|
renaming fairsharing
|
3 years ago |
Miriam Baglioni
|
b94e693c57
|
Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis
|
3 years ago |
Miriam Baglioni
|
c4c13fd6f2
|
Using values instad of None for empty fields
|
3 years ago |
Andrea Mannocci
|
778882aa64
|
renaming re3data
|
3 years ago |
Andrea Mannocci
|
93986fcec6
|
adding old opendoar dataset which was working fine
|
3 years ago |
Andrea Mannocci
|
faa8fd69cf
|
adding raw fairsharing extended
|
3 years ago |
Andrea Mannocci
|
fffd8be0e0
|
adding raw fairsharing
|
3 years ago |
Andrea Mannocci
|
52f10ca94b
|
Merge branch 'master' of https://code-repo.d3science.org/andrea.mannocci/data_registries_analysis
|
3 years ago |
Andrea Mannocci
|
ecf5bd9ad7
|
addidng raw fairsharing
|
3 years ago |
Miriam Baglioni
|
8d376a54f2
|
fixed openDoar.tsv
|
3 years ago |
Miriam Baglioni
|
96fbaa553e
|
new mapping from the last import in OpenAIRE
|
3 years ago |
Andrea Mannocci
|
6abcd9b142
|
new dedup file
|
3 years ago |
Andrea Mannocci
|
7ab83cbb10
|
starting to analyse overlap
|
3 years ago |
Andrea Mannocci
|
dd6b79e69f
|
each registry has a basic analysis
|
3 years ago |
Miriam Baglioni
|
434fe5ed20
|
Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/registries_analysis
|
3 years ago |
Andrea Mannocci
|
c2943c4818
|
each registry has a basic analysis
|
3 years ago |
Andrea Mannocci
|
be70e8cc74
|
added first dedup results
|
3 years ago |
Miriam Baglioni
|
76ad6143ea
|
added the mapping with hopefully nan values when empty elements in the xml
|
3 years ago |
Andrea Mannocci
|
63661dfb32
|
new notebooks
|
3 years ago |
Andrea Mannocci
|
c6d01322c3
|
added datasets
|
3 years ago |
Andrea Mannocci
|
c052601c90
|
restructured analysis
|
3 years ago |
Andrea Mannocci
|
dcb8dbc4bd
|
first import
|
3 years ago |