Commit Graph

16 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Andrea Mannocci | 3854e03d10 | added michele notebook | 2021-07-20 12:15:17 +02:00 |
| Andrea Mannocci | 7b948bb29e | new column other_urls being processed | 2021-05-12 16:27:29 +02:00 |
| Andrea Mannocci | 087129a9c5 | new make data with grid download | 2021-04-23 11:03:53 +02:00 |
| Andrea Mannocci | f21a3f7d30 | persisting full urls | 2021-04-22 17:30:50 +02:00 |
| Andrea Mannocci | 34a37942c5 | datetime parsing | 2021-04-15 15:01:34 +02:00 |
| Andrea Mannocci | 35c2ec418f | fixed IP extraction | 2021-03-31 12:56:49 +02:00 |
| Andrea Mannocci | 0bec69ec6d | making uint columns int | 2021-03-29 16:50:39 +02:00 |
| Andrea Mannocci | efc63f88db | optimised memory allocation for dataframe | 2021-03-29 15:57:24 +02:00 |
| Andrea Mannocci | 629d781645 | moved lots of preprocessing under make | 2021-03-25 15:20:06 +01:00 |
| Andrea Mannocci | e4f8fcab0e | pickle in chunks | 2021-03-24 13:29:06 +01:00 |
| Andrea Mannocci | a465bb027d | fixing typo | 2021-03-24 12:24:27 +01:00 |
| Andrea Mannocci | c49ae8fdfd | Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/fake-orcid-analysis | 2021-03-24 12:21:03 +01:00 |
| Andrea Mannocci | 6bf69bb2ab | small changes in the make data file | 2021-03-24 12:21:00 +01:00 |
| Miriam Baglioni | 444fc36bf5 | added encoding information while reading the csv file | 2021-03-24 12:13:03 +01:00 |
| Andrea Mannocci | b5e99701b1 | adding preprocessing with make | 2021-03-23 19:03:37 +01:00 |
| Andrea Mannocci | 9c2a1bb846 | first commit | 2021-03-18 17:43:00 +01:00 |