Commit Graph

42 Commits

Author SHA1 Message Date
Andrea Mannocci 3854e03d10 added michele notebook 2021-07-20 12:15:17 +02:00
Andrea Mannocci 7b948bb29e new column other_urls being processed 2021-05-12 16:27:29 +02:00
Andrea Mannocci 9739054a74 new try with supervisioned ML 2021-04-29 18:50:02 +02:00
Andrea Mannocci 087129a9c5 new make data with grid download 2021-04-23 11:03:53 +02:00
Andrea Mannocci f21a3f7d30 persisting full urls 2021-04-22 17:30:50 +02:00
Andrea Mannocci 1796052086 urls vs grid.ac 2021-04-21 18:29:11 +02:00
Andrea Mannocci 42ff175d05 urls vs grid.ac 2021-04-21 18:28:38 +02:00
Andrea Mannocci c7f7a9a62e updated requirements 2021-04-21 16:51:59 +02:00
Andrea Mannocci 6672d5b28b updated requirements 2021-04-21 16:51:29 +02:00
Andrea Mannocci 535b9d2201 exploring dates and their difference 2021-04-19 19:02:32 +02:00
Andrea Mannocci 0343147b4e exploring dates and their difference 2021-04-19 19:01:38 +02:00
Andrea Mannocci ac0466cc8b dup bios datetime analysis 2021-04-15 15:03:33 +02:00
Andrea Mannocci 34a37942c5 datetime parsing 2021-04-15 15:01:34 +02:00
Andrea Mannocci 35c2ec418f fixed IP extraction 2021-03-31 12:56:49 +02:00
Andrea Mannocci 51e479c287 added study of education and employment 2021-03-30 17:39:05 +02:00
Andrea Mannocci 0bec69ec6d making uint columns int 2021-03-29 16:50:39 +02:00
Andrea Mannocci 25d225dd5f importing mirima analysis to main notebook 2021-03-29 15:58:37 +02:00
Andrea Mannocci efc63f88db optimised memory allocation for dataframe 2021-03-29 15:57:24 +02:00
Andrea Mannocci 83e2005c0e first tries with rudimental ML 2021-03-26 16:01:46 +01:00
Miriam Baglioni 31209807a8 part of the exploratory analysis related to the authoritative works source 2021-03-26 15:14:54 +01:00
Andrea Mannocci 8288d877fa first tries with rudimental ML 2021-03-26 09:16:11 +01:00
Andrea Mannocci 8e159607ea moved lots of preprocessing under make 2021-03-25 16:35:30 +01:00
Andrea Mannocci 629d781645 moved lots of preprocessing under make 2021-03-25 15:20:06 +01:00
Andrea Mannocci 9433fbe46d read pickles in chunks 2021-03-24 13:33:01 +01:00
Andrea Mannocci e4f8fcab0e pickle in chunks 2021-03-24 13:29:06 +01:00
Andrea Mannocci a465bb027d fixing typo 2021-03-24 12:24:27 +01:00
Andrea Mannocci c49ae8fdfd Merge branch 'master' of https://code-repo.d4science.org/andrea.mannocci/fake-orcid-analysis 2021-03-24 12:21:03 +01:00
Andrea Mannocci 6bf69bb2ab small changes in the make data file 2021-03-24 12:21:00 +01:00
Miriam Baglioni 46aaf4655b merging with master 2021-03-24 12:17:01 +01:00
Miriam Baglioni 444fc36bf5 added encoding information while reading the csv file 2021-03-24 12:13:03 +01:00
Andrea Mannocci 5537705192 a few optimisations by creating variables so to make operations once before charting results 2021-03-24 12:06:27 +01:00
Andrea Mannocci b5e99701b1 adding preprocessing with make 2021-03-23 19:03:37 +01:00
Andrea Mannocci 4e30d743d8 adding req modules 2021-03-23 19:02:07 +01:00
Andrea Mannocci 41f2dab89d moving on with keywords 2021-03-23 12:13:04 +01:00
Andrea Mannocci a791a563c1 better top_n function 2021-03-23 10:20:43 +01:00
Andrea Mannocci 718c8f724c better top_n function 2021-03-23 10:20:23 +01:00
Andrea Mannocci 7c0af34f1b better top_n function 2021-03-23 09:47:47 +01:00
Andrea Mannocci 8a42ca76ca better top_n function 2021-03-23 09:35:35 +01:00
Andrea Mannocci aadf84b957 rewrite initial preprocessing 2021-03-22 22:40:12 +01:00
Andrea Mannocci 7891e57082 progress with analysis 2021-03-22 19:08:20 +01:00
Andrea Mannocci bed009581a typo 2021-03-19 12:19:45 +01:00
Andrea Mannocci 9c2a1bb846 first commit 2021-03-18 17:43:00 +01:00