Welcome to Reproducible Data Science

Welcome to the textbook Reproducible Data Science: Accessible Data Analysis with Open Source Python Tools and Real-World Data by Valentin Danchev. The textbook uses real-world social data sets related to the COVID-19 pandemic to provide an accessible introduction to open, reproducible, and ethical data analysis through hands-on Python coding, modern open-source computational tools, and data science techniques. Topics include open reproducible research workflows, data wrangling, exploratory data analysis, data visualisation, pattern discovery (e.g., clustering), prediction and machine learning, causal inference, and network analysis.


Valentin Danchev
Lecturer in Computational Social Science
Department of Sociology
University of Essex


Reproducible Data Science with Python on the Cloud by Valentin Danchev is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.