Welcome to Reproducible Data Science

Welcome to the textbook Reproducible Data Science with Open-Source Python Tools and Real-World Data by Valentin Danchev. The textbook uses real-world social data sets related to the COVID-19 pandemic to provide an accessible introduction to open, reproducible, and ethical data analysis using hands-on Python coding, modern open-source computational tools, and data science techniques. Topics include open reproducible research workflows, data wrangling, exploratory data analysis, data visualisation, pattern discovery (e.g., clustering), prediction and machine learning, causal inference, and network analysis.

Contact

Valentin Danchev
Lecturer in Computational Social Science
Department of Sociology
University of Essex
valentin.danchev@essex.ac.uk
@valdanchev
@valdanchev

License

License: CC BY-SA 4.0

Reproducible Data Science with Open-Source Python Tools and Real-World Data by Valentin Danchev is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.