Tutorials and Books#
This is a guide to many pandas tutorials and books by the community, geared mainly for new users.
Pandas Workout by Reuven Lerner#
https://www.manning.com/books/pandas-workout
This book provides 200 exercises with pandas. From the description:
Pandas Workout hones your pandas skills to a professional-level through two hundred exercises, each designed to strengthen your pandas skills. You’ll test your abilities against common pandas challenges such as importing and exporting, data cleaning, visualization, and performance optimization. Each exercise utilizes a real-world scenario based on real-world data, from tracking the parking tickets in New York City, to working out which country makes the best wines. You’ll soon find your pandas skills becoming second nature—no more trips to StackOverflow for what is now a natural part of your skillset.
Bamboo Weekly by Reuven Lerner#
Bamboo Weekly is a mailing list that provides weekly, hands on data analysis exercises to practice using pandas.
pandas cookbook by Julia Evans#
The goal of this 2015 cookbook (by Julia Evans) is to give you some concrete examples for getting started with pandas. These are examples with real-world data, and all the bugs and weirdness that entails. For the table of contents, see the pandas-cookbook GitHub repository.
Learn pandas by Hernan Rojas#
A set of lesson for new pandas users: https://bitbucket.org/hrojas/learn-pandas
Practical data analysis with Python#
This guide is an introduction to the data analysis process using the Python data ecosystem and an interesting open dataset. There are four sections covering selected topics as munging data, aggregating data, visualizing data and time series.
Exercises for new users#
Practice your skills with real data sets and exercises. For more resources, please visit the main repository.
Modern pandas#
Tutorial series written in 2016 by Tom Augspurger. The source may be found in the GitHub repository TomAugspurger/effective-pandas.
Joyful pandas#
A tutorial written in Chinese by Yuanhao Geng. It covers the basic operations for NumPy and pandas, 4 main data manipulation methods (including indexing, groupby, reshaping and concatenation) and 4 main data types (including missing data, string data, categorical data and time series data). At the end of each chapter, corresponding exercises are posted. All the datasets and related materials can be found in the GitHub repository datawhalechina/joyful-pandas.
Video tutorials#
Pandas: .head() to .tail() (2016) (1:26) GitHub repo
Data analysis in Python with pandas (2016-2018) GitHub repo and Jupyter Notebook
Best practices with pandas (2018) GitHub repo and Jupyter Notebook