Outline

In this section and the next, we start with Pandas, the most popular library for data manipulation in Python. It is a very important library, and you will likely be working with it almost every day from now on.

A large share of everyday work with data and data science in Python is done using Pandas. We will learn about the two most important Pandas data types, Series and DataFrame, and how they represent our data.
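
As a first taste of these two types, here is a minimal sketch. The fruit names, prices, and stock counts are made up purely for illustration and do not come from any dataset used later.

```python
import pandas as pd

# A Series is a one-dimensional labeled array: values plus an index.
# (The fruit data below is invented for illustration only.)
prices = pd.Series(
    [1.5, 2.0, 0.75],
    index=["apple", "banana", "cherry"],
    name="price",
)

# A DataFrame is a two-dimensional table; each column is a Series,
# and all columns share the same row index.
fruit = pd.DataFrame(
    {"price": [1.5, 2.0, 0.75], "stock": [10, 0, 25]},
    index=["apple", "banana", "cherry"],
)

print(prices["banana"])  # look up a single value by its label
print(fruit["price"])    # selecting one column gives back a Series
print(fruit.shape)       # (number of rows, number of columns)
```

Roughly speaking, a Series behaves like one labeled column, and a DataFrame behaves like a table built from several such columns.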

We will cover the following topics:

  • Intro to Pandas and Pandas data types
  • Viewing and filtering data
  • Merging and grouping data (Merge and GroupBy)
  • Reshaping
  • Apply Functions


We will be using parts of the tutorials and examples from the official documentation. We have changed their order to make the learning flow easier.