Python: Introduction to Pandas

Objective

This section of the Python pre-work is intended to be an introduction to Pandas. Explore the "Pandas Resources" links below for more advanced options.

Introduction

What is Pandas?

Pandas is a Python library for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series.

Pandas provides fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python.

Pandas - the name

The name is derived from the term "Panel data", an econometrics term for multidimensional structured data sets.

What is a Pandas data frame?

DataFrame is a 2-dimensional labeled data structure with columns of potentially different types. You can think of it like a spreadsheet or SQL table, or a dict of Series objects. It is generally the most commonly used pandas object.

Pandas Resources

Chris Albon (Data Wrangling section)

Chris Albon (Missing Data)

Greg Reda (Intro to Pandas Data Structures)

Things in Pandas I Wish I Had Known Sooner

💥 Exercises 💥

Part 1: Review Pandas Example

Review the Jupyter Notebook 1_pandas_jeopardy_example.ipynb which uses the jeopardy.csv data.

Part 2: Try Pandas

Open the Jupyter Notebook 2_pandas_olive_questions.ipynb which uses the olive.csv data.
Make a copy of the notebook and name it 3_pandas_olive_answers_myname.ipynb. (Example: my notebook would be called 3_pandas_olive_answers_reshama.ipynb).
Update the header at the top by adding in your name and date.
Edit this notebook and complete the exercises.

Optional Pandas Topics: To Explore Further

groupby objects
applying functions
indexing
conditional selecting; filtering
selecting rows and columns: .loc, .iloc, .ix
working with missing data: Null, NaN, None
sorting
merge, join

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

05c-python_pandas.md

05c-python_pandas.md

Python: Introduction to Pandas

Objective

Introduction

What is Pandas?

Pandas - the name

What is a Pandas data frame?

Pandas Resources

Chris Albon (Data Wrangling section)

Greg Reda (Intro to Pandas Data Structures)

Things in Pandas I Wish I Had Known Sooner

💥 Exercises 💥

Part 1: Review Pandas Example

Part 2: Try Pandas

Optional Pandas Topics: To Explore Further

Files

05c-python_pandas.md

Latest commit

History

05c-python_pandas.md

File metadata and controls

Python: Introduction to Pandas

Objective

Introduction

What is Pandas?

Pandas - the name

What is a Pandas data frame?

Pandas Resources

Chris Albon (Data Wrangling section)

Greg Reda (Intro to Pandas Data Structures)

Things in Pandas I Wish I Had Known Sooner

💥 Exercises 💥

Part 1: Review Pandas Example

Part 2: Try Pandas

Optional Pandas Topics: To Explore Further