
- This event has passed.
Event Information
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with ‘relational’ or ‘labeled’ data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we’ll work with example data and go through the various steps you might need to prepare data for analysis.
We will cover:
- Pandas data structures
- Loading data
- Subsetting and filtering
- Calculating summary statistics
- Dealing with missing values
- Merging data sets
- Creating new variables
- Basic plotting
- Exporting data
Prerequisites: D-Lab’s Python Fundamentals introductory series or equivalent knowledge.
GitHub Repository: https://github.com/dlab-berkeley/Python-Data-Wrangling
Software Requirements:Installation Instructions for Python Anaconda
Register: Log in via CalNet