February 10 @ 10:00 AM - 12:00 PM R Data Wrangling and Manipulation: Part 1 of 2 It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and other useful tasks.
February 10 @ 2:00 PM - 3:30 PM Python Data Wrangling and Manipulation with Pandas: Part 1 of 2 Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis. We will cover: Pandas data structures Loading data Subsetting and filtering Calculating summary statistics Dealing with missing values Merging data sets Creating new variables Basic plotting Exporting data
February 10 @ 2:00 PM - 5:00 PM Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP This three-hour workshop will cover charts in more detail, review pivot tables, and the widely-used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.