Competency: Research

Python Data Wrangling and Manipulation with Pandas

Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with ‘relational’ or ‘labeled’ data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we’ll work with example data and go through the various steps…

R Fundamentals: Part 1 of 4

This interactive workshop series is your complete introduction to programming in R for people with little or no previous programming experience. It covers the basics of using RStudio, creating variables, working with data frames, and starting to analyse your data using summary statistics and data visualization. Registration required.

Analysis and Visualization of Interactions in R

In regression, we are often interested in an interaction, which is the modification or moderation of the effect of an independent variable by another. Understanding interactions involves interpreting the regression coefficients, estimating and testing simple effects and their differences, and visualizing the interaction. This workshop will teach you how to…

LLM Working Group

The LLM Working Group is a community founded to facilitate conversations about Large Language Models (LLMs) and Generative AI within academia. This 4-part series will provide fundamental knowledge of LLMs, and generate conversation about the promises and challenges of LLMs in different facets of academic work. In our fourth session, Researching…

Bash + Git: Introduction

This workshop will start by introducing you to navigating your computer’s file system and basic Bash commands to remove the fear of working with the command line and to give you the confidence to use it to increase your productivity. And then working with Git, a powerful tool for keeping track…

Digital Humanities Working Group

Calling all digital humanities enthusiasts! End-of-the-year celebration! Lightning talk for our fifth DHWG meeting: “Tracking Microchanges: On the Evolution of the Novelistic Scene” – Nicholas Paige, Professor of French, presents an analytical examination of “scenes” in the novel from the early 1800s. If you are interested in…

Python Fundamentals: Part 3 of 3

This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application. The…

Python Fundamentals: Part 2 of 3

This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application. The…

Python Fundamentals: Part 1 of 3

This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application. The…

R Fundamentals: Part 4 of 4

This interactive workshop series is your complete introduction to programming in R for people with little or no previous programming experience. It covers the basics of using RStudio, creating variables, working with data frames, and starting to analyse your data using summary statistics and data visualization. After completing…

R Fundamentals: Part 3 of 4

This interactive workshop series is your complete introduction to programming in R for people with little or no previous programming experience. It covers the basics of using RStudio, creating variables, working with data frames, and starting to analyse your data using summary statistics and data visualization. After completing…

R Fundamentals: Part 2 of 4

This interactive workshop series is your complete introduction to programming in R for people with little or no previous programming experience. It covers the basics of using RStudio, creating variables, working with data frames, and starting to analyse your data using summary statistics and data visualization. After completing…

R Fundamentals: Part 1 of 4

This interactive workshop series is your complete introduction to programming in R for people with little or no previous programming experience. It covers the basics of using RStudio, creating variables, working with data frames, and starting to analyse your data using summary statistics and data visualization. After completing…

US Census Bureau Restricted-Access Research Data Center (FSRDC) Info Session

Interested in restricted Census or partnering RDC agency (AHRQ, BLS, BEA, NCHS) data use? This one-hour introductory workshop will provide an overview of the Berkeley Federal Statistical Research Data Center, with no prior experience assumed. Attendees will learn about the national RDC network, how to access information online about restricted…

GPT Fundamentals

This workshop offers a general introduction to the GPT (Generative Pretrained Transformers) model. We will explore how they reflect and shape our cultural narratives and social interactions, and which drawbacks and constraints they have. We will explore the transformer architecture upon which GPTs are built, how GPTs encode natural…

Python Machine Learning Fundamentals: Part 2 of 2

This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Python Machine Learning Fundamentals: Part 1 of 2

This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Survey Fundamentals

This two-hour workshop offers a comprehensive introduction to designing and conducting survey studies. Tailored for beginners, it provides clear, step-by-step guidance complemented by concise examples, practical considerations, and useful support materials. Participants will learn the entire process, from formulating a research question to creating, administering, and analyzing surveys, as well…

Python Text Analysis: Word Embeddings

How can we use neural networks to create meaningful representations of words? The bag-of-words is limited in its ability to characterize text, because it does not utilize word context. In this part, we study word embeddings, which were among the first attempts to use neural networks to develop numerical representations…

Python Text Analysis: Topic Modeling

Topic Modeling. How do we identify topics within a corpus of documents? In this part, we study unsupervised learning of text data. Specifically, we use topic models such as Latent Dirichlet Allocation and Non-negative Matrix Factorization to construct “topics” in text from the statistical regularities in the data.