September 25, 2023 @ 10:00 AM - 12:00 PM R Data Wrangling and Manipulation: Parts 1-2 Online via Zoom It is often said that 80% of data analysis is spent cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and perform other useful tasks. Prerequisites: D-Lab’s R Fundamentals or equivalent knowledge; previous experience with base R is assumed. Workshop Materials: https://github.com/dlab-berkeley/R-wrang Software Requirements: Installation Instructions for R and RStudio
September 25, 2023 @ 2:00 PM - 5:00 PM Python Text Analysis Fundamentals: Part 1 Online via Zoom This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications. Part 1: Preprocessing Text. How do we standardize and clean text documents? Text data is noisy, and we often need to develop a pipeline that standardizes the data to better facilitate computational modeling. In the first part of this workshop, we walk through possible steps in this pipeline, using tools from basic Python, NLTK, and spaCy to preprocess and tokenize our data. Part 2: Bag-of-words Representations. How do we convert text into a representation that we can operate on computationally? This requires developing a numerical representation of the text. In this part of the workshop, we study one of the foundational numerical representations of text data: the bag-of-words model. This model relies heavily on word frequencies to characterize text corpora. We build bag-of-words models and their variations (e.g., TF-IDF), and use these representations to perform classification on text. To continue with Text Analysis, sign up for Topic Modeling or Word Embeddings. Part 3: Topic Modeling. How do we identify topics within a corpus of documents? In this part, we study unsupervised learning of text data. Specifically, we use topic models such as Latent Dirichlet Allocation and Non-negative Matrix Factorization to construct “topics” in text from the statistical regularities in the data. Part 4: Word Embeddings. How can we use neural networks to create meaningful representations of words? The bag-of-words model is limited in its ability to characterize text because it does not use word context. In this part, we study word embeddings, which were among the first attempts to use neural networks to develop numerical representations of text that incorporate context. We learn how to use the package gensim to construct and explore word embeddings of text. The first two parts are taught as a joint series. Parts 3 and 4 can be attended "a la carte"; however, prior knowledge of Parts 1 and 2 is assumed. Prerequisites: D-Lab’s Python Fundamentals introductory series or equivalent knowledge. Workshop Materials: https://github.com/dlab-berkeley/Python-Text-Analysis Software Requirements: Installation Instructions for Python Anaconda
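For readers wondering what the bag-of-words and TF-IDF representations mentioned in Part 2 look like in practice, the following is a minimal sketch using only the Python standard library. It is not taken from the workshop materials: the function names (`tokenize`, `bag_of_words`, `tf_idf`) and the two sample sentences are hypothetical, the tokenizer is deliberately crude, and the weighting shown is the plain term frequency times log(N / document frequency) variant. Libraries such as scikit-learn apply a smoothed version by default, so their numbers will differ.

```python
import math
from collections import Counter

PUNCTUATION = ".,!?;:\"'()"

def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (tok.strip(PUNCTUATION) for tok in text.lower().split())
    return [tok for tok in tokens if tok]

def bag_of_words(docs):
    """Return a sorted vocabulary and one word-count vector per document."""
    tokenized = [tokenize(doc) for doc in docs]
    vocab = sorted({tok for doc in tokenized for tok in doc})
    counts = [Counter(doc) for doc in tokenized]
    vectors = [[count[word] for word in vocab] for count in counts]
    return vocab, vectors

def tf_idf(vectors):
    """Weight raw counts by term frequency times log(N / document frequency)."""
    n_docs = len(vectors)
    n_terms = len(vectors[0]) if vectors else 0
    # Number of documents in which each vocabulary term appears at least once.
    doc_freq = [sum(1 for vec in vectors if vec[j] > 0) for j in range(n_terms)]
    idf = [math.log(n_docs / doc_freq[j]) for j in range(n_terms)]
    weighted = []
    for vec in vectors:
        total = sum(vec) or 1  # avoid dividing by zero for an empty document
        weighted.append([(vec[j] / total) * idf[j] for j in range(n_terms)])
    return weighted

# Hypothetical two-document corpus, for illustration only.
docs = ["The cat sat on the mat.", "The dog sat on the log."]
vocab, vectors = bag_of_words(docs)
weights = tf_idf(vectors)

for word, count, weight in zip(vocab, vectors[0], weights[0]):
    print(f"{word:>4}  count={count}  tf-idf={weight:.3f}")
```

Words that appear in every document (here "the", "on", and "sat") receive a weight of zero under this variant, which illustrates why TF-IDF is often preferred over raw counts when the goal is to tell documents apart.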
September 25, 2023 @ 4:00 PM - 5:00 PM UndocuGrads: Check-in Group Meetings The Inclusive Excellence Hub 2515 Channing Way, Berkeley Join GradPro's UndocuGrads check-in group! This group serves as a dedicated, identity-safe space where undocumented graduate students meet regularly during the semester with other grad students. GradPro facilitators guide the check-ins, fostering an informal yet supportive community. Each member describes the progress they have made since the previous meetup, as well as any challenges they've encountered. The group then offers support and advice. Check-ins are coordinated by GradPro of the Graduate Division in partnership with OGD!
September 25, 2023 @ 4:00 PM - 6:00 PM UndocuGrads Celebrates Hispanic/Latinx Heritage Month The Inclusive Excellence Hub 2515 Channing Way, Berkeley We invite undocumented graduate and undergraduate students on campus to get to know each other, network, and celebrate the intersection of being undocumented and Hispanic/Latinx through a short history presentation. RSVP here! We also hope to provide a helpful mentorship environment for all and to learn from community members what they want to get out of UndocuGrads this semester and how we can support each other. All are welcome, and we hope you are able to join us!