
- This event has passed.
Event Information
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage’s source code and sifting through the material to extract desired data.
Web scraping is typically only done when Web APIs are not available. Platforms like Twitter, Reddit, or The New York Times offer APIs to retrieve data. If you want to learn how to use web APIs in Python, see D-Lab’s Python Web APIs workshop.
Requirements: We will assume a basic knowledge of Python. If you’ve taken the D-Lab’s Python Intensive, that should be sufficient.
Prerequisites: D-Lab’s Python Fundamentals introductory series or equivalent knowledge.
GitHub Repository: https://github.com/dlab-berkeley/Python-Web-Scraping
Software Requirements:Installation Instructions for Python Anaconda