Web scraping is a technique for extracting data from web pages using code, particularly when the data is not available to download or from an API. This hands-on intermediate Python workshop will guide participants through the use of the Python requests-html library for working with content on websites. Presenters will also cover ethical considerations, recommended practices, and other approaches for web scraping using Python. No Python installation is required: Google Colab notebooks will be used for coding during the workshop.
Attendees will understand appropriate and productive technique in web scraping situations; be able to use the requests-html Python library for basic web scraping of a static site; and become aware of other Python libraries for web scraping and when they can be useful.
This workshop is part of the Using Programming and Code for Research workshop series for for anyone who wants to get started or learn more about use programming languages like Python, R, or other applications. These tools can help you to collect, manipulate, clean, analyze, and visualize research data or automate many repetitive tasks. If you need personalized assistance with a data analysis, programming, or coding project, consider booking a consultation with one of our librarian-experts. Learn more about our services for programming and coding and for working with data.
All sessions are free to GW students, faculty, staff, and alumni. GW has an institutional commitment to ensuring that all of our programs and events are accessible for all individuals. If you require any accommodations to participate in this event, please contact email@example.com at least 72 business hours (3 business days) prior to the event.