Web Data Science, Fall 2024

The internet makes many kinds of information easy to access. The ability to retrieve, parse, and analyze this information is a valuable skill for data scientists. This course will provide an overview of computational tools and practices for transforming web documents and APIs into data for common research designs.
- Understand the legal and ethical contours of web data access
- Navigate and parse common web data formats like XML and JSON
- Retrieve and automate data extraction from HTML and PDF documents
- Access popular APIs to collect data for common research designs
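
As a rough illustration of the parsing objective above, the sketch below pulls the same field out of a JSON string and an XML string using only Python's standard library. The sample documents, the `posts`/`post`/`title` field names, and the helper function names are all hypothetical, made up for this example rather than taken from the course materials.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical sample payloads; a real project would retrieve these from a
# web page or API instead of hard-coding them.
JSON_TEXT = '{"posts": [{"id": 1, "title": "Hello"}, {"id": 2, "title": "World"}]}'
XML_TEXT = (
    "<posts>"
    "<post id='1'><title>Hello</title></post>"
    "<post id='2'><title>World</title></post>"
    "</posts>"
)


def titles_from_json(text):
    """Parse a JSON string and return the title of each post."""
    data = json.loads(text)
    return [post["title"] for post in data["posts"]]


def titles_from_xml(text):
    """Parse an XML string and return the title of each <post> element."""
    root = ET.fromstring(text)
    return [post.findtext("title") for post in root.iter("post")]


if __name__ == "__main__":
    print(titles_from_json(JSON_TEXT))
    print(titles_from_xml(XML_TEXT))
```

Both helpers return a plain list of strings, so the same downstream analysis code can run regardless of which format the source happened to use.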