- Thanks to Yu-Han Kao, from whom some of these materials are adapted!
Data scraping/web scraping
- Pulling data from the internet (web sites, social media, etc.)
- Involves: crawling/searching, extraction, parsing, reformatting
- Often two general approaches:
- Scrape directly (note: this can be rude, since your program/bot(s) will make requests to their server)
- Use an API!
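- A minimal sketch of the extraction, parsing, and reformatting steps for both approaches, using only Python's standard library. The page content, the JSON payload, and the names `SAMPLE_HTML`, `SAMPLE_API_RESPONSE`, `LinkExtractor`, `scrape_links`, `links_from_api`, and `to_csv` are all made up for illustration. A real scraper would first download the page or call the API, which is left out here so the example runs offline.

```python
import csv
import io
import json
from html.parser import HTMLParser

# Hypothetical page content standing in for a downloaded web page.
SAMPLE_HTML = """
<html><body>
  <ul>
    <li><a href="/posts/1">First post</a></li>
    <li><a href="/posts/2">Second post</a></li>
  </ul>
</body></html>
"""

# Hypothetical JSON payload standing in for an API response.
SAMPLE_API_RESPONSE = (
    '{"posts": ['
    '{"href": "/posts/1", "title": "First post"}, '
    '{"href": "/posts/2", "title": "Second post"}]}'
)


class LinkExtractor(HTMLParser):
    """Collect (href, link text) pairs from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def scrape_links(html):
    """Direct scraping: extract and parse links out of raw HTML."""
    parser = LinkExtractor()
    parser.feed(html)
    return parser.links


def links_from_api(payload):
    """API approach: the data already arrives structured as JSON."""
    data = json.loads(payload)
    return [(post["href"], post["title"]) for post in data["posts"]]


def to_csv(rows):
    """Reformatting: turn (href, title) rows into CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["href", "title"])
    writer.writerows(rows)
    return buffer.getvalue()


links = scrape_links(SAMPLE_HTML)
print(to_csv(links))
```

- In this sketch the API route needs far less parsing code, because the hypothetical server already returns structured data. Direct scraping has to recover that structure from the HTML.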