Skip to main content
  1. /classes/
  2. Classes, Spring 2024/
  3. CS 2370 Spring 2024: Course Site/

cs2370 Notes: 25 Web Scraping

·44 words·1 min·
import requests

resp = requests.get("https://homework.quest/")
resp.status_code
resp.raise_for_status()
resp.text

Scraping Wikipedia:

  • Trying to use regex
  • main
import requests
import bs4

resp = requests.get("https://homework.quest/")
resp.status_code
resp.raise_for_status()
resp.text
tree = bs4.BeautifulSoup(resp.text, 'html.parser')
xs = tree.select('a')
for x in xs: print("[", x, "]")
Nat Tuck
Author
Nat Tuck