2020/02/09

## virtualenv: pycas2020

Good Way to Learn Python!

But Who Actually Reads These from A to Z?

(spoiler: not me)

me and my programming books


The Main Things to Know in a Web Scraping Project:

  • Is it worth the trouble?
  • Is it ethical?
  • Tools available in BeautifulSoup and requests
  • What to look for in HTML code
  • Parsing JSON objects with the json module
  • Rudimentary pandas skills
  • <pro-tip> All you need to know about HTML is how tags work </pro-tip>
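
As a rough illustration of the JSON item in the list above, here is a minimal sketch that uses only the standard-library json module. The payload, its field names (stores, name, city), and the helper parse_stores are hypothetical and made up for this example; a real site will return its own structure.

```python
import json

# Hypothetical payload, standing in for the text of a response from a real site.
RAW = '{"stores": [{"name": "Store A", "city": "Columbia"}, {"name": "Store B", "city": "Charleston"}]}'


def parse_stores(raw):
    """Turn a JSON string into a list of (name, city) tuples."""
    data = json.loads(raw)  # str -> nested dicts and lists
    return [(store["name"], store["city"]) for store in data["stores"]]


rows = parse_stores(RAW)
for name, city in rows:
    print(name, city)
```

A flat list like rows is the kind of structure that loads cleanly into a pandas DataFrame for further work.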

What to Look for in a Scraping Project:

  • A sizeable amount of structured data in a regular, repeatable format.
  • Identical formatting is not required, but the more edge cases there are, the more complicated the scraping will be.
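
One possible way to gauge how regular a data set is before committing to a scrape is to count the distinct "shapes" its records take. The sketch below assumes a handful of records have already been pulled into Python dicts; the sample records and the helper name shape_counts are hypothetical.

```python
from collections import Counter

# Hypothetical sample of scraped records; the third one is an edge case.
records = [
    {"name": "Store A", "address": "1 Main St", "phone": "555-0100"},
    {"name": "Store B", "address": "2 Oak Ave", "phone": "555-0101"},
    {"name": "Store C", "address": "3 Elm Rd"},  # no phone listed
]


def shape_counts(items):
    """Count how many records share each set of field names."""
    return Counter(tuple(sorted(item)) for item in items)


for shape, count in shape_counts(records).most_common():
    print(count, shape)
```

Each extra shape in the output is an edge case the scraper will have to handle, so a short list here suggests simpler parsing code.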

Ethics in Scraping

Accessing vast troves of information can be intoxicating.

Just because it’s possible doesn’t mean it should be done.

Legal Considerations

Dollar Stores are Taking Over the World!