Web scraping: doing 20 real web scraping projects

Web scraping: doing 20 real web scraping projects

English | MP4 | AVC 1280×720 | AAC 48KHz 2ch | 4 Hours | 732 MB

Master web scraping python in just 2 hours, most practical python web scraping course

Most important data on the web do not available from API. Could you get the product data from Amazon with API ? Could you get housing data from Airbnb ?. Of course not. So the only way to collect these kind of data is using web scraping. Have data mean have power, you are having that power. Well come to my course on web scraping.

This course teach you step by step, from simple example to real life web scraping project.

Introduce with you what is web scraping and why we need web scraping. The big picture of web scraping or 4 steps to do web scraping. And then we will install all kind of tools which be used in next sections.

  • Step 1 in 4 steps to do web scrape, we do inspecting web page and locating data between HTML source.
  • Step 2 in 4 steps, we do download HTML content.
  • Step 3 in 4 steps, we do create Beautiful Soup object and search for tags contain Information.
  • Step 4 in 4 steps, we do scrape data from tags.
  • Finally we do 20 hand-on web scraping project to collect data from Amazon, Linkedin, Airbnb, Nba, Imbd.

What you will get from this course ?

  • Understand indeed how web scraping work.
  • Hand – on experience by doing 20 real life web scraping projects.
  • Hand – on experience working with Sublime Text, one very cool IDE to working with python
  • Love to do web scraping ?
Table of Contents

Web Scraping Introduction and installation
1 What is web scraping _
2 Big picture about web scraping (or steps to do web scraping)
3 Install python and other packages with anaconda
4 Install sublime text – IDE for python development
5 Install support packages for sublime text
6 Working with Sublime Text

Step _ Inspecting Web Page and Locating Data between HTML Source
7 Inspecting tool introduction
8 Example with inspecting NBA player page
9 Example with inspecting IMDB poster page

Step _ Download HTML Content
10 Download HTML use selenium with chrome driver
11 Download HTML use selenium in headless mode with phantomjs driver

Step _ Create Beautiful Soup Object and Search for Tags Contain Information
12 Introduction about Beautiful Soup and create soup object
13 Search for first tag by name
14 Search for all tags by name
15 Search with tag name and class attribute
16 Search with tag name and other attributes
17 Search with tag name and string inside tag
18 Search parent_ child_ sibling tag
19 Search scope in Beautiful Soup object

Step _ Scrape Data From Tags
20 Scrape for text data
21 Scrape for link (href attribute)
22 Scrape data inside table

Hand on with NBA projects
23 Project scrape name of all basketball player
24 Project get players name and detail link of each and put result to a list
25 Project Part get detail information for one player
26 Project Part get detail information for all player
27 Project part get image for one player
28 Project part get image for all players
29 Project Get list of all couch (exercise)

Hand on with IMDB projects
30 Project part scrape list of best films ever as text
31 Project part scrape list of best film ever with link to detail
32 Project part scrape poster get link contain full size poster
33 Project part scrape poster image for one film
34 Project part scrape poster image for each film in list film from project
35 Project scrape all poster (exercise)
36 Project and scrape list of current popular film and all poster (exercise)

Hand on with AIRBNB projects
37 Project part get list of room available in one page_ inspect strategy
38 Project part get list of room available on onepage
39 Project get list of room available on NY – allpage (exercise)

Hand on with AMAZON projects
40 Project part scrape python books list on one page search_ inspect strategy
41 Project part scrape python books list on one page search_ do coding
42 Project part scrape detail information for one book_ inspect strategy
43 Project part scrape detail information for one book_ coding
44 Project part scrape detail information for a book list_ coding
45 Project scrape most value comment (exercise )
46 Project scrape recently review for specific product (exercise)
47 Project get list name of best seller book (exercise )

Hand on with LINKEDIN projects
48 Project part get list of people with specific skill_ intro
49 Project part get list of people with specific skill_ login script
50 Project part get list of people with specific skill_ search script
51 Project part get list of people with specific skill_ scrape link
52 Project expand project in to page search (exercise)
53 Project get name and skill on one publish profile (exercise)