Course Outline

Introduction

Setting up the Development Environment

Python Primer: Data Structures, Conditionals, File Handling, etc.

Python Packages for Web Scraping: Scrapy and BeautifulSoup

How a Website Works

How HTML is Structured

Making a Web Request

Scraping an HTML Page

Working with XPath and CSS

Filtering Data Using Regular Expressions

Creating a Web Crawler

Crawling AJAX and JavaScript Pages with Selenium

Web Scraping Best Practices

Troubleshooting

Summary and Conclusion

Requirements

  • Programming experience, preferably in Python. If participants have programming experience in a language other than Python, the training can be extended to include more introductory Python exercises.

Audience

  • Developers

Duration

  7 hours

Related Courses

Scaling Data Analysis with Python and Dask

  14 hours

Data Analysis with Python, Pandas, and Numpy

  14 hours

Accelerating Python Pandas Workflows with Modin

  14 hours

Machine Learning with Python and Pandas

  14 hours

FARM (FastAPI, React, and MongoDB) Full Stack Development

  14 hours

Developing APIs with Python and FastAPI

  14 hours

Web application development with Flask

  14 hours

Advanced Flask

  14 hours

Build REST APIs with Python and Flask

  14 hours

Kivy: Building Android Apps with Python

  7 hours

Game Development with PyGame

  7 hours

GUI Programming with Python and PyQt

  21 hours

Scientific Computing with Python SciPy

  7 hours

GUI Programming with Python and Tkinter

  14 hours

Web Development with Web2Py

  28 hours