Web Scraping with Python Training Course
Web scraping is a method used to extract data from websites and store it in local files or databases.
This instructor-led training (delivered online or on-site) is designed for developers looking to automate the process of crawling multiple websites using Python, to gather data for further processing and analysis.
By the end of this course, participants will be able to:
- Set up Python along with all necessary packages.
- Extract and interpret data from various websites.
- Grasp how websites function and their HTML structure.
- Create web crawlers capable of large-scale operations.
- Leverage Selenium for scraping AJAX-driven web pages.
Course Format
- Engaging lectures and discussions.
- Extensive exercises and practice sessions.
- Practical implementation in a live-lab setting.
Customization Options for the Course
- This course presumes prior programming knowledge.
- To request tailored training, please contact us to make arrangements.
Course Outline
Introduction
Setting up the Development Environment
Python Primer: Data Structures, Conditionals, File Handling, etc.
Python Packages for Web Scraping: Scrapy and BeautifulSoup
How a Website Works
How HTML is Structured
Making a Web Request
Scraping an HTML Page
Working with XPath and CSS
Filtering Data Using Regular Expressions
Creating a Web Crawler
Crawling AJAX and JavaScript Pages with Selenium.
Web Scraping Best Practices
Troubleshooting
Summary and Conclusion
Requirements
- Programming experience, preferably in Python. If participants have programming experience in a language other than Python, the training can be extended to include more introductory Python exercises.
Audience
- Developers
Need help picking the right course?
Web Scraping with Python Training Course - Enquiry
Testimonials (1)
Many different examples and topics has been covered, from basic investigation to login management and dynamic page management.
Daniele Tagliaferro - Creditsafe Italia Srl
Course - Web Scraping with Python
Upcoming Courses
Related Courses
Scaling Data Analysis with Python and Dask
14 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at data scientists and software engineers who wish to use Dask with the Python ecosystem to build, scale, and analyze large datasets.
By the end of this training, participants will be able to:
- Set up the environment to start building big data processing with Dask and Python.
- Explore the features, libraries, tools, and APIs available in Dask.
- Understand how Dask accelerates parallel computing in Python.
- Learn how to scale the Python ecosystem (Numpy, SciPy, and Pandas) using Dask.
- Optimize the Dask environment to maintain high performance in handling large datasets.
Data Analysis with Python, Pandas and Numpy
14 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at intermediate-level Python developers and data analysts who wish to enhance their skills in data analysis and manipulation using Pandas and NumPy.
By the end of this training, participants will be able to:
- Set up a development environment that includes Python, Pandas, and NumPy.
- Create a data analysis application using Pandas and NumPy.
- Perform advanced data wrangling, sorting, and filtering operations.
- Conduct aggregate operations and analyze time series data.
- Visualize data using Matplotlib and other visualization libraries.
- Debug and optimize their data analysis code.
Web Development with Django
21 HoursDjango is a sophisticated Python web framework designed to promote quick development and maintain a clear, practical structure.
Target Audience
This course is tailored for developers and engineers aiming to integrate Django into their projects.
Monax: Build a Smart Contract Application
7 HoursIn this instructor-led, live training in the UAE, participants will learn how to build a smart contract blockchain application with Monax.
By the end of this training, participants will be able to:
- Develop and deploy a distributed application using blockchain and smart contract technology.
- Understand design and functionality of 'smart contracts' and how to create one.
- Implement best practices for secure blockchain application development.
- Leverage Monax tools to streamline distributed application development.
FARM (FastAPI, React, and MongoDB) Full Stack Development
14 HoursThis instructor-led, live training in (online or onsite) is aimed at developers who wish to use the FARM (FastAPI, React, and MongoDB) stack to build dynamic, high-performance, and scalable web applications.
By the end of this training, participants will be able to:
- Set up the necessary development environment that integrates FastAPI, React, and MongoDB.
- Understand the key concepts, features, and benefits of the FARM stack.
- Learn how to build REST APIs with FastAPI.
- Learn how to design interactive applications with React.
- Develop, test, and deploy applications (front end and back end) using the FARM stack.
Developing APIs with Python and FastAPI
14 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at developers who wish to use FastAPI with Python to build, test, and deploy RESTful APIs easier and faster.
By the end of this training, participants will be able to:
- Set up the necessary development environment to develop APIs with Python and FastAPI.
- Create APIs quicker and easier using the FastAPI library.
- Learn how to create data models and schemas based on Pydantic and OpenAPI.
- Connect APIs to a database using SQLAlchemy.
- Implement security and authentication in APIs using the FastAPI tools.
- Build container images and deploy web APIs to a cloud server.
Web Application Development with Flask
14 HoursThis hands-on course is designed for Python developers looking to develop and manage their initial web applications. It also caters to individuals who are already acquainted with other web frameworks like Django or Web2py, and wish to understand how employing a microframework—one that integrates third-party libraries rather than offering an all-encompassing solution—affects the development process.
A considerable portion of the course focuses not on Flask itself (given its minimalist nature), but on the various third-party libraries and tools frequently utilized in Flask projects.
Advanced Flask
14 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at developers who wish to use the advanced features of Flask to build scalable web applications on top of MongoDB.
By the end of this training, participants will be able to:
- Set up the necessary development environment to start developing web applications with Flask.
- Get to know the advanced concepts and techniques for real-world Flask projects.
- Build a RESTful API server on top of MongoDB.
- Learn how to containerize, test, and deploy microservices with Flask, Docker, and Amazon EC2.
- Gain some insights on the advanced Flask integrations for scaling web applications.
Kivy: Building Android Apps with Python
7 HoursKivy is an open-source, cross-platform graphical user interface library developed in Python that facilitates multi-touch application development across a variety of devices.
During this instructor-led live training session, participants will learn how to install and deploy Kivy on various platforms, customize and manipulate widgets, schedule and respond to events, modify graphics with multi-touch capabilities, resize screens, package applications for Android, and more.
By the end of this training, participants will be able to
- Connect Python code with the Kivy language.
- Demonstrate a thorough understanding of how Kivy operates and utilize its key components such as widgets, events, properties, graphics, etc.
- Effortlessly develop and deploy Android applications tailored to different business and design needs.
Course format
- A combination of lectures, discussions, exercises, and extensive hands-on practice
Accelerating Python Pandas Workflows with Modin
14 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at data scientists and developers who wish to use Modin to build and implement parallel computations with Pandas for faster data analysis.
By the end of this training, participants will be able to:
- Set up the necessary environment to start developing Pandas workflows at scale with Modin.
- Understand the features, architecture, and advantages of Modin.
- Know the differences between Modin, Dask, and Ray.
- Perform Pandas operations faster with Modin.
- Implement the entire Pandas API and functions.
Game Development with PyGame
7 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at developers who wish to use PyGame to create and build games using Python programming.
By the end of this training, participants will be able to:
- Set up the necessary development environment to start creating game applications with PyGame and Python.
- Learn how to create interactive PyGame applications integrated with animations and multimedia features.
- Run and test game programs with PyGame test suite and convert them into executable files.
GUI Programming with Python and PyQt
21 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at persons who wish to program a visually attractive software application using Python and the Qt UI framework.
By the end of this training, participants will be able to:
- Set up a development environment that includes all needed libraries, packages and frameworks.
- Create a desktop or server application whose user interface functions smoothly and is visually appealing.
- Implement various UI elements and effects, including widgets, charts, layers, etc. to achieve maximum effect in usability.
- Implement good UI design and code organization during the design and development phase.
- Test and debug the application.
Build REST APIs with Python and Flask
14 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at backend developers who wish to build REST APIs with Python and Flask.
By the end of this training, participants will be able to:
- Implement a REST API to allow a Flask web application to read and write to a database in the backend.
- Develop advanced authentication features like refresh tokens.
- Build a reusable backend for future Python projects.
- Simplify storage of data with SQLAlchemy.
- Deploy REST APIs onto a cloud based server.
Scientific Computing with Python SciPy
7 HoursThis instructor-led, live training in the UAE (online or onsite) is aimed at developers who wish to use SciPy to create advanced scientific computing functions with Python.
By the end of this training, participants will be able to:
- Set up the necessary development environment to start creating scientific computing functions.
- Get the full benefit of SciPy features by performing practical examples of complex operations.
- Implement and optimize mathematical algorithms and functions to solve scientific problems.
- Design data structures and interpolation methods for visualization, processing, and analysis.
Web Development with Web2Py
28 HoursWeb2py is a free, open-source full-stack framework built on Python that facilitates the quick creation of fast, scalable, secure, and portable database-driven web applications.
Audience
This course is designed for Engineers and Developers who utilize web2py as their framework for web development.