Web Scraping with Python Training Course
Web scraping is a method used to extract data from websites and save it to local files or databases.
This instructor-led live training, available online or onsite, is designed for developers looking to use Python to automate the process of crawling multiple websites to extract data for processing and analysis.
Upon completing this training, participants will be able to:
- Install and configure Python along with all necessary packages.
- Retrieve and parse data hosted across various websites.
- Comprehend how websites function and the structure of their HTML.
- Develop spiders to crawl the web at scale.
- Utilize Selenium to crawl AJAX-driven web pages.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical sessions.
- Hands-on implementation within a live lab environment.
Customization Options
- Prerequisite programming knowledge is required.
- To request customized training for this course, please contact us to make arrangements.
Course Outline
Introduction
Setting up the Development Environment
Python Primer: Data Structures, Conditionals, File Handling, etc.
Python Packages for Web Scraping: Scrapy and BeautifulSoup
How a Website Works
How HTML is Structured
Making a Web Request
Scraping an HTML Page
Working with XPath and CSS
Filtering Data Using Regular Expressions
Creating a Web Crawler
Crawling AJAX and JavaScript Pages with Selenium
Web Scraping Best Practices
Troubleshooting
Summary and Conclusion
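As an illustration of the outline steps "Scraping an HTML Page" and "Filtering Data Using Regular Expressions", here is a minimal sketch, not taken from the course materials. It uses only the Python standard library (html.parser and re) rather than Scrapy or BeautifulSoup, so that it runs without extra packages. The names LinkCollector, extract_links, and SAMPLE_HTML, along with the sample markup, are hypothetical and chosen for this example only.

```python
import re
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, text) pairs for every <a> element that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (href, text) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open link.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html, pattern=None):
    """Return links found in html, optionally keeping only hrefs matching pattern."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    if pattern is None:
        return parser.links
    regex = re.compile(pattern)
    return [(href, text) for href, text in parser.links if regex.search(href)]


# Hypothetical sample page, used instead of a live web request.
SAMPLE_HTML = """
<html><body>
  <a href="/courses/python-scraping">Web Scraping</a>
  <a href="/about">About</a>
  <a href="/courses/dask">Dask</a>
</body></html>
"""

# Keep only links whose href starts with /courses/.
course_links = extract_links(SAMPLE_HTML, r"^/courses/")
print(course_links)
```

In practice the HTML string would come from a web request (for example via urllib.request.urlopen), and a library such as BeautifulSoup or Scrapy would usually replace the hand-written parser.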
Requirements
- Programming experience, preferably in Python. If participants have programming experience in a language other than Python, the training can be extended to include additional introductory Python exercises.
Audience
- Developers
Testimonials (1)
Many different examples and topics have been covered, from basic investigation to login management and dynamic page management.
Daniele Tagliaferro - Creditsafe Italia Srl
Course - Web Scraping with Python
Related Courses
Advanced Python: Best Practices and Design Patterns
28 Hours
This intensive, hands-on course delves into advanced Python techniques, engineering best practices, and widely used design patterns to help you build maintainable, testable, and high-performance Python applications. It places a strong emphasis on modern tooling, typing, concurrency models, architectural patterns, and deployment-ready workflows.
Delivered as instructor-led live training (available online or onsite), this program targets intermediate to advanced Python developers who aim to adopt professional practices and patterns for building production-grade Python systems.
Upon completion of this training, participants will be equipped to:
- Enhance code reliability by applying Python typing, dataclasses, and type-checking.
- Structure robust applications using established design patterns and architectural principles.
- Correctly implement concurrency and parallelism through asyncio and multiprocessing.
- Develop well-tested code utilizing pytest, property-based testing, and CI pipelines.
- Profile, optimize, and harden Python applications for production environments.
- Package, distribute, and deploy Python projects using modern tools and containerization.
Course Format
- Interactive lectures and brief demonstrations.
- Hands-on labs and coding exercises conducted daily.
- A capstone mini-project that integrates patterns, testing, and deployment strategies.
Course Customization Options
- To request a customized training session or focus on a specific area such as data, web, or infrastructure, please contact us to arrange.
Agentic AI Engineering with Python — Build Autonomous Agents
21 Hours
This course teaches practical engineering methodologies for designing, building, testing, and deploying agentic (autonomous) systems using Python. It covers the agent loop, tool integrations, memory and state management, orchestration patterns, safety controls, and considerations for production environments.
Delivered as an instructor-led live training (available online or onsite), this program targets intermediate to advanced ML engineers, AI developers, and software engineers aiming to construct robust, production-ready autonomous agents utilizing Python.
Upon completion of this training, participants will be equipped to:
- Design and implement the agent loop and decision-making workflows.
- Integrate external tools and APIs to expand agent capabilities.
- Implement short-term and long-term memory architectures for agents.
- Coordinate multi-step orchestrations and agent composability.
- Apply safety, access control, and observability best practices for deployed agents.
Course Format
- Interactive lectures and discussions.
- Hands-on labs focused on building agents with Python and popular SDKs.
- Project-based exercises resulting in deployable prototypes.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Introduction to Data Science and AI using Python
35 Hours
This is a 5-day introductory course covering Data Science and Artificial Intelligence (AI).
The course is delivered with examples and exercises using Python.
Artificial Intelligence with Python (Intermediate Level)
35 Hours
Artificial Intelligence with Python focuses on creating intelligent systems leveraging Python’s comprehensive ecosystem of AI and machine learning libraries.
This instructor-led live training, available online or onsite, is designed for intermediate Python developers who aim to design, implement, and deploy AI solutions using Python.
By the conclusion of this training, participants will be able to:
- Implement AI algorithms using Python’s core AI libraries.
- Work with supervised, unsupervised, and reinforcement learning models.
- Integrate AI solutions into existing applications and workflows.
- Evaluate model performance and optimize for accuracy and efficiency.
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Algorithmic Trading with Python and R
14 Hours
This instructor-led live training in the UAE (online or onsite) is designed for business analysts looking to automate trades using algorithmic trading, Python, and R.
Upon completion of this training, participants will be able to:
- Utilize algorithms to rapidly buy and sell securities at specialized increments.
- Lower trade-related costs by leveraging algorithmic trading.
- Automatically monitor stock prices and execute trades.
Applied AI from Scratch in Python
28 Hours
This four-day course provides an introduction to Artificial Intelligence and its practical applications using the Python programming language. Upon completion, participants have the option to dedicate an additional day to working on a hands-on AI project.
AWS Cloud9 and Python: A Practical Guide
14 Hours
This instructor-led live training in the UAE (online or on-site) is designed for intermediate-level Python developers seeking to enhance their development experience using AWS Cloud9.
By the end of this training, participants will be able to:
- Set up and configure AWS Cloud9 for Python development.
- Understand the AWS Cloud9 IDE interface and features.
- Write, debug, and deploy Python applications in AWS Cloud9.
- Collaborate with other developers using the AWS Cloud9 platform.
- Integrate AWS Cloud9 with other AWS services for advanced deployments.
Building Chatbots in Python
21 Hours
Chatbots are automated computer programs designed to simulate human interactions through chat interfaces. They assist organizations in enhancing operational efficiency by streamlining and accelerating user engagement.
This instructor-led live training will guide participants through the process of building chatbots using Python.
Upon completion of this training, participants will be able to:
- Grasp the core principles of chatbot development
- Construct, test, deploy, and resolve issues for various types of chatbots using Python
Target Audience
- Software Developers
Course Format
- A blend of lectures, discussions, exercises, and extensive hands-on practice
Note
- For customized training options, please reach out to us to arrange.
GPU Programming with CUDA and Python
14 Hours
This instructor-led, live training in the UAE (online or on-site) is targeted at intermediate-level developers who wish to use CUDA to build Python applications that run in parallel on NVIDIA GPUs.
By the end of this training, participants will be able to:
- Use the Numba compiler to accelerate Python applications running on NVIDIA GPUs.
- Create, compile and launch custom CUDA kernels.
- Manage GPU memory.
- Convert a CPU based application into a GPU-accelerated application.
Scaling Data Analysis with Python and Dask
14 Hours
This instructor-led, live training in the UAE (online or onsite) is tailored for data scientists and software engineers who wish to leverage Dask within the Python ecosystem to build, scale, and analyze large datasets.
By the end of this training, participants will be able to:
- Set up the environment to start building big data processing with Dask and Python.
- Explore the features, libraries, tools, and APIs available in Dask.
- Understand how Dask accelerates parallel computing in Python.
- Learn how to scale the Python ecosystem (NumPy, SciPy, and Pandas) using Dask.
- Optimize the Dask environment to maintain high performance in handling large datasets.
Data Analysis with Python, Pandas and Numpy
14 Hours
This instructor-led, live training in the UAE (online or onsite) is aimed at intermediate-level Python developers and data analysts who wish to enhance their skills in data analysis and manipulation using Pandas and NumPy.
By the end of this training, participants will be able to:
- Set up a development environment that includes Python, Pandas, and NumPy.
- Create a data analysis application using Pandas and NumPy.
- Perform advanced data wrangling, sorting, and filtering operations.
- Conduct aggregate operations and analyze time series data.
- Visualize data using Matplotlib and other visualization libraries.
- Debug and optimize their data analysis code.
FARM (FastAPI, React, and MongoDB) Full Stack Development
14 Hours
This instructor-led live training, offered online or onsite, targets developers who wish to leverage the FARM (FastAPI, React, and MongoDB) stack to build dynamic, high-performance, and scalable web applications.
By the end of this training, participants will be able to:
- Set up the necessary development environment that integrates FastAPI, React, and MongoDB.
- Understand the key concepts, features, and benefits of the FARM stack.
- Learn how to build REST APIs with FastAPI.
- Learn how to design interactive applications with React.
- Develop, test, and deploy applications (front end and back end) using the FARM stack.
Developing APIs with Python and FastAPI
14 Hours
This instructor-led live training in the UAE (online or onsite) is designed for developers who wish to use FastAPI with Python to build, test, and deploy RESTful APIs more efficiently and rapidly.
By the end of this training, participants will be able to:
- Configure the essential development environment for API development with Python and FastAPI.
- Accelerate and simplify API creation using the FastAPI library.
- Master the creation of data models and schemas based on Pydantic and OpenAPI.
- Integrate APIs with databases via SQLAlchemy.
- Apply security measures and authentication mechanisms in APIs using FastAPI utilities.
- Construct container images and deploy web APIs to cloud servers.
Fraud Detection with Python and TensorFlow
14 Hours
This instructor-led live training in the UAE (online or onsite) is tailored for data scientists aiming to apply TensorFlow to the analysis of potential fraud data.
By the end of this training, participants will be able to:
- Construct a fraud detection model using Python and TensorFlow.
- Build linear regression models to predict fraud.
- Develop an end-to-end AI application for analyzing fraud data.
Accelerating Python Pandas Workflows with Modin
14 Hours
This instructor-led, live training in the UAE (online or onsite) is designed for data scientists and developers who wish to use Modin to build and implement parallel computations with Pandas for faster data analysis.
By the end of this training, participants will be able to:
- Set up the necessary environment to start developing Pandas workflows at scale with Modin.
- Understand the features, architecture, and advantages of Modin.
- Know the differences between Modin, Dask, and Ray.
- Perform Pandas operations faster with Modin.
- Use the full Pandas API and its functions through Modin.