Web Scraping with Python Training Course
Web Scraping is a method used to extract data from websites and store it locally in a file or database.
This instructor-led, live training (available online or on-site) is designed for developers who want to leverage Python to automate the process of crawling multiple websites, extracting data for processing and analysis.
By the end of this training, participants will be able to:
- Install and configure Python along with all necessary packages.
- Retrieve and parse data stored across numerous websites.
- Understand how websites function and how their HTML is structured.
- Build spiders to crawl the web at scale.
- Use Selenium to crawl AJAX-driven web pages.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- This course assumes prior knowledge of programming.
- To request customized training for this course, please contact us to make arrangements.
Course Outline
Introduction
Setting up the Development Environment
Python Primer: Data Structures, Conditionals, File Handling, etc.
Python Packages for Web Scraping: Scrapy and BeautifulSoup
How a Website Works
How HTML is Structured
Making a Web Request
Scraping an HTML Page
Working with XPath and CSS
Filtering Data Using Regular Expressions
Creating a Web Crawler
Crawling AJAX and JavaScript Pages with Selenium.
Web Scraping Best Practices
Troubleshooting
Summary and Conclusion
Requirements
- Programming experience, preferably in Python. If participants have programming experience in a language other than Python, the training can be extended to include more introductory Python exercises.
Audience
- Developers
Open Training Courses require 5+ participants.
Web Scraping with Python Training Course - Booking
Web Scraping with Python Training Course - Enquiry
Web Scraping with Python - Consultancy Enquiry
Testimonials (1)
Many different examples and topics has been covered, from basic investigation to login management and dynamic page management.
Daniele Tagliaferro - Creditsafe Italia Srl
Course - Web Scraping with Python
Upcoming Courses
Related Courses
Advanced Python: Best Practices and Design Patterns
28 HoursThis intensive, hands-on course explores advanced Python techniques, engineering best practices, and widely adopted design patterns to help you build maintainable, testable, and high-performance Python applications. It focuses on modern tooling, type hints, concurrency models, architectural patterns, and deployment-ready workflows.
Delivered as an instructor-led, live training session (available online or onsite), this program is designed for intermediate to advanced Python developers who aim to adopt professional practices and patterns for production-grade Python systems.
Upon completion of this training, participants will be able to:
- Utilize Python typing, dataclasses, and type-checking to enhance code reliability.
- Apply design patterns and architectural principles to structure robust applications.
- Effectively implement concurrency and parallelism using asyncio and multiprocessing.
- Develop well-tested code using pytest, property-based testing, and CI pipelines.
- Profile, optimize, and secure Python applications for production environments.
- Package, distribute, and deploy Python projects using modern tools and containerization.
Course Format
- Interactive lectures accompanied by concise demonstrations.
- Practical labs and coding exercises each day.
- A capstone mini-project that integrates patterns, testing, and deployment.
Customization Options
- To request customized training or focus on specific areas (such as data, web, or infrastructure), please contact us to arrange.
Agentic AI Engineering with Python — Build Autonomous Agents
21 HoursThis course delivers practical engineering methodologies for designing, constructing, testing, and deploying agentic (autonomous) systems using Python. Key topics include the agent loop, tool integrations, memory and state management, orchestration patterns, safety controls, and production-grade considerations.
Delivered as an instructor-led, live training session (available online or onsite), this program targets intermediate to advanced ML engineers, AI developers, and software engineers looking to build robust, production-ready autonomous agents in Python.
Upon completing this training, participants will be equipped to:
- Design and implement agent loops and decision-making workflows.
- Integrate external tools and APIs to enhance agent capabilities.
- Implement memory architectures for both short-term and long-term state management.
- Coordinate multi-step orchestrations and ensure agent composability.
- Apply best practices for safety, access control, and observability in deployed agents.
Course Format
- Interactive lectures and discussions.
- Hands-on labs focused on building agents with Python and popular SDKs.
- Project-based exercises resulting in deployable prototypes.
Customization Options
- To arrange customized training for this course, please contact us.
Introduction to Data Science and AI using Python
35 HoursThis is a 5 day introduction to Data Science and Artificial Intelligence (AI).
The course is delivered with examples and exercises using Python
Artificial Intelligence with Python (Intermediate Level)
35 HoursArtificial Intelligence with Python focuses on building intelligent systems by leveraging Python’s comprehensive ecosystem of AI and machine learning libraries.
This instructor-led live training, available either online or onsite, is designed for intermediate-level Python programmers who aim to design, implement, and deploy AI solutions using Python.
Upon completing this training, participants will be equipped to:
- Develop AI algorithms utilizing Python’s primary AI libraries.
- Apply supervised, unsupervised, and reinforcement learning models.
- Integrate AI solutions seamlessly into existing applications and workflows.
- Assess model performance and optimize for both accuracy and efficiency.
Course Format
- Engaging lectures and interactive discussions.
- Extensive exercises and practical application.
- Hands-on implementation within a live laboratory environment.
Customization Options
- To arrange a customized training session for this course, please get in touch with us.
Algorithmic Trading with Python and R
14 HoursThis instructor-led, live training in Argentina (online or onsite) is designed for business analysts who wish to automate trading using algorithmic strategies, Python, and R.
By the end of this training, participants will be able to:
- Utilize algorithms to rapidly buy and sell securities at specialized increments.
- Lower costs associated with trading through the application of algorithmic methods.
- Automatically monitor stock prices and execute trades.
Applied AI from Scratch in Python
28 HoursThis comprehensive 4-day course provides an introduction to Artificial Intelligence and its practical applications using the Python programming language. Participants also have the option to extend their learning with an additional day dedicated to completing a real-world AI project upon finishing the course.
AWS Cloud9 and Python: A Practical Guide
14 HoursThis instructor-led, live training in Argentina (online or onsite) is aimed at intermediate-level Python developers who wish to enhance their Python development experience using AWS Cloud9.
By the end of this training, participants will be able to:
- Set up and configure AWS Cloud9 for Python development.
- Understand the AWS Cloud9 IDE interface and features.
- Write, debug, and deploy Python applications in AWS Cloud9.
- Collaborate with other developers using the AWS Cloud9 platform.
- Integrate AWS Cloud9 with other AWS services for advanced deployments.
Building Chatbots in Python
21 HoursChatbots are software applications designed to automatically mimic human responses through chat interfaces. They enable organizations to optimize operational efficiency by offering more streamlined and rapid user interaction options.
This instructor-led live training guides participants through the process of creating chatbots using Python.
Upon completion of this training, participants will be able to:
- Grasp the core principles of chatbot development
- Create, test, deploy, and troubleshoot diverse chatbot applications using Python
Audience
- Developers
Course Format
- A blend of lectures, discussions, exercises, and extensive hands-on practice
Note
- To arrange a customized training session for this course, please get in touch with us.
GPU Programming with CUDA and Python
14 HoursThis instructor-led, live training in Argentina (online or onsite) is aimed at intermediate-level developers who wish to use CUDA to build Python applications that run in parallel on NVIDIA GPUs.
By the end of this training, participants will be able to:
- Use the Numba compiler to accelerate Python applications running on NVIDIA GPUs.
- Create, compile and launch custom CUDA kernels.
- Manage GPU memory.
- Convert a CPU based application into a GPU-accelerated application.
Bespoke Applied Artificial Intelligence and LLM Engineering with Python
35 HoursCourse Overview
This practical training is tailored for data engineering professionals aiming to develop applied skills in artificial intelligence, Python, and large language models. The curriculum emphasizes real-world use cases, encompassing model utilization, prompt engineering, and the creation of AI-driven solutions. Participants will engage in a series of progressive exercises, advancing from foundational concepts to the development of deployable AI workflows.
Training Format
• Classroom-based, in-person sessions
• Instructor-led instruction with guided practice
• Interactive discussions and real-world case studies
• Daily hands-on exercises
Course Objectives
• Grasp core AI and machine learning principles pertinent to contemporary applications
• Enhance Python proficiency for AI development and data workflows
• Comprehend the mechanics of large language models and their effective application
• Craft and optimize prompts to ensure reliable outputs
• Construct complete AI solutions utilizing APIs and frameworks
• Incorporate AI capabilities into data engineering pipelines
Scaling Data Analysis with Python and Dask
14 HoursThis instructor-led live training in Argentina (available online or onsite) targets data scientists and software engineers who intend to utilize Dask alongside the Python ecosystem to construct, scale, and analyze large datasets.
By the conclusion of this training, participants will be able to:
- Configure the environment necessary to begin developing big data solutions using Dask and Python.
- Investigate the features, libraries, tools, and APIs provided by Dask.
- Comprehend how Dask enhances parallel computing capabilities in Python.
- Acquire the skills to scale the Python ecosystem, including NumPy, SciPy, and Pandas, using Dask.
- Optimize the Dask environment to ensure high performance when managing large datasets.
Data Analysis with Python, Pandas and Numpy
14 HoursThis instructor-led, live training in Argentina (online or onsite) is aimed at intermediate-level Python developers and data analysts who wish to enhance their skills in data analysis and manipulation using Pandas and NumPy.
By the end of this training, participants will be able to:
- Set up a development environment that includes Python, Pandas, and NumPy.
- Create a data analysis application using Pandas and NumPy.
- Perform advanced data wrangling, sorting, and filtering operations.
- Conduct aggregate operations and analyze time series data.
- Visualize data using Matplotlib and other visualization libraries.
- Debug and optimize their data analysis code.
FARM (FastAPI, React, and MongoDB) Full Stack Development
14 HoursThis instructor-led live training, offered online or onsite, targets developers who want to employ the FARM (FastAPI, React, and MongoDB) stack to construct dynamic, high-performance, and scalable web applications.
By the end of this course, participants will be able to:
- Set up the required development environment integrating FastAPI, React, and MongoDB.
- Understand the fundamental concepts, features, and benefits of the FARM stack.
- Learn how to build REST APIs with FastAPI.
- Learn how to design interactive applications with React.
- Develop, test, and deploy applications (front end and back end) using the FARM stack.
Developing APIs with Python and FastAPI
14 HoursThis instructor-led live training in Argentina (online or onsite) is designed for developers who want to use Python with FastAPI to build, test, and deploy RESTful APIs more easily and quickly.
By the end of this training, participants will be able to:
- Set up the necessary development environment to develop APIs with Python and FastAPI.
- Create APIs quicker and easier using the FastAPI library.
- Learn how to create data models and schemas based on Pydantic and OpenAPI.
- Connect APIs to a database using SQLAlchemy.
- Implement security and authentication in APIs using the FastAPI tools.
- Build container images and deploy web APIs to a cloud server.
Accelerating Python Pandas Workflows with Modin
14 HoursThis instructor-led, live training in Argentina (available online or onsite) is designed for data scientists and developers who wish to use Modin to build and implement parallel computations with Pandas for faster data analysis.
Upon completing this training, participants will be able to:
- Configure the necessary environment to begin developing Pandas workflows at scale using Modin.
- Gain a clear understanding of Modin’s features, architecture, and advantages.
- Identify the key differences between Modin, Dask, and Ray.
- Execute Pandas operations more efficiently with the help of Modin.
- Implement the full range of Pandas API functions.