richardalamo/web-scraping-tutorial
Web Scraping with Python: A Learning Guide

This repository provides educational material for learning web scraping using Python. It starts with the basics using requests and BeautifulSoup and then progresses to more advanced techniques with Scrapy.

Table of Contents

  1. Introduction to Web Scraping
  2. Basic Web Scraping with requests and BeautifulSoup
  3. Setting Up the Environment
  4. Introduction to Scrapy
  5. Advanced Scrapy Techniques
  6. Handling JavaScript-Heavy Sites with Selenium
  7. Responsible Web Scraping: Best Practices and Ethics
  8. Challenges and Solutions in Web Scraping
  9. Practical Projects for Skill Application
  10. Handling Data Post-Scraping
  11. Sample Project: Scraping Patek.com
  12. Sample Project: Scraping Blancpain.com

Prerequisites

  • Basic knowledge of Python
  • Python environment set up on your local machine

Sample Projects

Practice reinforces learning, so we've provided sample projects that apply the concepts and techniques discussed in this tutorial. These projects demonstrate the use of popular Python libraries for web scraping: BeautifulSoup, Scrapy, and Selenium.

The first project uses Scrapy, a full-featured web scraping framework for Python, to extract detailed information about various watch models from Patek.com. The project's directory contains everything you need to understand, run, and learn from this real-world example: the script, sample output data, and a detailed walkthrough of the code.

The second project takes a different approach, using BeautifulSoup and Selenium to scrape data from Blancpain's website. BeautifulSoup on its own suits simpler tasks where the data is already in the page's HTML; combining it with Selenium, which drives a real browser, makes it possible to handle JavaScript-heavy websites. The project's folder contains the script, sample data, and a step-by-step guide to the process.
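
To give a feel for the parsing step these projects rely on, here is a minimal sketch using BeautifulSoup. It is not taken from either sample project: the HTML snippet, its class names, and the parse_watches helper are all invented for illustration. In practice the HTML string would come from something like requests.get(url).text for a static page, or from Selenium's driver.page_source after the browser has rendered a JavaScript-heavy page.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a product listing page.
# The class names and values are invented for this example.
SAMPLE_HTML = """
<ul class="watches">
  <li class="watch"><span class="name">Model A</span><span class="ref">REF-001</span></li>
  <li class="watch"><span class="name">Model B</span><span class="ref">REF-002</span></li>
</ul>
"""


def parse_watches(html):
    """Return one {'name', 'reference'} dict per <li class="watch"> element."""
    # "html.parser" is Python's built-in parser, so no extra parser package is needed.
    soup = BeautifulSoup(html, "html.parser")
    watches = []
    for item in soup.find_all("li", class_="watch"):
        name = item.find("span", class_="name")
        ref = item.find("span", class_="ref")
        # Skip entries that are missing either field instead of failing.
        if name is None or ref is None:
            continue
        watches.append({
            "name": name.get_text(strip=True),
            "reference": ref.get_text(strip=True),
        })
    return watches


if __name__ == "__main__":
    for watch in parse_watches(SAMPLE_HTML):
        print(watch)
```

Keeping the parsing in a function that takes an HTML string means the same code works regardless of how the page was fetched, and it can be tried out on a saved snippet without touching the network.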
