DG · Portfolio
Study Projects

Book Scraper

Web scraping and ETL pipeline

BeautifulSoup CLI Python Requests

About the project

Book Scraper is a Python program that builds an ETL pipeline to gather book information from the Books to Scrape website. It can extract data for every book in a category and optionally download every cover image.

Main features

Scrapes every book category

Extracts data for all books in each category

Optionally downloads cover images

Automatically navigates across pages (pagination)

Exports data to CSV per category (output_data/ folder)

Challenges encountered

First programming project

Learnings

Web scraping, ETL pipeline, BeautifulSoup, Requests, CLI

Technical stack technique

B

BeautifulSoup

Tool

C

CLI

Tool

P

Python

Language

R

Requests

Tool

Is this project interesting for you ?

Let's discuss how I can help you with your next project