🕸️ Python Web Scraper with CSV/Excel Export

A flexible Python web scraper that lets you:

Input any URL at runtime
Preview available HTML tags and select which elements to scrape
Export scraped data to CSV or Excel
Choose tags dynamically (no hardcoded tag list)
Confirm before final scraping
Recover from invalid input without the script crashing
See the saved file location at the end

🚨 Important Note

This scraper does not currently support JavaScript-rendered pages.
Support for JS-rendered pages (via Selenium or Playwright) is planned for a future release.
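
To illustrate the limitation, here is a minimal sketch (not the project's actual code) using BeautifulSoup from the requirements. The HTML string and the `app` element id are made up for the example. BeautifulSoup parses the markup as the server sent it and never runs scripts, so text that a browser would add through JavaScript is missing from the parsed tree.

```python
from bs4 import BeautifulSoup

# Hypothetical page: the div is only filled in after a browser runs the script.
html = """
<html><body>
  <div id="app"></div>
  <script>
    document.getElementById("app").textContent = "Loaded by JS";
  </script>
</body></html>
"""

soup = BeautifulSoup(html, "html.parser")

# The parser keeps the script as plain text and does not execute it,
# so the div is still empty in the parsed document.
app_div = soup.find("div", id="app")
app_text = app_div.get_text(strip=True)

print(f"Text inside #app: {app_text!r}")
```

If a page you scrape comes back with empty containers like this, it is probably rendered client-side.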

🚀 Features

✔ Dynamic Tag Detection – Pre-scrapes the page and lists all available HTML tags
✔ User-Controlled Scraping – Select which tags you want to scrape
✔ Multiple Export Options – Save as CSV or Excel
✔ Error Handling – Handles invalid choices without crashing
✔ Clear Exit Options – Press q anytime to quit
✔ File Path Confirmation – Confirms where your files were saved
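
As a rough sketch of how tag detection and input validation could work (an assumption about the approach, not the code in this repository), the example below counts every tag on a page and checks a comma-separated selection against that list. The function names `list_tags` and `parse_selection` are hypothetical.

```python
from collections import Counter

from bs4 import BeautifulSoup


def list_tags(html):
    """Return a Counter mapping each tag name on the page to its number of occurrences."""
    soup = BeautifulSoup(html, "html.parser")
    # find_all(True) matches every tag in the document.
    return Counter(tag.name for tag in soup.find_all(True))


def parse_selection(raw, available):
    """Turn input such as 'p, h1' into a list of valid tag names.

    Returns None when the user enters 'q' to quit.
    Raises ValueError when nothing was entered or a tag is not on the page.
    """
    text = raw.strip().lower()
    if text == "q":
        return None
    chosen = [part.strip() for part in text.split(",") if part.strip()]
    unknown = [name for name in chosen if name not in available]
    if not chosen or unknown:
        raise ValueError(f"Invalid selection: {unknown or 'nothing entered'}")
    return chosen


sample = "<html><body><h1>Title</h1><p>One</p><p>Two</p></body></html>"
tags = list_tags(sample)
print(sorted(tags.items()))
print(parse_selection("p, h1", tags))
```

A prompt loop can call `parse_selection` inside a `try`/`except ValueError` block and ask again, which keeps a bad choice from ending the program.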

🛠️ Requirements

Python 3.8+
The following Python libraries (see requirements.txt):
- requests
- beautifulsoup4
- pandas

📥 Installation

Clone the repository:

git clone https://github.com/YOUR_USERNAME/python-web-scraper.git
cd python-web-scraper

Install dependencies:

pip install -r requirements.txt

▶️ Usage

Run the scraper:

python scraper.py

Then follow the prompts:

Enter the URL you want to scrape.
Review the preview of all HTML tags found on the page.
Select the tags to scrape (e.g., p, h1, h2).
Choose CSV or Excel output.
Confirm to start scraping.
The files are saved in the current folder, and the path is displayed at the end.
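
The last steps could look roughly like the sketch below. It is an illustration under assumptions, not the script's real implementation: `extract_rows`, `export_rows`, the `tag`/`text` column names, and the default file stem `scraped_data` are all hypothetical. Note that writing `.xlsx` files with pandas requires an Excel engine such as openpyxl to be installed.

```python
from pathlib import Path

import pandas as pd
from bs4 import BeautifulSoup


def extract_rows(html, tag_names):
    """Collect one row per matching element: its tag name and visible text."""
    soup = BeautifulSoup(html, "html.parser")
    # Passing a list to find_all matches any of the given tag names.
    return [
        {"tag": element.name, "text": element.get_text(strip=True)}
        for element in soup.find_all(tag_names)
    ]


def export_rows(rows, fmt, stem="scraped_data"):
    """Write rows to CSV or Excel and return the absolute path of the file."""
    frame = pd.DataFrame(rows, columns=["tag", "text"])
    if fmt == "csv":
        path = Path(f"{stem}.csv")
        frame.to_csv(path, index=False)
    elif fmt == "excel":
        # Assumes an Excel engine such as openpyxl is available to pandas.
        path = Path(f"{stem}.xlsx")
        frame.to_excel(path, index=False)
    else:
        raise ValueError(f"Unsupported format: {fmt!r}")
    return path.resolve()


rows = extract_rows("<h1>Title</h1><p>One</p><p>Two</p>", ["h1", "p"])
print(rows)
```

Calling `export_rows(rows, "csv")` writes `scraped_data.csv` to the current folder and returns its full path, which the script can print as the final confirmation.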

🖥️ Future Enhancements

✅ Add support for JavaScript-rendered pages
✅ Add a Streamlit Web Interface for easy use
✅ Deploy on Streamlit Cloud so anyone can try it online
✅ Support search by CSS selectors or attributes

🤝 Contributing

Pull requests are welcome! For major changes, please open an issue first to discuss what you’d like to change.

📜 License

MIT
