A collection of web scraping and data extraction projects built with Python and various scraping libraries.
- Selenium: Browser automation and dynamic content scraping
- SeleniumBase: Enhanced Selenium with anti-bot bypass
- Undetected ChromeDriver: Stealth browser automation
- Selenium Driverless: CDP-based browser control without chromedriver
- Playwright: Modern browser automation
- Scrapy: Large-scale web crawling and scraping framework
- Crawlee: Web scraping and crawling library
- BeautifulSoup: HTML and XML parsing
- Requests: HTTP requests and REST API interactions
- curl_cffi: HTTP client with browser fingerprint impersonation
- Pandas: Data manipulation and analysis
- Streamlit: Interactive data dashboards and visualization
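
As a rough illustration of the parsing step that BeautifulSoup handles in projects like these, here is a minimal sketch. The inline HTML, the CSS class names, and the `parse_items` helper are all hypothetical and are not taken from any project in this repository. In a real project the HTML would come from an HTTP client or a browser-automation tool rather than a string.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for a page that was already fetched.
HTML = """
<ul id="products">
  <li class="item"><span class="name">Widget</span><span class="price">9.99</span></li>
  <li class="item"><span class="name">Gadget</span><span class="price">24.50</span></li>
</ul>
"""


def parse_items(html):
    """Extract one dict per <li class="item"> element (illustrative helper)."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for li in soup.select("li.item"):
        rows.append({
            "name": li.select_one(".name").get_text(strip=True),
            "price": float(li.select_one(".price").get_text(strip=True)),
        })
    return rows


rows = parse_items(HTML)
for row in rows:
    print(row["name"], row["price"])
```

A list of dicts like `rows` can be passed directly to `pandas.DataFrame(rows)` for further analysis.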
scraping-projects/
├── milestone-projects/
├── other-projects/
├── .gitignore
└── README.md
Some projects in this repository were built for learning and experimentation. All scraping was done responsibly and in accordance with each website's terms of service.