Scrapping the data from each page of biocides listed on the BAUA website into a csv file

Last update: Nov 30, 2021

Related tags

Overview

Baua Biocides Scraper

Scrapping the data from each page of biocides listed on the BAUA website (https://www.baua.de/DE/Biozid-Meldeverordnung/Offen/offen.html) into a csv file.
A windows standalone client is avalaible in the dist folder

About the project

What's the problem?

Baua website contains many usefull data for biocides domain, but the website only allows you to search product by product and it is not easy to find and get some informations with over 80,000 products listed

The idea

Facilitate the data manipulation with providing a csv file with all data scraped from Baua website.

How does it work ?

The user start the program.
The program extract data from Baua website.
A csv file containing data are created.

Roadmap

This project was created after a request and is not intended to evolve. Nevertheless you can fork the project to improve it by yourself and propose them via the project pull requests. or make a suggestion via the project issues.

Build with

Programming language : Python 3.10.0
Scraping Framework : Scrapy 2.5.1
HTTP library : Requests 2.26.0
Standalone Builder : PyInstaller 4.7

Demo

You can use the windows standalone client in the dist folder

Version management

We use a semantic version management, that is a version number MAJOR.MINOR.CORRECTIVE :

the MAJOR version number when there are non backward compatible changes,
the MINOR version number when there are backward compatible feature additions,
the FIX version number when there are backwards compatible bug fixes.

See SignMail tags For more info: semver.org

Authors

Eric De Maria - Numio - Initial work

License

This project is licensed under the GNU GPL 3 license - See the LICENSE file for more details.

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

Instagram_scrapper This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or exce

5 Oct 17, 2022

Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX)

mcc-mnc.com-webscraper Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX) A Python script for web scraping mcc-mnc.com Link: mcc

1 Nov 7, 2021

AssistScraper - program for /r/nba to use to find list of all players a player assisted and how many assists each player recieved

5 Nov 25, 2021

EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address

Introduction This is a project I built with the sole intent to learn more about

1 Jan 14, 2022

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Web-Scrapping-1 An application that on a given url, crowls a web page and gets all words, sorts and counts them. Installation Using the package manage

1 Jan 16, 2022

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

The windows standalone client for the first public version of Baua Biocides Scraper
Source code(tar.gz)
Source code(zip)
Baua_Biocides_Scraper_Windows.zip(16.02 MB)

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

Related tags

Overview

Baua Biocides Scraper

About the project

What's the problem?

The idea

How does it work ?

Roadmap

Build with

Demo

Version management

Authors

License

You might also like...

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX)

AssistScraper - program for /r/nba to use to find list of all players a player assisted and how many assists each player recieved

A Python module to bypass Cloudflare's anti-bot page.

Screenhook is a script that captures an image of a web page and send it to a discord webhook.

A Python module to bypass Cloudflare's anti-bot page.

Python script who crawl first shodan page and check DBLTEK vulnerability

EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

Owner

Eric DE MARIA

爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》

This script is intended to crawl license information of repositories through the GitHub API.

Web Scraping OLX with Python and Bsoup.

This is a script that scrapes the longitude and latitude on food.grab.com

A modern CSS selector implementation for BeautifulSoup

Semplice scraper realizzato in Python tramite la libreria BeautifulSoup

Python script to check if there is any differences in responses of an application when the request comes from a search engine's crawler.

Google Developer Profile Badge Scraper

A Python library for automating interaction with websites.

Web Scraping Practica With Python

Web scrapping

A simple python web scraper.

✂️🕷️ Spider-Cut is a Network Mapper Framework (NMAP Framework)

京东茅台抢购 2021年4月最新版

An helper library to scrape data from Instagram effortlessly, using the Influencer Hunters APIs.

Meme-videos - Scrapes memes and turn them into a video compilations

让中国用户使用git从github下载的速度提高1000倍!

simple http & https proxy scraper and checker

Get paper names from dblp.org

A package designed to scrape data from Yahoo Finance.