Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Last update: Dec 23, 2021

Overview

Agroforestry Species Switchboard 2.0 Scraper

Scrape plants scientific name information from Species Switchboard 2.0.

Requirements

python >= 3.10 (you can use pyenv for easier python version management)
pipenv

How to run

Install dependencies

cp env.sample .env
pipenv --python 3
pipenv install

Run
```
pipenv run python main.py
```
The result will be placed in a file named result.*.csv

Test Shell

pipenv run scrapy shell 'http://apps.worldagroforestry.org/products/switchboard/index.php/species_search/Acacia%20abyssinica'

Cleanup All Outputs

rm result.* && rm log.*

Special Cases

Case	Link	Note
ICRAF Databases Not Found	Engelhardia spicata
Genus Found	Forficula	What to do next?
Multiple Species Found	Alstonia spectabilis	Get the matched species right?
Species Variant Found	Engelhardtia spicata	Need human to check
Similar Species Found	Costus speciosus	Need human to check

Contributing

Fork this repo
Develop
Create pull request
Tag @rizqirizqi for review
Merge~~

License

GPL-3.0

Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Related tags

Overview

Agroforestry Species Switchboard 2.0 Scraper

Requirements

How to run

Test Shell

Cleanup All Outputs

Special Cases

Contributing

License

Owner

Mgs. M. Rizqi Fadhlurrahman

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster

Pseudo API for Google Trends

A simple flask application to scrape gogoanime website.

Scraping script for stats on covid19 pandemic status in Chiba prefecture, Japan

jd_maotai rpa 基于selenium驱动的jd抢购rpa机器人

The core packages of security analyzer web crawler

Audio media crawler for lbry.

Nekopoi scraper using python3

Scrape data on SpaceX: Capsules, Rockets, Cores, Roadsters, SpaceX Info

Searching info from Google using Python Scrapy

An Automated udemy coupons scraper which scrapes coupons and autopost the result in blogspot post

Google Scholar Web Scraping

A repository with scraping code and soccer dataset from understat.com.

LSpider 一个为被动扫描器定制的前端爬虫

Get paper names from dblp.org

一个m3u8视频流下载脚本

Web scraped S&P 500 Data from Wikipedia using Pandas and performed Exploratory Data Analysis on the data.

Visual scraping for Scrapy

A way to scrape sports streams for use with Jellyfin.

Find thumbnails and original images from URL or HTML file.