Haphazard scripts for scraping bitcoin/bitcoin data from GitHub

Last update: Oct 12, 2022

Related tags

Web Crawling bitcoin-github-scrape

Overview

This is a quick-and-dirty tool used to scrape bitcoin/bitcoin pull request and commentary data.

Each output/<pr number> folder contains:

comments.json: an aggregated list of both issue and review comments, in GitHub's original format
commits.json: a list of commit objects corresponding to the PR, in GitHub's original format
pr.json: the pull request object, in GitHub's original format
comments_abbrev.csv: abbreviated representation of each comment in CSV format
pr_abbrev.csv: abbreviated representation of the PR in CSV format
done: a sentinel file recording the datetime at which the PR data was retrieved
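
The folder layout above can be read back with a few lines of Python. This is only a sketch, not part of the tool itself: the function name load_pr_folder is hypothetical, and it assumes the JSON files are UTF-8 encoded and that the CSV files can be read as plain rows (whether they carry a header row is not stated here).

```python
import csv
import json
from pathlib import Path


def load_pr_folder(folder):
    """Read whatever files exist in one output/<pr number> folder into a dict.

    Hypothetical helper for illustration; missing files are simply skipped.
    """
    folder = Path(folder)
    data = {}

    # JSON files are kept in GitHub's original format, so load them untouched.
    for name in ("pr", "comments", "commits"):
        path = folder / f"{name}.json"
        if path.exists():
            data[name] = json.loads(path.read_text(encoding="utf-8"))

    # CSV files are returned as lists of rows; no header handling is assumed.
    for name in ("pr_abbrev", "comments_abbrev"):
        path = folder / f"{name}.csv"
        if path.exists():
            with path.open(newline="", encoding="utf-8") as handle:
                data[name] = list(csv.reader(handle))

    # The done sentinel holds the retrieval datetime as text, if present.
    done = folder / "done"
    data["done"] = done.read_text(encoding="utf-8").strip() if done.exists() else None
    return data
```

Calling load_pr_folder on one PR folder returns a dict whose "done" entry is None when the PR has not been fully retrieved yet.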

Limitations

The tool does not currently handle open PRs (or any PR that is expected to be updated) properly: once the done sentinel is created, the data for that PR is never refreshed. One possible fix is to compare the PR's timestamps against the done sentinel and overwrite the stored data when it is stale.
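
A minimal sketch of that timestamp comparison follows. It is not implemented in the tool. The names parse_timestamp and is_stale are hypothetical, and the sketch assumes that the done file holds an ISO 8601 datetime (its actual format is not specified here) and that the caller has already fetched a fresh updated_at value for the PR from GitHub.

```python
from datetime import datetime, timezone
from pathlib import Path


def parse_timestamp(text):
    """Parse an ISO 8601 timestamp; naive values are assumed to be UTC."""
    # GitHub timestamps end in "Z", which older fromisoformat versions reject.
    parsed = datetime.fromisoformat(text.strip().replace("Z", "+00:00"))
    if parsed.tzinfo is None:
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


def is_stale(folder, remote_updated_at):
    """Return True if the PR folder should be re-scraped.

    folder: path to an output/<pr number> folder.
    remote_updated_at: the PR's current updated_at string, freshly fetched.
    """
    done = Path(folder) / "done"
    if not done.exists():
        # Never completed, so the data has to be fetched anyway.
        return True
    retrieved_at = parse_timestamp(done.read_text(encoding="utf-8"))
    # Stale when GitHub reports a change after the recorded retrieval time.
    return parse_timestamp(remote_updated_at) > retrieved_at
```

When is_stale returns True, the scraper would overwrite the folder's contents and rewrite the done sentinel with the new retrieval time.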

Owner

James O'Beirne

See also

A modern CSS selector implementation for BeautifulSoup

A dead simple crawler to get books information from Douban.

Anonymously scrapes onlinesim.ru for new usable phone numbers.

Unja is a fast & light tool for fetching known URLs from Wayback Machine

Download images from forum threads

Creating Scrapy scrapers via the Django admin interface

Scraping the data from each page of biocides listed on the BAUA website into a CSV file

A Python web scraper to scrape the latest posts from the official Coinbase blog.

👨🏼‍⚖️ reddit bot that turns comment chains into ace attorney scenes

News, full-text, and article metadata extraction in Python 3. Advanced docs:

jd_maotai rpa: a Selenium-driven RPA bot for flash-sale purchases on JD

Searching info from Google using Python Scrapy

Python scraper that scrapes a torrent website and downloads new movies automatically.

A scheduled HITsz epidemic-reporting script based on GitHub Actions, ready to use out of the box

Crawl BookCorpus

Fundamentus scrapy

A helper library to scrape data from TikTok in one line, using the Influencer Hunters APIs.

A GitHub scraper app, built with the Streamlit and BeautifulSoup Python packages, that scrapes data for a specific user profile

A simple scraper written in Python using the BeautifulSoup library

Automatically scrapes all menu items from the Taco Bell website