Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Last update: Jan 02, 2022

Overview

NewsScraper

A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

🔧 Installation

Clone the repo locally.
Use the package manager pip to install the requirements.

pip install -r requirements.txt

✨ Basic Usage

import NewsScraper

all_data = NewsScraper.fetch_all()
news_data = NewsScraper.fetch_news_data()
crypto_data = NewsScraper.fetch_crypto_data()

fetch_all()

Returns a set of NewsScraper.Result containing fetched results from all available RSS feeds

Can include categories: GLOBAL, US, EU, CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

fetch_news_data()

Returns a set of NewsScraper.Result containing fetched results from CNN, ABC News, Yahoo News, Fox News RSS feeds

Can include categories: GLOBAL, US, EU.

fetch_crypto_data()

Returns a set of NewsScraper.Result containing fetched results from CoinJournal, Crypto Currency News RSS feeds.

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

🔨 Advanced Usage

NewsScraper.Result class

A class used to represent a returned article.

Attributes

context : str

A string describing the category of the article.

ex. "GLOBAL", "US", "BLOCKCHAIN", "BTC".
title : str

A string containing the name of the article.
summary : str

A string containing the summary of the article.

NOTE: sometimes it can have the value of "", because the RSS feed didn't provide a summary.
content : str

A string containing the content of the article.

Methods

Result.json()

Returns a dictionary with the attributes of the class formatted in JSON.

ex.

{
  "context": "global",
  "title": "title of the article",
  "summary": "summary of the article",
  "content": "content of the article"
}

News RSS Feeds

All of these functions return a set of NewsScraper.Result containing fetched results of the described RSS feeds.

fetch_abc()
fetch_cnn()
fetch_yahoo()
fetch_fox_news()

Can include categories: GLOBAL, US, EU.

Alternatively, you can use fetch_news_data() to receive results from all of them.

Crypto RSS Feeds

All of these functions return a set of NewsScraper.Result containing fetched results of the described RSS feeds.

fetch_coinjournal()
fetch_cryptocurrencynews()

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

Alternatively, you can use fetch_news_data() to receive results from all of them.

🤝 Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

📝 License

This project is licensed under the MIT license.

Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Related tags

Overview

NewsScraper

🔧 Installation

✨ Basic Usage

🔨 Advanced Usage

NewsScraper.Result class

context : str

title : str

summary : str

content : str

Result.json()

News RSS Feeds

Crypto RSS Feeds

🤝 Contributing

📝 License

Owner

Rokas

Basic-html-scraper - A complete how to of web scraping with Python for beginners

This is a module that I had created along with my friend. It's a basic web scraping module

Amazon scraper using scrapy, a python framework for crawling websites.

This is python to scrape overview and reviews of companies from Glassdoor.

API which uses discord to scrape NameMC searches/droptime/dropping status of minecraft names

EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address

Html Content / Article Extractor, web scrapping lib in Python

PaperRobot: a paper crawler that can quickly download numerous papers, facilitating paper studying and management

This repo has the source code for the crawler and data crawled from auto-data.net

Scrape and display grades onto the console

淘宝、天猫半价抢购，抢电视、抢茅台，干死黄牛党

Web and PDF Scraper Refactoring

A tool for scraping and organizing data from NewsBank API searches

SmartScraper: 简单、自动、快捷的Python网络爬虫

a way to scrape a database of all of the isef projects

A social networking service scraper in Python

Searching info from Google using Python Scrapy

A Python module to bypass Cloudflare's anti-bot page.

IGLS - Instagram Like Scraper CLI tool

👨🏼‍⚖️ reddit bot that turns comment chains into ace attorney scenes