a small library for extracting rich content from urls

Last update: Dec 27, 2022

Related tags

Overview

A small library for extracting rich content from urls.

what does it do?

micawber supplies a few methods for retrieving rich metadata about a variety of links, such as links to youtube videos. micawber also provides functions for parsing blocks of text and html and replacing links to videos with rich embedded content.

examples

here is a quick example:

import micawber

# load up rules for some default providers, such as youtube and flickr
providers = micawber.bootstrap_basic()

providers.request('http://www.youtube.com/watch?v=54XHDUOHuzU')

# returns the following dictionary:
{
    'author_name': 'pascalbrax',
    'author_url': u'http://www.youtube.com/user/pascalbrax'
    'height': 344,
    'html': u'<iframe width="459" height="344" src="http://www.youtube.com/embed/54XHDUOHuzU?fs=1&feature=oembed" frameborder="0" allowfullscreen></iframe>',
    'provider_name': 'YouTube',
    'provider_url': 'http://www.youtube.com/',
    'title': 'Future Crew - Second Reality demo - HD',
    'type': u'video',
    'thumbnail_height': 360,
    'thumbnail_url': u'http://i2.ytimg.com/vi/54XHDUOHuzU/hqdefault.jpg',
    'thumbnail_width': 480,
    'url': 'http://www.youtube.com/watch?v=54XHDUOHuzU',
    'width': 459,
    'version': '1.0',
}

providers.parse_text('this is a test:\nhttp://www.youtube.com/watch?v=54XHDUOHuzU')

# returns the following string:
this is a test:
<iframe width="459" height="344" src="http://www.youtube.com/embed/54XHDUOHuzU?fs=1&feature=oembed" frameborder="0" allowfullscreen></iframe>

providers.parse_html('<p>http://www.youtube.com/watch?v=54XHDUOHuzU</p>')

# returns the following html:
<p><iframe width="459" height="344" src="http://www.youtube.com/embed/54XHDUOHuzU?fs=1&amp;feature=oembed" frameborder="0" allowfullscreen="allowfullscreen"></iframe></p>

a small library for extracting rich content from urls

Related tags

Overview

what does it do?

examples

Owner

Charles Leifer

This project was created using Python technology and flask tools to scrape a music site

A dead simple crawler to get books information from Douban.

The first public repository that provides free BUBT website scraping API script on Github.

Extract embedded metadata from HTML markup

A Scrapper with python

A simple django-rest-framework api using web scraping

Footballmapies - Football mapies for learning webscraping and use of gmplot module in python

A Python web scraper to scrape latest posts from official Coinbase's Blog.

Creating Scrapy scrapers via the Django admin interface

Python Web Scrapper Project

Scraping web pages to get data

河南工业大学完美校园自动校外打卡

A package that provides you Latest Cyber/Hacker News from website using Web-Scraping.

A simple app to scrap data from Twitter.

Web-Scrapper using Python and Flask

Web Scraping Framework

This is a sport analytics project that combines the knowledge of OOP and Webscraping

Web-scraping - Program that scrapes a website for a collection of quotes, picks one at random and displays it

用python爬取江苏几大高校的就业网站，并提供3种方式通知给用户，分别是通过微信发送、命令行直接输出、windows气泡通知。

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster

a small library for extracting rich content from urls

Related tags

Overview

what does it do?

examples

Owner

Charles Leifer

This project was created using Python technology and flask tools to scrape a music site

A dead simple crawler to get books information from Douban.

The first public repository that provides free BUBT website scraping API script on Github.

Extract embedded metadata from HTML markup

A Scrapper with python

A simple django-rest-framework api using web scraping

Footballmapies - Football mapies for learning webscraping and use of gmplot module in python

A Python web scraper to scrape latest posts from official Coinbase's Blog.

Creating Scrapy scrapers via the Django admin interface

Python Web Scrapper Project

Scraping web pages to get data

河南工业大学 完美校园 自动校外打卡

A package that provides you Latest Cyber/Hacker News from website using Web-Scraping.

A simple app to scrap data from Twitter.

Web-Scrapper using Python and Flask

Web Scraping Framework

This is a sport analytics project that combines the knowledge of OOP and Webscraping

Web-scraping - Program that scrapes a website for a collection of quotes, picks one at random and displays it

用python爬取江苏几大高校的就业网站，并提供3种方式通知给用户，分别是通过微信发送、命令行直接输出、windows气泡通知。

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster

河南工业大学完美校园自动校外打卡