Facebook Group Scraping Using Beautiful Soup & Selenium

Last update: Aug 12, 2022

Overview

Notes

The scraper should only be used for educational purposes
Kindly refrain from scraping sensitive or private information
It is highly recommended to scrape public (and not private) groups
Ask for consent from the group administrator and/or group members before running any code
I am not responsible for any misuse of the code in any shape or form

Facebook Group Scraping Using Beautiful Soup & Selenium

Extract Facebook group posts that are related to a specific topic and write them to a .json file. This project was created in order to gather data needed to build a chatbot for a university's website.

Input

User's Credentials
Facebook Group URL
Number of Scrolls (controls how many posts are collected)
Directory of the Chromedriver
Optional: Specific topic to be searched
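
The inputs above could be grouped into a single object before the scraper runs. The following is a minimal sketch, not the project's actual code: the class name `ScraperInputs`, its field names, and the validation rules are all hypothetical.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional


@dataclass
class ScraperInputs:
    """Hypothetical container for the inputs listed above."""

    email: str
    password: str
    group_url: str
    num_scrolls: int
    chromedriver_path: Path
    topic: Optional[str] = None  # optional per the input list; None means no search

    def __post_init__(self) -> None:
        # Assumed sanity checks; the original script may not validate anything.
        if self.num_scrolls < 1:
            raise ValueError("num_scrolls must be at least 1")
        if not self.group_url.startswith("https://www.facebook.com/groups/"):
            raise ValueError("group_url should point to a Facebook group")
        # Accept plain strings for convenience and normalise them to Path.
        self.chromedriver_path = Path(self.chromedriver_path)
```

Keeping credentials in an object like this, rather than hard-coded in the script, also makes it easier to load them from environment variables or a separate inputs file.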

What the Scraper Does

Logs into Facebook using the User's Credentials
Enters the group specified by the User
Searches for the topic
Extracts all posts & their comments
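
The steps above can be sketched with Selenium 4 roughly as follows. This is an illustration under stated assumptions, not the repository's implementation: the function names are hypothetical, the login form locators (`email`, `pass`, `login`) and the group search URL pattern are assumptions about Facebook's pages that may change, and the fixed `time.sleep` pauses stand in for proper waits.

```python
import time
from urllib.parse import quote_plus


def build_search_url(group_url: str, topic: str) -> str:
    """Assumed URL pattern for searching inside a group; verify it in a browser."""
    return f"{group_url.rstrip('/')}/search/?q={quote_plus(topic)}"


def collect_page_source(email, password, group_url, num_scrolls,
                        chromedriver_path, topic=None, pause=2.0):
    """Log in, open the group (or its search results), scroll, return the HTML."""
    # Imported here so build_search_url works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome(service=Service(str(chromedriver_path)))
    try:
        # Step 1: log in (locators are assumptions about the login form).
        driver.get("https://www.facebook.com/login")
        driver.find_element(By.ID, "email").send_keys(email)
        driver.find_element(By.ID, "pass").send_keys(password)
        driver.find_element(By.NAME, "login").click()
        time.sleep(pause)

        # Steps 2 and 3: open the group, or its search results if a topic is given.
        driver.get(build_search_url(group_url, topic) if topic else group_url)
        time.sleep(pause)

        # Each scroll to the bottom asks the page to load another batch of posts.
        for _ in range(num_scrolls):
            driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
            time.sleep(pause)

        # Step 4 starts from this HTML, which can be parsed with Beautiful Soup.
        return driver.page_source
    finally:
        driver.quit()
```

The returned HTML would then be passed to `BeautifulSoup(html, "html.parser")`. The selectors needed to pick out posts and comments depend on Facebook's current markup, so they are left out here.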

Scraper Output

A .json file that includes:

Each post
The comments replying to it

Format of file:

{ 
   "tag": "Topic 1",
   "patterns":  [ "Post text" ],
   "responses": [ "Comment 1", 
        "Comment 2",
        "Comment 3"  
    ]
}
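
A record in this format can be built and written with the standard `json` module. The sketch below is illustrative only: the helper names `build_record` and `write_records` are hypothetical, and it assumes the output file holds a list of such records, which the format above does not state.

```python
import json
from pathlib import Path


def build_record(tag, post_text, comments):
    """Return one entry in the format shown above."""
    return {
        "tag": tag,
        "patterns": [post_text],
        "responses": list(comments),
    }


def write_records(records, path):
    """Write all records as a JSON list (assumed top-level structure)."""
    # ensure_ascii=False keeps non-English post text readable in the file.
    text = json.dumps(records, indent=3, ensure_ascii=False)
    Path(path).write_text(text, encoding="utf-8")
```

For example, `write_records([build_record("Topic 1", "Post text", ["Comment 1", "Comment 2"])], "output.json")` produces a file containing one entry shaped like the sample above.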

Setup Requirements

Make sure Chrome is installed
Install ChromeDriver (a version compatible with your Chrome) and place it in the same directory as the scraper script
Enter inputs required by the code
Run the code
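
Before running the scraper, it can help to confirm that ChromeDriver really is where the setup steps expect it. The check below is a small sketch; the function name `find_chromedriver` and the two candidate file names are assumptions and not part of the project.

```python
from pathlib import Path


def find_chromedriver(script_dir):
    """Return the ChromeDriver path inside script_dir, or None if it is missing."""
    # The binary is named "chromedriver" on Linux/macOS and "chromedriver.exe" on Windows.
    for name in ("chromedriver", "chromedriver.exe"):
        candidate = Path(script_dir) / name
        if candidate.is_file():
            return candidate
    return None
```

Calling `find_chromedriver(Path(__file__).parent)` at the top of the script and stopping with a clear message when it returns `None` avoids a less readable Selenium error later on.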

Updates

Scrape comments found in "view more comments"
Add a file for inputs only
Add comments to the code
Add an option to scrape the general group discussions and not specific topics

Owner

Fatima Ghadieh
