Scrap the 42 Intranet's elearning videos in a single click

Last update: Oct 27, 2022

Related tags

Web Crawling 42intra_scraper

Overview

42intra_scraper

Scrap the 42 Intranet's elearning videos in a single click.

Why you would want to use it ?

Adjust speed at your convenience. (The intra doesn't allow this)
Working in a remote location where internet is hit or miss ? Download what you need and you'll have it in your computer.
Have a friend that is freeze and can't access the intra's resources ? You can download the videos, compress them and send them via drive.

How to use it:

git clone [email protected]:Dovalich/42intra_scraper.git

pip3 install -r requirements.txt

python3 intra_scraper.py

And then all you have to do is follow the instructions that the program gives you, that is:

enter your 42 intranet username
enter your 42 intranet password
enter the elearning link you want to scrap for example https://elearning.intra.42.fr/tags/38/notions

Here's a short Tutorial gif:

How does it work ?

It's fairly simple.

The program makes a post request to the intranet using your logins (via the requests module).
Once logged-in, it recursively searches for any links that are in the middle of the page (the ones that contain videos).
Once it finds a video link, it downloads it based on the video quality you chose (SD or HD).

Note

As you can see in the code I don't store your user name and password. In fact I only use them once to login. But be careful when using these types of scripts. You should always read the source code before giving away sensitive information.

If you have feedback on the code please let me know! 👨‍🎓

And feel free to use it however you want.

Scrap the 42 Intranet's elearning videos in a single click

Related tags

Overview

42intra_scraper

Why you would want to use it ?

How to use it:

How does it work ?

Note

Owner

Noufel

News, full-text, and article metadata extraction in Python 3. Advanced docs:

A simple reddit scraper to get memes (only images) from r/ProgrammerHumor.

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

A Python Oriented tool to Scrap WhatsApp Group Link using Google Dork it Scraps Whatsapp Group Links From Google Results And Gives Working Links.

Web Crawlers for Data Labelling of Malicious Domain Detection & IP Reputation Evaluation

A scrapy pipeline that provides an easy way to store files and images using various folder structures.

A web service for scanning media hosted by a Matrix media repository

Google Scholar Web Scraping

Footballmapies - Football mapies for learning webscraping and use of gmplot module in python

Scrape data on SpaceX: Capsules, Rockets, Cores, Roadsters, SpaceX Info

🤖 Threaded Scraper to get discord servers from disboard.org written in python3

Scrapy-soccer-games - Scraping information about soccer games from a few websites

This app will let you continuously scrape certain parts of LeasePlan and extract data of cars becoming available for lease.

Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX)

API to parse tibia.com content into python objects.

WebScrapping Project - G1 Latest News

Ebay Webscraper for Getting Average Product Price

Scrapes Every Email Address of Every Society in Every University

A Pixiv web crawler module

feapder 是一款简单、快速、轻量级的爬虫框架。以开发快速、抓取快速、使用简单、功能强大为宗旨。支持分布式爬虫、批次爬虫、多模板爬虫，以及完善的爬虫报警机制。