This is a Text Data Analysis Project Involving (YouTube Case Study).

Overview

Text_Data_Analysis

This is a Text Data Analysis Project Involving (YouTube Case Study).

Problem Statement => Sentiment Analysis.

Package1:

There are many Sentiment Packages such as Vader, Pacy. In this project i am using TextBlob which is a NLP library.

When considering Texblob Sentiment Analysis, there are two keys involved which are Polarity and Subjectivity.

-- Polarity: Which ranges from [-1 to +1] for negative and positive sentiments.

-- Subjectivity: When there are no Sentiments in a sentence.

Error_Handling

Use try,except to handle error in your code.

Package2:

The second package I'll be using to perform visualization on the sentiment Analysis is Wordcloud.

-- wordcloud:analysis give regard to the keyword with the bigger Font, therfore any keyword with this attribute has the higher priority.

-- wordcloud : data must also be stored in String nature before being passed.

Instaling wordcloud.

wordcloud can be tricky when installing . irrespective of command prompt or conda prompt.. open and run as administrator and excute the following line of codes below.

-- for conda. conda install -c https://conda.anaconda.org/conda-forge wordcloud

-- For command prompt.

git clone https://github.com/amueller/word_cloud.git
cd word_cloud
pip install .

stopword.

This are words that donot make any sense in Analysis. such as He, Him, Is, The.

-- wordcloud has parameters that removes this stopwords . ie (stopwords = reset(STOPWORDS).

Negative comment Visuation.

Negative_comment

Positive comment Visuation.

Positive_comment

Problem Statement => Emoji Analysis.

-- !pip install emoji on Jupyter notebook.

-- pip install emoji on conda prompt or Cmd prompt open and run as administrator

Note:

After iterating the emoji_list, you need to compute it into frequncies, which means you need to come up with data in the form of Dict. this can be implemented completely from Scratch, or using count. But in this analysis we shall be using Collections models.

Emoji Visualization.

emoji

Problem Statement => Collecting the Entire data of Youtube.

-- The first way to do this by using the OS, interating with the OS and using a path to access all the files.

-- The second way is Glob, which is consider to be the best way.

Most Common econdings for reading data.

-- latin, UTF-8, iso-8859-1 (essential in reading complex data example is japanese data), cp-1252

Problem Statement => Which Category has the Maximum Likes.

The category file was clean and coverted to dict, then,the Category_name in the dict was map with category_id in the full_df and store in a column created in the full_df.

-- finding the maximum likes in the category can be archived with groupby as well but we used boxplot to visualized.

Visualized most liked category.

most_like_category

Problem Statement => Find out weather Audience are Engaged or not.

The three most important keys to consider when solving this kind of problem are Like_rate, Dislike-rate, Comment_rate.

Like rate:

likes_rate

Analysis weather your Viewes will affect your likes or not.

In this case you can use Scatter plot or Regplot to check and also correlation and visualized it using heatmap

Note.

when using seaborn regplot in jupyter notebook always set ci=None, for it to execute.

Problem Statement => Analyse trending videos

In this project the data frame had only channel_tile and video_id, so we used a groupby function to group the video_id's according to there channel_title. and renamed the video_id's column to total_video according to there channel_title respectfully, the channel with the highest count of video_id had the trending vidoes.

Visualized Trending videos.

trending videos

Problem Statement => Does Punctuation in a title and tags have any relations with views,likes,dislikes and comments?.

We have to extracts all the punctuations marks from title, channel_title or tags, and this can be done completely from scratch or using python build in modules like Regular expression, Strings. in this project we shall use the string module.

--- To check if the punctuatons will affect the views,likes,dislikes or comments. use correlatoin to check the count punctuations from title, channel_title or tags on the views,likes,dislikes or comments.

title punctuation count plot.

title_punc_count

Owner
Data Science || Data Analyst.
The PornHub Downloader is a powerfull script used to download and manage both videos and pictures

The PornHub Downloader is a powerfull script used to download and manage both videos and pictures

16 Aug 31, 2022
Application Updater using an download link

Application-Updater This tool will update your app using an storage link

ExtremeDev 1 Dec 20, 2021
this is udemy course downloader, before a start you know how to get access token.

udemy_downloader this is udemy course downloader, before a start you know how to get access token. To get the access_token on Google Chrome (once on U

OkUgur 18 Dec 04, 2022
Scripts to download files and folders programmatically from Google Drive

Google Drive Downloader Scripts Every time I need to download a lot of files from Google Drive (e.g. a dataset), it's always incredibly frustrating an

Ivan Evtimov 6 Jul 22, 2021
Pypixiv - A fully-typed, asynchronous api wrapper for pixiv

pypixiv this library is a fully-typed, asynchronous api wrapper for pixiv. featu

DeltaLaboratory 2 Nov 16, 2022
A cli tool to download purchased products from the DLsite.

dlsite-downloader A cli tool to download purchased products from the DLsite. How can I use? This program runs with configurations defined at settings.

AcrylicShrimp 9 Dec 23, 2022
This project is helps to download contents from Streamtape by utilizing the API

It scrapes Streamtape api and download contents from the site.

Debiprasad Das 5 Dec 28, 2022
Twitter Media Downloader (Telegram Bot)

Twitter Media Downloader (Telegram Bot)

Matin Baloochestani 8 Oct 27, 2022
VD Song Bot - A telegram bot that can download songs

VD Song Bot A telegram bot that can download songs Reach me on Telegram @MusicVNDbot Deploy to Heroku The easiest way to deploy this Song Bot Mandator

Venuja Thilakarathna 2 Feb 19, 2022
Download Web-10K data by querying Bing Image Search

gpv2-web10k This repository contains the script to download images from the Web-10K dataset. The script takes in a list of queries, queries Bing Image

AI2 8 Sep 06, 2022
Download h3t4y for later read

h3nt4y_dl Download h3nt4y for later read Tải h3nt4y về đọc thôi nào các bạn ơiiiiiiii! (Tải từ h**taivn nhé) Usage: python get_that_ht4i.py New versio

1 Dec 03, 2021
A simple kemono.party downloader using python.

kemono-dl This is a simple kemono.party downloader. How to use Install python Download source code from releases and extract it Then install requireme

318 Dec 27, 2022
A python module to download ISO Standards

ISO Standards Downloader A python module to download ISO Standards from https://standards.iso.org/iso-iec/ Report Bug · Request Feature Table of conte

Daniel 1 Dec 29, 2021
SABnzbd - The automated Usenet download tool

SABnzbd is an Open Source Binary Newsreader written in Python.

SABnzbd 1.8k Dec 30, 2022
S3 file download with Python and access with VBA

S3 file download with Python and access with VBA This simple project is using the following stacks: Python AWS S3 VBA/Excel A Bitcoin API With this st

Julio Cesar Scheidt 0 Dec 07, 2021
The tool allows to download a list of tiktok sounds

dependencies: pip install requests how to use LAUCH THE PROGRAM file (option f)

carpal 3 Jan 21, 2022
Downloads data from OSM API and uploads it to the mapping sandbox.

OpenStreetMap To Sandbox This is a script to download data from OSM API and upload it to the mapping sandbox. Note that it clears all data in the sand

Ilya Zverev 5 Nov 27, 2022
Get the latest updates around you as they happen

Adherent We all are different, experience various things happening around us but we stick together. We are all a part of a greater community. As human

Shreyas Daniel 1 Nov 10, 2022
Discord Nitro Generator + Checker

Discord Nitro Generator + Checker Usage Download the project files and run main.py You will be prompted with 2 questions the first one being the amoun

509 Jan 02, 2023
YT-Spammer-Purge - Allows you easily scan for and delete scam comments using several methods

YouTube Spammer Purge What Is This? - Allows you to filter and search for spamme

4.3k Dec 31, 2022