Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

Last update: Oct 10, 2022

Overview

2019-indian-election-eda

Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

This project is a part of the Course - Data Analysis using Python: Zero to Pandas offered by Jovian.ai.

We perform Exploratory Data Analyis on the 2019 Indian General Elections dataset. Here we use various Python libraries to perform Data Cleaning and Visualization. The Dataset which is used in this project is from Kaggle, authored by the user Prakrut Chauhan.

Link to the Dataset used - https://www.kaggle.com/prakrutchauhan/indian-candidates-for-general-election-2019

The dataset contains information of all the candidates who contested the elections from various Constituencies. Data includes personal information like Assets, Education, Criminal Record, etc. as well as electoral information such as Contesting Constituency, Political Party, Total Votes received, etc.

The Libraries used in the Project are:

Matplotlib (for visualization of data),
Seaborn (used alongside Matplotlib for visualization),
Numpy (used for operations on numeric data),
Pandas (used for utilising DataFrames and organising the data),
Jovian (used for downloading dataset and to run, save and upload the Notebook).

Apart from the above mentioned libraries, we use the opendatasets package to directly download the files from Kaggle and parse the data. Link to the package - https://github.com/JovianML/opendatasets

To view the Jupyter Notebook containing the EDA, click on the .ipynb file to open it. Scroll down to see the analysis. Some contents might not be visible in Dark Theme, so I recommend viewing the notebook in Light Theme.

The Notebook can also be viewed in Google Colab and Binder or can be downloaded and viewed locally.

Link to a Blog Post will be added soon.

Hope you like my work !!!

Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

Related tags

Overview

2019-indian-election-eda

Owner

Souradeep Banerjee

Statistical Rethinking course winter 2022

Statistical package in Python based on Pandas

Detecting Underwater Objects (DUO)

follow-analyzer helps GitHub users analyze their following and followers relationship

Containerized Demo of Apache Spark MLlib on a Data Lakehouse (2022)

AWS Glue ETL Code Samples

Repositori untuk menyimpan material Long Course STMKGxHMGI tentang Geophysical Python for Seismic Data Analysis

An ETL framework + Monitoring UI/API (experimental project for learning purposes)

4CAT: Capture and Analysis Toolkit

DaDRA (day-druh) is a Python library for Data-Driven Reachability Analysis.

Data Analysis for First Year Laboratory at Imperial College, London.

Desafio 1 ~ Bantotal

📊 Python Flask game that consolidates data from Nasdaq, allowing the user to practice buying and selling stocks.

CubingB is a timer/analyzer for speedsolving Rubik's cubes, with smart cube support

OpenDrift is a software for modeling the trajectories and fate of objects or substances drifting in the ocean, or even in the atmosphere.

The OHSDI OMOP Common Data Model allows for the systematic analysis of healthcare observational databases.

Functional tensors for probabilistic programming

A program that uses an API and a AI model to get info of sotcks

Geospatial data-science analysis on reasons behind delay in Grab ride-share services

A data structure that extends pyspark.sql.DataFrame with metadata information.