DS-Take-Home

Solution to the book "A Collection of Data Science Take-Home Challenges".

Note:

Please don't contact me for the dataset.

This repository is only for self-learning purpose. I am really happy if my solution is helpful to you. However, I won't provide the original book or the data files. If you want to do the exercise, you can go to https://datamasked.com/ to purchase the book. Please respect the author of the original work.

Conversion Rate
Spanish Translation A/B Test
Employee Retention
Identifying Fraudulent Activities
Funnel Analysis
Pricing Test
Marketing Email Campaign
Song Challenge
Clustering Grocery Items
Credit Card Transactions
User Referral Program
Loan Granting
Json City Similarities
Optimization of Employee Shuttle Stops
Diversity in the Workplace
URL Parsing Challenge
Engagement Test
On-Line Video Challenge
Subscription Retention Rate
Ads Analysis

Other useful resource: https://github.com/stasi009/TakeHomeDataChallenges

My solution to the book A Collection of Data Science Take-Home Challenges

Related tags

Overview

DS-Take-Home

Note:

Owner

Jifu Zhao

A set of tools to analyse the output from TraDIS analyses

Provide a market analysis (R)

talkbox is a scikit for signal/speech processing, to extend scipy capabilities in that domain.

Recommendations from Cramer: On the show Mad-Money (CNBC) Jim Cramer picks stocks which he recommends to buy. We will use this data to build a portfolio

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

Project under the certification "Data Analysis with Python" on FreeCodeCamp

Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code

A pipeline that creates consensus sequences from a Nanopore reads. I

Port of dplyr and other related R packages in python, using pipda.

ToeholdTools is a Python package and desktop app designed to facilitate analyzing and designing toehold switches, created as part of the 2021 iGEM competition.

Helper tools to construct probability distributions built from expert elicited data for use in monte carlo simulations.

wikirepo is a Python package that provides a framework to easily source and leverage standardized Wikidata information

Analyze the Gravitational wave data stored at LIGO/VIRGO observatories

SNV calling pipeline developed explicitly to process individual or trio vcf files obtained from Illumina based pipeline (grch37/grch38).

COVID-19 deaths statistics around the world

Pypeln is a simple yet powerful Python library for creating concurrent data pipelines.

DaDRA (day-druh) is a Python library for Data-Driven Reachability Analysis.

Analytical view of olist e-commerce in Brazil

Python-based Space Physics Environment Data Analysis Software