Learn machine learning the fun way, with Oracle and Red Bull Racing

Overview

Red Bull Racing Analytics Hands-On Labs

License: UPL

Introduction

Are you interested in learning machine learning (ML)? How about doing this in the context of the exciting world of F1 racing?! Get your ML skills bootstrapped here with Oracle and Red Bull Racing!

Red Bull F1 Race Car

This tutorial teaches ML analytics with a series of hands-on labs (HOLs) using the Data Science service in Oracle Cloud Infrastructure.

You'll learn how to gather data from public data sources and analyze it using some of the latest ML techniques. Along the way you'll build ML models and test them in a predictor app.
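
To give a flavor of that workflow, here is a minimal sketch of the kind of steps the labs walk through: load public F1 data with pandas and fit a simple scikit-learn model. The file names, columns, and model choice below are illustrative assumptions, not the labs' actual notebooks.

    # Illustrative sketch only -- file names, columns, and the model are assumptions,
    # not the actual lab code.
    import pandas as pd
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    races = pd.read_csv("races.csv")      # hypothetical public F1 data export
    results = pd.read_csv("results.csv")  # hypothetical public F1 data export
    df = pd.merge(races, results, how="inner", on="raceId")

    # Hypothetical target: did the driver finish on the podium?
    X = df[["grid"]]                            # starting grid position as the only feature
    y = (df["positionOrder"] <= 3).astype(int)  # 1 = podium finish, 0 = otherwise

    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
    model = LogisticRegression().fit(X_train, y_train)
    print("holdout accuracy:", model.score(X_test, y_test))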

Getting Started

Some OCI infrastructure must be deployed before you can start this tutorial. See the Terraform documentation for more information.

After the OCI infrastructure is deployed, proceed with the beginner's tutorial to work through the ML labs.

Prerequisites

You must have an OCI account. You can create a new cloud account if you don't already have one.

This solution is designed to work with several OCI services so that you can be up and running quickly. The OCI resources required for this tutorial are listed in the Terraform documentation.

Notes/Issues

None at this time.

URLs

Contributing

This project is open source. Please submit your contributions by forking this repository and submitting a pull request! Oracle appreciates any contributions that are made by the open source community.

License

Copyright (c) 2021 Oracle and/or its affiliates.

Licensed under the Universal Permissive License (UPL), Version 1.0.

See LICENSE for more details.

Comments
  • Refactored Terraform code

    • Compatible with ORM, Cloud Shell and Terraform CLI
    • Updated README to include instructions for all three methods
    • Refactored, removing unnecessary resources (Vault, public Subnet, etc.).
    • Added a nerd knob so that it could use an existing Group (rather than create a new one)
    • Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names
    opened by timclegg 2
  • Issue with hands on lab guide - launchapp.sh missing

    https://github.com/oracle-devrel/redbull-analytics-hol/tree/main/beginners#beginners-hands-on-lab

    In Starting The Web Application it reads:

    cd /home/opc/redbull-analytics-hol/beginners/web
    ./launchapp.sh start

    However, launchapp.sh is missing, for example:

    (redbullenv) cd /home/opc/redbull-analytics-hol/beginners/web
    (redbullenv) ./launchapp.sh start
    bash: ./launchapp.sh: No such file or directory

    opened by raekins 1
  • fix: Updating schema.yaml syntax

    Making the variable notation follow the syntax shown in the docs (https://docs.oracle.com/en-us/iaas/Content/ResourceManager/Concepts/terraformconfigresourcemanager_topic-schema.htm)

    opened by timclegg 1
  • Exploratory Data Analysis Merge Issue

    Hello, I have been encountering an issue while running the lab. The Jupyter notebook 03.f1_analysis_EDA.ipynb fails in cell number 5:


    ValueError                                Traceback (most recent call last)
    ----> 1 df1 = pd.merge(races,results,how='inner',on=['raceId'])
          2 df2 = pd.merge(df1,quali,how='inner',on=['raceId','driverId','constructorId'])
          3 df3 = pd.merge(df2,drivers,how='inner',on=['driverId'])
          4 df4 = pd.merge(df3,constructors,how='inner',on=['constructorId'])
          5 df5 = pd.merge(df4,circuit,how='inner',on=['circuitId'])

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in merge(left, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate)
         85         copy=copy,
         86         indicator=indicator,
    ---> 87         validate=validate,
         88     )
         89     return op.get_result()

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in __init__(self, left, right, how, on, left_on, right_on, axis, left_index, right_index, sort, suffixes, copy, indicator, validate)
        654         # validate the merge keys dtypes. We may need to coerce
        655         # to avoid incompatible dtypes
    --> 656         self._maybe_coerce_merge_keys()
        657
        658         # If argument passed to validate,

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in _maybe_coerce_merge_keys(self)
       1163                 inferred_right in string_types and inferred_left not in string_types
       1164             ):
    -> 1165                 raise ValueError(msg)
       1166
       1167         # datetimelikes must match exactly

    ValueError: You are trying to merge on object and int64 columns. If you wish to proceed you should use pd.concat

    I’m using an automatic deployment provided by Oracle as part of their environment. I don’t have a lot of experience with Python, but one possible solution is to read the numeric values from the CSV file as integer or float; I’m almost certain the real fix is a little more elaborate than that 😉. Anyway, thanks for your time. I’m really excited to test your solution and finish the lab. Thanks again.

    opened by yankodavila 2
  • Has the PAR for the stack deploy image expired?

    Cannot deploy the stack; getting a PAR-expired message.

    2021/11/07 10:50:11[TERRAFORM_CONSOLE] [INFO] Error Message: work request did not succeed, workId: ocid1.coreservicesworkrequest.oc1.eu-amsterdam-1.abqw2ljrwz2n7qqj7ghdwtnlrqol355oumc7a6coushvgdrebskspaewh7ea, entity: image, action: CREATED. Message: Import image not found: PAR is invalid (maybe is expired or deleted), please check.

    PAR in stack file is https://objectstorage.eu-frankfurt-1.oraclecloud.com/p/khhPjc_IMuyBOMfZUcJajIzCpoZ5aC-D7VMCU__GVZRlIQueXLIIcaaqLOZIuT1a/n/emeasespainsandbox/b/publichol/o/redbullhol-20210809-1523

    opened by Mel-A-M 1
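
Regarding the Exploratory Data Analysis merge issue above, one possible workaround is to coerce the shared merge key to a numeric dtype in both frames before calling pd.merge. This is a hedged sketch, assuming raceId was read as a string in one of the CSVs; the file names mirror the notebook but are assumptions here, and the labs' intended fix may differ.

    import pandas as pd

    # Assumption: the notebook reads these CSVs earlier; names are illustrative.
    races = pd.read_csv("races.csv")
    results = pd.read_csv("results.csv")

    # If raceId was inferred as object (string) in one frame and int64 in the other,
    # pd.merge raises the ValueError shown above. Coercing the key to numeric in both
    # frames is one workaround; values that cannot be parsed become NaN and are dropped.
    for frame in (races, results):
        frame["raceId"] = pd.to_numeric(frame["raceId"], errors="coerce")

    races = races.dropna(subset=["raceId"])
    results = results.dropna(subset=["raceId"])
    races["raceId"] = races["raceId"].astype("int64")
    results["raceId"] = results["raceId"].astype("int64")

    df1 = pd.merge(races, results, how="inner", on=["raceId"])
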
Releases: v0.1.8
Owner: Oracle DevRel