Statistics and Mathematics for Machine Learning, Deep Learning, Deep NLP

Last update: Dec 29, 2022

Stat4ML

This is the first course in our three-course series:

Statistics Foundation for ML https://github.com/Bellman281/Stat4ML/
Introduction to Statistical Learning https://github.com/Bellman281/Intro_Statistical_Learning
Advanced Statistical Learning for DL (to be announced)

Registration Form for cohort 2 of STAT4ML:

https://forms.gle/ZqLJLmv1K5nGVx3m7

Notes about the course:

Instructor: Omid Safarzadeh

LinkedIn: https://www.linkedin.com/in/omidsafarzadeh/

IG: @deepdatascientists

Course textbook: Statistical Inference, 2nd Edition, by George Casella and Roger L. Berger:

https://www.amazon.com/Statistical-Inference-George-Casella-dp-0534243126/dp/0534243126/ref=mt_other?_encoding=UTF8&me=&qid=

Prerequisites

Recall from Calculus:

    Derivative
          Chain rule
    Integral
          Techniques of Integration
                Substitution
                Integration by parts

Matrix Algebra Review:

    Matrix operations
    Matrix Multiplication
       Properties of determinants
       Inverse Matrix
       Matrix Transpose
       Properties of transpose
    Partitioned Matrices
    Eigenvalues and Eigenvectors
    Matrix decomposition
       LU decomposition
       Cholesky decomposition
       QR decomposition
       SVD
    Matrix Differentiation
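
As a quick refresher on one of the decompositions listed above, here is a minimal sketch of the Cholesky decomposition in plain Python. It is an illustration only and not taken from the course materials; the function name `cholesky` and the example matrix `A` are assumptions chosen for this example. For a symmetric positive definite matrix A it computes a lower-triangular factor with A = L L^T.

```python
import math


def cholesky(a):
    """Return the lower-triangular factor l with a = l * l^T.

    `a` is a symmetric positive definite matrix given as a list of rows.
    """
    n = len(a)
    l = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            # Dot product of the already computed parts of rows i and j.
            s = sum(l[i][k] * l[j][k] for k in range(j))
            if i == j:
                d = a[i][i] - s
                if d <= 0.0:
                    raise ValueError("matrix is not positive definite")
                l[i][j] = math.sqrt(d)
            else:
                l[i][j] = (a[i][j] - s) / l[j][j]
    return l


# Hypothetical example matrix (symmetric, positive definite).
A = [[4.0, 2.0], [2.0, 3.0]]
L_factor = cholesky(A)

# Rebuild A from the factor to check the decomposition.
rebuilt = [
    [sum(L_factor[i][k] * L_factor[j][k] for k in range(2)) for j in range(2)]
    for i in range(2)
]
print(L_factor)
print(rebuilt)
```

In practice a library routine would normally be used; the loop above is only meant to make the recurrence visible.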

Course 1:

Slide 1: Probability Theory Foundation

 Sample Space
 Probability Theory Foundation
    Axiomatic Foundations
    The Calculus of Probabilities
 Independence
 Conditional Probability
    Bayes Theorem
 Random Variables
 Probability Function
    Distribution Functions
    Density function
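
To give a flavor of the conditional probability material, the following small sketch applies Bayes' theorem to a binary event. It is not from the course slides; the function name `posterior` and the numbers (a 1% prior, a 95% true positive rate, a 5% false positive rate) are made up for illustration.

```python
def posterior(prior: float, likelihood: float, false_positive_rate: float) -> float:
    """Return P(A | B) for a binary event A and observed evidence B.

    prior               = P(A)
    likelihood          = P(B | A)
    false_positive_rate = P(B | not A)
    """
    # Law of total probability: P(B) = P(B|A) P(A) + P(B|not A) P(not A).
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    # Bayes' theorem: P(A|B) = P(B|A) P(A) / P(B).
    return likelihood * prior / evidence


# Hypothetical numbers: a rare condition and a fairly accurate test.
p = posterior(prior=0.01, likelihood=0.95, false_positive_rate=0.05)
print(round(p, 3))
```

Even with an accurate test, the posterior stays well below one half here because the prior is small.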

Slide 2: Moments

   Moments
       Expected Value
       Variance
       Covariance and Correlation
   Moment Generating Functions
       Normal mgf
   Matrix Notation for Moments
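
As a short worked illustration of the "Normal mgf" item, the standard result for X ~ N(mu, sigma^2) and the moments obtained by differentiating at t = 0 can be written as follows. This is a textbook identity added for orientation, not a transcription of the slides.

```latex
\begin{align*}
M_X(t) &= \mathbb{E}\left[e^{tX}\right]
        = \exp\!\left(\mu t + \tfrac{1}{2}\sigma^2 t^2\right),
        \qquad X \sim N(\mu, \sigma^2), \\
M_X'(t) &= \left(\mu + \sigma^2 t\right) M_X(t)
        \;\Rightarrow\; \mathbb{E}[X] = M_X'(0) = \mu, \\
M_X''(t) &= \left(\sigma^2 + \left(\mu + \sigma^2 t\right)^2\right) M_X(t)
        \;\Rightarrow\; \mathbb{E}[X^2] = M_X''(0) = \mu^2 + \sigma^2, \\
\operatorname{Var}(X) &= \mathbb{E}[X^2] - \left(\mathbb{E}[X]\right)^2 = \sigma^2 .
\end{align*}
```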

Slide 3: Distribution Functions

   Distributions
     Discrete Distribution
       Discrete Uniform Distribution
       Binomial Distribution
       Poisson Distribution
     Continuous Distribution
       Uniform Distribution
       Exponential Distribution
       Normal Distribution
       Lognormal Distribution
       Laplace Distribution
       Beta Distribution

Slide 4: Conditional and Multivariate Distributions

Joint and Marginal Distribution
Conditional Distributions and Independence
Bivariate Transformations
Hierarchical Models and Mixture Distribution
Bivariate Normal Distribution
Multivariate Distribution

Slide 5: Convergence Concepts

Random Samples
   Sums of Random Variables from a Random Sample
Inequalities
Convergence Concepts:
   Almost Sure Convergence
   Convergence in Probability
   Convergence in Distribution
The Delta Method
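
Convergence in probability can be made concrete with a small simulation of the weak law of large numbers. The sketch below is an illustration, not course code; the function name `sample_mean`, the seed, and the sample sizes are arbitrary choices. It draws Uniform(0, 1) samples, whose true mean is 0.5, and records how far the sample mean is from that value.

```python
import random


def sample_mean(n: int, seed: int = 0) -> float:
    """Mean of n independent Uniform(0, 1) draws from a seeded generator."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n):
        total += rng.random()
    return total / n


# Absolute error of the sample mean for growing sample sizes.
errors = {n: abs(sample_mean(n) - 0.5) for n in (100, 10_000, 1_000_000)}
for n, err in errors.items():
    print(n, err)
```

The error is random for any single run, but its typical size shrinks like 1 / sqrt(n), which is what the law of large numbers and the central limit theorem together suggest.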

Slide 6: Maximum Likelihood Estimation

Maximum Likelihood Estimation
  Motivation and the Main Ideas
  Properties of the Maximum Likelihood Estimator
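
As a minimal sketch of the main idea, consider i.i.d. data assumed to come from a normal distribution. The maximum likelihood estimates have closed forms: the sample mean and the variance with divisor n (not n - 1). The data set and the helper name `normal_log_likelihood` below are hypothetical and chosen only for illustration.

```python
import math
import statistics


def normal_log_likelihood(data, mu, sigma2):
    """Log-likelihood of i.i.d. N(mu, sigma2) observations."""
    n = len(data)
    ss = sum((x - mu) ** 2 for x in data)
    return -0.5 * n * math.log(2.0 * math.pi * sigma2) - ss / (2.0 * sigma2)


# Hypothetical sample.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Closed-form MLEs under the normal model.
mu_hat = statistics.fmean(data)                     # sample mean
sigma2_hat = statistics.pvariance(data, mu=mu_hat)  # variance with divisor n

best = normal_log_likelihood(data, mu_hat, sigma2_hat)
print(mu_hat, sigma2_hat, best)
```

Evaluating `normal_log_likelihood` at nearby parameter values gives a smaller value than `best`, which is a simple numerical check that the closed-form estimates sit at the maximum.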

Slide 7: Bayesian and Posterior Distribution Estimation

   Computing the posterior
   Maximum likelihood estimation (MLE)
   Maximum a posteriori (MAP) estimation
   Posterior mean
   MAP properties
   Bayesian linear regression
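
The relationship between the MLE, the MAP estimate, and the posterior mean can be sketched with the conjugate Beta-Bernoulli model. This example is an assumption-laden illustration rather than course material: the function name `beta_bernoulli_estimates`, the Beta(2, 2) prior, and the counts (7 successes in 10 trials) are all hypothetical. With a Beta(a, b) prior and k successes in n trials, the posterior is Beta(a + k, b + n - k).

```python
def beta_bernoulli_estimates(a: float, b: float, k: int, n: int) -> dict:
    """Point estimates of a Bernoulli success probability.

    Prior: Beta(a, b). Data: k successes in n trials.
    Posterior: Beta(a + k, b + n - k).
    The MAP formula below assumes both posterior parameters exceed 1.
    """
    post_a = a + k
    post_b = b + n - k
    return {
        "mle": k / n,                                 # ignores the prior
        "map": (post_a - 1) / (post_a + post_b - 2),  # posterior mode
        "posterior_mean": post_a / (post_a + post_b),
    }


# Hypothetical prior and data.
est = beta_bernoulli_estimates(a=2.0, b=2.0, k=7, n=10)
print(est)
```

With this symmetric prior, both Bayesian estimates are pulled from the MLE toward 0.5, and the pull weakens as n grows.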
