Unofficial JAX implementations of Deep Learning models

Last update: Jan 05, 2023

Overview

JAX Models

Table of Contents

About The Project
Getting Started
Contributing
License
Contact

About The Project

The JAX Models repository aims to provide open sourced JAX/Flax implementations for research papers originally without code or code written with frameworks other than JAX. The goal of this project is to make a collection of models, layers, activations and other utilities that are most commonly used for research. All papers and derived or translated code is cited in either the README or the docstrings. If you think that any citation is missed then please raise an issue.

All implementations provided here are available on Papers With Code.

Available model implementations for JAX are:

MetaFormer is Actually What You Need for Vision (Weihao Yu et al., 2021)
Augmenting Convolutional networks with attention-based aggregation (Hugo Touvron et al., 2021)
MPViT : Multi-Path Vision Transformer for Dense Prediction (Youngwan Lee et al., 2021)
MLP-Mixer: An all-MLP Architecture for Vision (Ilya Tolstikhin et al., 2021)
Patches Are All You Need (Anonymous et al., 2021)
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers (Enze Xie et al., 2021)
A ConvNet for the 2020s (Zhuang Liu et al., 2021)
Masked Autoencoders Are Scalable Vision Learners (Kaiming He et al., 2021)

Available layers for out-of-the-box integration:

DropPath (Stochastic Depth) (Gao Huang et al., 2021)
Squeeze-and-Excitation Layer (Jie Hu et al. 2019)
Depthwise Convolution (François Chollet, 2017)

Prerequisites

Prerequisites can be installed separately through the requirements.txt file in the main directory using:

pip install -r requirements.txt

The use of a virtual environment is highly recommended to avoid version incompatibilites.

Installation

This project is built with Python 3 for the latest JAX/Flax versions and can be directly installed via pip.

pip install jax-models

If you wish to use the latest version then you can directly clone the repository too.

git clone https://github.com/DarshanDeshpande/jax-models.git

Usage

To see all model architectures available:

from jax_models.models.model_registry import list_models
from pprint import pprint

pprint(list_models())

To load your desired model:

from jax_models.models.model_registry import load_model
load_model('mpvit-base', attach_head=True, num_classes=1000, dropout=0.1)

Contributing

Please raise an issue if any implementation gives incorrect results, crashes unexpectedly during training/inference or if any citation is missing.

You can contribute to jax_models by supporting me with compute resources or by contributing your own resources to provide pretrained weights.

If you wish to donate to this inititative then please drop me a mail here.

License

Distributed under the Apache 2.0 License. See LICENSE for more information.

Contact

Feel free to reach out for any issues or requests related to these implementations

Darshan Deshpande - Email | Twitter | LinkedIn

You might also like...

Very deep VAEs in JAX/Flax

Very Deep VAEs in JAX/Flax Implementation of the experiments in the paper Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on I

42 Dec 12, 2022

Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX

CQL-JAX This repository implements Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX (FLAX). Implementation is built on

8 Nov 7, 2022

PyTorch implementations of neural network models for keyword spotting

Honk: CNNs for Keyword Spotting Honk is a PyTorch reimplementation of Google's TensorFlow convolutional neural networks for keyword spotting, which ac

475 Dec 15, 2022

Unofficial implementation of Proxy Anchor Loss for Deep Metric Learning

Proxy Anchor Loss for Deep Metric Learning Unofficial pytorch, tensorflow and mxnet implementations of Proxy Anchor Loss for Deep Metric Learning. Not

3 Jun 9, 2021

Time-series-deep-learning - Developing Deep learning LSTM, BiLSTM models, and NeuralProphet for multi-step time-series forecasting of stock price.

Stock Price Prediction Using Deep Learning Univariate Time Series Predicting stock price using historical data of a company using Neural networks for

7 Nov 27, 2022

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

FedJAX: Federated learning with JAX What is FedJAX? FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX. FedJAX priori

208 Dec 14, 2022

Objax Apache-2Objax (🥉19 · ⭐ 580) - Objax is a machine learning framework that provides an Object.. Apache-2 jax

Objax Tutorials | Install | Documentation | Philosophy This is not an officially supported Google product. Objax is an open source machine learning fr

729 Jan 2, 2023

Plug-n-Play Reinforcement Learning in Python with OpenAI Gym and JAX

coax is built on top of JAX, but it doesn't have an explicit dependence on the jax python package. The reason is that your version of jaxlib will depend on your CUDA version.

128 Dec 27, 2022

JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"

Optimal Model Design for Reinforcement Learning This repository contains JAX code for the paper Control-Oriented Model-Based Reinforcement Learning wi

43 Sep 28, 2022

Comments

Missing Axis Swap in ExtractPatches and MergePatches

In patch_utils.py, the modules ExtractPatches and MergePatches are missing an axis swap between the reshapes, resulting in the extracted patches becoming horizontal stripes. For example, if we follow the code in ExtractPatches:

>>> inputs = jnp.arange(16).reshape(1, 4, 4, 1)
>>> inputs[0, :, :, 0]

DeviceArray([[ 0,  1,  2,  3],
             [ 4,  5,  6,  7],
             [ 8,  9, 10, 11],
             [12, 13, 14, 15]], dtype=int32)

>>> patch_size = 2
>>> batch, height, width, channels = inputs.shape
>>> height, width = height // patch_size, width // patch_size
>>> x = jnp.reshape(inputs, (batch, height, patch_size, width, patch_size, channels))
>>> x = jnp.reshape(x, (batch, height * width, patch_size ** 2 * channels))
>>> x[0, 0, :]

DeviceArray([0, 1, 2, 3], dtype=int32)

We see that the first patch extracted is not the patch containing [0, 1, 4, 5], but the horizontal stripe [0, 1, 2, 3]. To fix this problem, we should add an axis swap. For ExtractPatches, this should be:

batch, height, width, channels = inputs.shape
height, width = height // patch_size, width // patch_size
x = jnp.reshape(
    inputs, (batch, height, patch_size, width, patch_size, channels)
)
x = jnp.swapaxes(x, 2, 3)
x = jnp.reshape(x, (batch, height * width, patch_size ** 2 * channels))

For MergePatches, this should be:

batch, length, _ = inputs.shape
height = width = int(length**0.5)
x = jnp.reshape(inputs, (batch, height, width, patch_size, patch_size, -1))
x = jnp.swapaxes(x, 2, 3)
x = jnp.reshape(x, (batch, height * patch_size, width * patch_size, -1))

bug

opened by young-geng 4

fix convnext to make it work with jax.jit

Hey, first of all, thanks for the nice codebase. When doing inference using the convnext model, I noticed the following issue:

Calling x.item() will call float(x), which breaks the jit tracer. We can remove the list comprehension in unnecessary conversion to make jax.jit work. Without jax.jit, the model is very slow for me, running with only ~30% GPU utilization (RTX 3090).

This issue could apply to other models as well, maybe it is a good idea to include a test for applying jax.jit to each model?

opened by maxidl 1

Releases(v0.5-van)

v0.5-van(Feb 27, 2022)

Weights for Visual Attention Network (Meng-Hao Guo et al., 2022). All weights translated from the official repository. Full credits go to the original authors.
Source code(tar.gz)
Source code(zip)
van_base.weights(101.50 MB)
van_large.weights(170.97 MB)
van_small.weights(52.93 MB)
van_tiny.weights(15.70 MB)
v0.4-cait(Feb 18, 2022)

Weights for Going deeper with Image Transformers (Hugo Touvron et al., 2021) These weights have been translated from the official Github repository and all credits for the weights go to the original authors.
Source code(tar.gz)
Source code(zip)
cait_m36_384.weights(1034.64 MB)
cait_m48_448.weights(1359.81 MB)
cait_s24_224.weights(178.98 MB)
cait_s24_384.weights(179.54 MB)
cait_s36_384.weights(260.81 MB)
cait_xs24_384.weights(101.75 MB)
cait_xxs24_224.weights(45.62 MB)
cait_xxs24_384.weights(45.90 MB)
cait_xxs36_224.weights(66.01 MB)
cait_xxs36_384.weights(66.29 MB)
v0.3-pvit(Feb 11, 2022)

Weights from Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions (Wenhai Wang et al., 2021). All credits for these weights go to the original authors.
Source code(tar.gz)
Source code(zip)
pvit_b0.weights(13.99 MB)
pvit_b1.weights(53.44 MB)
pvit_b2.weights(96.76 MB)
pvit_b2_linear.weights(86.04 MB)
pvit_b3.weights(172.58 MB)
pvit_b4.weights(238.65 MB)
pvit_b5.weights(312.66 MB)
v0.2-convnext(Feb 6, 2022)

Weights for ConvNeXt (Zhuang Liu et al, 2022) translated from the official repository.

All credits for the weights go to the original authors.
Source code(tar.gz)
Source code(zip)
convnext_base_224_1k.weights(337.96 MB)
convnext_base_224_22k.weights(419.45 MB)
convnext_base_224_22k_1k.weights(337.96 MB)
convnext_base_384_1k.weights(337.96 MB)
convnext_base_384_22k_1k.weights(337.96 MB)
convnext_large_224_1k.weights(754.43 MB)
convnext_large_224_22k.weights(876.62 MB)
convnext_large_224_22k_1k.weights(754.43 MB)
convnext_large_384_1k.weights(754.43 MB)
convnext_large_384_22k_1k.weights(754.43 MB)
convnext_small_224_1k.weights(191.59 MB)
convnext_tiny_224_1k.weights(109.06 MB)
convnext_xlarge_224_22k.weights(1498.80 MB)
convnext_xlarge_224_22k_1k.weights(1335.90 MB)
convnext_xlarge_384_22k_1k.weights(1335.90 MB)
v0.1-swin(Jan 24, 2022)

This release contains weights for the entire stack of Swin Transformer models (SwinTiny224, SwinSmall224, SwinBase224, SwinBase384, SwinLarge224, SwinLarge384).

These weights have been ported from the official repository and timm.
Source code(tar.gz)
Source code(zip)
swin_base_224_22k.weights(416.30 MB)
swin_base_384_22k.weights(416.82 MB)
swin_large_224_22k.weights(871.91 MB)
swin_large_384_22k.weights(872.69 MB)
swin_small_224_1k.weights(189.24 MB)
swin_tiny_224_1k.weights(107.91 MB)

Owner

Helping Machines Learn Better 💻😃

GitHub Repository

A PyTorch library for Vision Transformers

VFormer A PyTorch library for Vision Transformers Getting Started Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contribut

142 Nov 28, 2022

A curated list of neural network pruning resources.

A curated list of neural network pruning and related resources. Inspired by awesome-deep-vision, awesome-adversarial-machine-learning, awesome-deep-learning-papers and Awesome-NAS.

1.7k Jan 09, 2023

Retrieval.pytorch - The code we used in [2020 DIGIX]

2 Feb 07, 2022

Codes for AAAI 2022 paper: Context-aware Health Event Prediction via Transition Functions on Dynamic Disease Graphs

Context-Aware-Healthcare Codes for AAAI 2022 paper: Context-aware Health Event Prediction via Transition Functions on Dynamic Disease Graphs Download

9 Dec 26, 2022

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

ESRGAN (Enhanced SRGAN) [ 🚀 BasicSR] [Real-ESRGAN] ✨ New Updates. We have extended ESRGAN to Real-ESRGAN, which is a more practical algorithm for rea

4.7k Jan 02, 2023

Predictive Maintenance LSTM

Predictive-Maintenance-LSTM - Predictive maintenance study for Complex case study, we've obtained failure causes by operational error and more deeply by design mistakes.

1 Dec 31, 2021

ruptures: change point detection in Python

Welcome to ruptures ruptures is a Python library for off-line change point detection. This package provides methods for the analysis and segmentation

1.1k Jan 03, 2023

A PyTorch-Based Framework for Deep Learning in Computer Vision

TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision @misc{you2019torchcv, author = {Ansheng You and Xiangtai Li and Zhen Zhu a

2.2k Jan 09, 2023

Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!

Serpent.AI - Game Agent Framework (Python) Update: Revival (May 2020) Development work has resumed on the framework with the aim of bringing it into 2

6.4k Jan 05, 2023

Point-NeRF: Point-based Neural Radiance Fields

Point-NeRF: Point-based Neural Radiance Fields Project Sites | Paper | Primary c

662 Jan 01, 2023

Implements Stacked-RNN in numpy and torch with manual forward and backward functions

Recurrent Neural Networks Implements simple recurrent network and a stacked recurrent network in numpy and torch respectively. Both flavours implement

1 Nov 16, 2021

This implements the learning and inference/proposal algorithm described in "Learning to Propose Objects, Krähenbühl and Koltun"

Learning to propose objects This implements the learning and inference/proposal algorithm described in "Learning to Propose Objects, Krähenbühl and Ko

90 Sep 10, 2021

SOFT: Softmax-free Transformer with Linear Complexity, NeurIPS 2021 Spotlight

SOFT: Softmax-free Transformer with Linear Complexity SOFT: Softmax-free Transformer with Linear Complexity, Jiachen Lu, Jinghan Yao, Junge Zhang, Xia

272 Dec 25, 2022

A computational block to solve entity alignment over textual attributes in a knowledge graph creation pipeline.

How to apply? Create your config.ini file following the example provided in config.ini Choose one of the options below to run: Run with Python3 pip in

3 Jun 23, 2022

AI grand challenge 2020 Repo (Speech Recognition Track)

KorBERT를 활용한 한국어 텍스트 기반 위협 상황인지(2020 인공지능 그랜드 챌린지) 본 프로젝트는 ETRI에서 제공된 한국어 korBERT 모델을 활용하여 폭력 기반 한국어 텍스트를 분류하는 다양한 분류 모델들을 제공합니다. 본 개발자들이 참여한 2020 인공지

23 Jan 25, 2022

Official repository of the paper "A Variational Approximation for Analyzing the Dynamics of Panel Data". Mixed Effect Neural ODE. UAI 2021.

Official repository of the paper (UAI 2021) "A Variational Approximation for Analyzing the Dynamics of Panel Data", Mixed Effect Neural ODE. Panel dat

7 Nov 26, 2022

Fine-Tune EleutherAI GPT-Neo to Generate Netflix Movie Descriptions in Only 47 Lines of Code Using Hugginface And DeepSpeed

GPT-Neo-2.7B Fine-Tuning Example Using HuggingFace & DeepSpeed Installation cd venv/bin ./pip install -r ../../requirements.txt ./pip install deepspe

180 Jan 05, 2023