This is an implementation of DeepMind's Visual Interaction Networks paper using PyTorch.

Last update: Dec 06, 2022

Overview

Visual-Interaction-Networks

An implementation of DeepMind's Visual Interaction Networks in PyTorch.

Introduction

To better understand the challenge of relational reasoning, DeepMind published Visual Interaction Networks (VIN), a model that predicts the future in a physical scene. From just a glance, humans can infer not only what objects are where, but also what will happen to them over the upcoming seconds, minutes, and in some cases even longer. For example, if you kick a football against a wall, your brain predicts what will happen when the ball hits the wall and how the movements of both will be affected afterwards (the ball will ricochet at a speed proportional to the kick and, in most cases, the wall will remain where it is).

Architecture

Data

I used [email protected] physics engine to generate the data.

Just run physics_engine.py.
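
For readers who want a feel for what a physics engine of this kind produces, here is a minimal sketch of a 2D gravitational n-body simulator in NumPy. It is not the code in physics_engine.py. The function name `simulate`, its parameters, the softening constant, and the state layout (mass, x, y, vx, vy) are all assumptions made for illustration, and the real script may generate different systems or store data differently.

```python
import numpy as np


def simulate(n_objects=3, n_steps=100, dt=0.01, g=1.0, seed=0):
    """Hypothetical helper: return states of shape (n_steps, n_objects, 5).

    Each state row holds (mass, x, y, vx, vy). This layout is an assumption,
    not necessarily what physics_engine.py writes out.
    """
    rng = np.random.RandomState(seed)
    mass = rng.uniform(0.5, 2.0, size=n_objects)
    pos = rng.uniform(-1.0, 1.0, size=(n_objects, 2))
    vel = rng.uniform(-0.5, 0.5, size=(n_objects, 2))

    states = np.zeros((n_steps, n_objects, 5))
    for t in range(n_steps):
        # Record the current state before advancing the simulation.
        states[t, :, 0] = mass
        states[t, :, 1:3] = pos
        states[t, :, 3:5] = vel

        # diff[i, j] is the vector pointing from object i to object j.
        diff = pos[None, :, :] - pos[:, None, :]
        # Add the identity so the diagonal (self-distance) is never zero.
        dist = np.linalg.norm(diff, axis=-1) + np.eye(n_objects)
        # Newtonian attraction; the small constant softens close encounters.
        acc = g * (mass[None, :, None] * diff
                   / (dist[..., None] ** 3 + 1e-3)).sum(axis=1)

        # Semi-implicit Euler step: update velocity, then position.
        vel = vel + acc * dt
        pos = pos + vel * dt
    return states


if __name__ == "__main__":
    trajectory = simulate()
    print(trajectory.shape)
```

A model such as VIN would be trained on rendered frames of trajectories like these, with the state vectors serving as prediction targets.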

Usage

Main Dependencies

Python 3.5
PyTorch 0.3
numpy 1.13.1

Run

Edit the configuration file to meet your needs.
Run vin.py.

References

https://github.com/jaesik817/visual-interaction-networks_tensorflow

Owner

Mahmoud Gamal Salem
