Caffe implementation for Hu et al. Segmentation from Natural Language Expressions

Last update: Jul 27, 2021

Segmentation from Natural Language Expressions

This repository contains the Caffe reimplementation of the following paper:

R. Hu, M. Rohrbach, T. Darrell, Segmentation from Natural Language Expressions. arXiv:1603.06180, 2016.

@article{hu2016segmentation,
  title={Segmentation from Natural Language Expressions},
  author={Hu, Ronghang and Rohrbach, Marcus and Darrell, Trevor},
  journal={arXiv preprint arXiv:1603.06180},
  year={2016}
}

Project Page: http://ronghanghu.com/text_objseg

Installation

Install Caffe following the instructions here.
Download this repository or clone with Git, and then cd into the root directory of the repository.

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Download ReferIt dataset:

./referit/referit-dataset/download_referit_dataset.sh

Download the caffemodel containing the VGG-16 network parameters trained on the 1000 ImageNet classes.

Training

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=/path/to/text_objseg_caffe/:$PYTHONPATH
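If you would rather not change the shell environment, the same effect can be had from inside a Python session. The following is a minimal sketch, not part of the repository: the helper name add_repo_root_to_path is hypothetical, and the path is the same placeholder used in the export command above.

```python
import os
import sys


def add_repo_root_to_path(repo_root):
    """Prepend repo_root to sys.path (once) and return its absolute form.

    Hypothetical helper for illustration; it is not shipped with the repository.
    """
    repo_root = os.path.abspath(repo_root)
    if repo_root not in sys.path:
        sys.path.insert(0, repo_root)
    return repo_root


# Placeholder path, as in the export command; replace it with your clone location.
root = add_repo_root_to_path("/path/to/text_objseg_caffe")
```

Unlike the export command, this only affects the current Python process, so it has to be run before the repository modules are imported.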

Build training batches for bounding boxes:

python referit/build_training_batches_det.py

Build training batches for segmentation:

python referit/build_training_batches_seg.py

Configure the config.py file in the directory det_model and train the language-based bounding box localization model:

python det_model/train_det_model.py

Configure the config.py file in the directory seg_low_res_model and train the low resolution language-based segmentation model (starting from the bounding box localization model trained in the previous step):

python seg_low_res_model/train_low_res_model.py

Configure the config.py file in the directory seg_model and train the high resolution language-based segmentation model (starting from the low resolution segmentation model trained in the previous step):

python seg_model/train_seg_model.py
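The training steps above must run in the listed order, since each model starts from the previous one. As a rough sketch, they could be chained from a single Python script. The script paths are copied from the commands above; the function name run_training_pipeline and its runner parameter are assumptions made for illustration and are not part of the repository.

```python
import subprocess

# Script paths copied from the commands above, in the order the text gives them.
TRAINING_STEPS = [
    "referit/build_training_batches_det.py",
    "referit/build_training_batches_seg.py",
    "det_model/train_det_model.py",
    "seg_low_res_model/train_low_res_model.py",
    "seg_model/train_seg_model.py",
]


def run_training_pipeline(steps=TRAINING_STEPS, python="python",
                          runner=subprocess.check_call):
    """Run each step in order; check_call raises on the first failing step.

    Hypothetical helper: call it from the repository root, after every
    config.py mentioned above has been configured.
    """
    for script in steps:
        runner([python, script])


# Usage (from the repository root): run_training_pipeline()
```

Because subprocess.check_call raises CalledProcessError on a non-zero exit code, a failure in an earlier stage stops the chain before a later stage starts from a missing or incomplete model.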

Evaluation

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=/path/to/text_objseg_caffe/:$PYTHONPATH

Configure the test_config.py file in the directory seg_model and run evaluation for the high resolution language-based segmentation model:

python seg_model/test_seg_model.py

This should reproduce the results in the paper. You may also evaluate the language-based bounding box localization model:

python det_model/test_det_model.py

The results can be compared to this paper.
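When checking segmentation output by hand, a common measure is intersection-over-union (IoU) between a predicted mask and the ground-truth mask. The sketch below assumes that metric and is not taken from the repository's evaluation scripts; the function name mask_iou and the nested-list mask format are illustrative choices only.

```python
def mask_iou(pred, gt):
    """IoU between two binary masks given as equally sized nested lists of 0/1.

    Illustrative helper, not the repository's evaluation code. Returning 0.0
    when both masks are empty is a convention chosen here.
    """
    intersection = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            if p and g:
                intersection += 1
            if p or g:
                union += 1
    return intersection / float(union) if union else 0.0


# Toy 2x2 masks: one pixel overlaps, three pixels are covered in total.
pred_mask = [[1, 1],
             [0, 0]]
gt_mask = [[1, 0],
           [1, 0]]
score = mask_iou(pred_mask, gt_mask)
```

For the toy masks the intersection is 1 pixel and the union is 3 pixels, so the score is 1/3.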

Demo

There is a demo that you can try! Run the demo in ./demo/text_objseg_demo.ipynb with Jupyter Notebook (IPython Notebook).
