CowHerd is a partially-observed reinforcement learning environment

Last update: Mar 06, 2022

CowHerd

CowHerd is a partially-observed reinforcement learning environment in which the player walks around an area and is rewarded for milking cows. The cows try to escape, and the player can place fences to help capture them. The implementation of CowHerd is based on the Crafter environment.

Play Yourself

You can play the game yourself with an interactive window and keyboard input. The mapping from keys to actions, health level, and inventory state are printed to the terminal.

# Install with GUI
pip3 install 'cowherd[gui]'

# Start the game
cowherd

# Alternative way to start the game
python3 -m cowherd.run_gui

The following optional command line flags are available:

Flag	Default	Description
`--window`	800 800	Window size in pixels, used as width and height.
`--fps`	5	How many times to update the environment per second.
`--record <filename>.mp4`	None	Record a video of the trajectory.
`--num_cows`	3	The number of cows in the environment.
`--view`	7 7	The layout size in cells; determines view distance.
`--length`	None	Time limit for the episode.
`--seed`	None	Determines world generation and creatures.
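To make the value shapes in the table concrete, here is a minimal sketch of a parser that accepts the same flags. It is not the code used by `cowherd.run_gui`; the function name `build_parser` and the argument types (for example, that `--window` and `--view` take two integers and that `--record` takes a file path) are assumptions inferred from the defaults listed above.

```python
import argparse


def build_parser():
    # Hypothetical parser mirroring the flag table; the real CLI may differ.
    parser = argparse.ArgumentParser(prog="cowherd")
    parser.add_argument("--window", type=int, nargs=2, default=(800, 800))
    parser.add_argument("--fps", type=int, default=5)
    parser.add_argument("--record", type=str, default=None)
    parser.add_argument("--num_cows", type=int, default=3)
    parser.add_argument("--view", type=int, nargs=2, default=(7, 7))
    parser.add_argument("--length", type=int, default=None)
    parser.add_argument("--seed", type=int, default=None)
    return parser


# Parse an example command line instead of sys.argv.
args = build_parser().parse_args(
    ["--num_cows", "5", "--view", "9", "9", "--seed", "0"])
print(args.num_cows, args.view, args.window, args.record)
```

Under these assumptions, the equivalent invocation of the game would be `cowherd --num_cows 5 --view 9 9 --seed 0`, with the unspecified flags falling back to the defaults in the table.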

Training Agents

Installation: pip3 install -U cowherd

The environment follows the OpenAI Gym interface:

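import cowherd

# Create the environment with a fixed seed and get the first observation.
env = cowherd.Env(seed=0)
obs = env.reset()
assert obs.shape == (64, 64, 3)  # RGB image

# Act randomly until the episode ends.
done = False
while not done:
  action = env.action_space.sample()
  # reward is +1 whenever the player milks a cow.
  obs, reward, done, info = env.step(action)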
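The same loop can be wrapped in a helper that returns the episode return and length, which is often the first thing measured when training agents. The sketch below is illustrative only: `run_episode` and `StubEnv` are hypothetical names, and `StubEnv` is a toy stand-in with the same `reset`/`step` signature so the example runs without `cowherd` installed. Its reward rule is made up and does not reflect CowHerd's game logic.

```python
import random


class StubEnv:
    """Hypothetical stand-in exposing the same reset/step signature."""

    def __init__(self, length=10, num_actions=7, seed=0):
        self._length = length
        self._num_actions = num_actions
        self._rng = random.Random(seed)
        self._step = 0

    def sample_action(self):
        # Uniformly random action index, like action_space.sample().
        return self._rng.randrange(self._num_actions)

    def reset(self):
        self._step = 0
        return None  # A real environment returns an image here.

    def step(self, action):
        self._step += 1
        reward = 1.0 if action == 5 else 0.0  # Toy rule, not CowHerd's.
        done = self._step >= self._length
        return None, reward, done, {}


def run_episode(env, policy):
    """Run one episode and return (total reward, number of steps)."""
    obs = env.reset()
    total, steps, done = 0.0, 0, False
    while not done:
        obs, reward, done, info = env.step(policy(obs))
        total += reward
        steps += 1
    return total, steps


env = StubEnv(length=10)
total, steps = run_episode(env, lambda obs: env.sample_action())
print(total, steps)
```

Given the interface shown above, passing `cowherd.Env(seed=0)` together with a policy such as `lambda obs: env.action_space.sample()` should work the same way, though that has not been verified here.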

Environment Details

Reward

A reward of +1 is given every time the player milks one of the cows.

Termination

Episodes terminate after 1000 steps.

Observation Space

Each observation is an RGB image that shows a local view of the world around the player, as well as the inventory state of the agent.

Action Space

The action space is categorical. Each action is an integer index representing one of the possible actions:

Integer	Name	Description
0	`noop`	Do nothing.
1	`move_left`	Walk to the left.
2	`move_right`	Walk to the right.
3	`move_up`	Walk upwards.
4	`move_down`	Walk downwards.
5	`do`	Pick up a placed fence or milk a cow.
6	`place_fence`	Place a fence in front of the player.
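When writing scripted policies or logging, it can help to refer to actions by name rather than by raw integer. The following sketch encodes the table above as an `IntEnum`. The `Action` class and `action_name` helper are hypothetical conveniences and are not part of the `cowherd` package; only the index-to-name mapping comes from the table.

```python
from enum import IntEnum


class Action(IntEnum):
    # Indices copied from the action table; the class itself is illustrative.
    NOOP = 0
    MOVE_LEFT = 1
    MOVE_RIGHT = 2
    MOVE_UP = 3
    MOVE_DOWN = 4
    DO = 5
    PLACE_FENCE = 6


def action_name(index):
    """Return the lowercase name used in the table for an action index."""
    return Action(index).name.lower()


print(action_name(5))
print(int(Action.PLACE_FENCE))
```

Because `IntEnum` members are integers, a value such as `Action.PLACE_FENCE` can be used anywhere an integer action index is expected.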

Questions

Please open an issue on GitHub.

Owner

Danijar Hafner
