a test times augmentation toolkit based on paddle2.0.

Last update: Dec 03, 2022

Related tags

Overview

Patta

Image Test Time Augmentation with Paddle2.0!

           Input
             |           # input batch of images 
        / / /|\ \ \      # apply augmentations (flips, rotation, scale, etc.)
       | | | | | | |     # pass augmented batches through model
       | | | | | | |     # reverse transformations for each batch of masks/labels
        \ \ \ / / /      # merge predictions (mean, max, gmean, etc.)
             |           # output batch of masks/labels
           Output

Quick Start

Test
Predict
Use Tools

Transforms
Aliases
Merge modes
Installation

Quick start (Default Transforms)

Test

We support that you can use the following to test after defining the network.

Segmentation model wrapping [docstring]:

import patta as tta
tta_model = tta.SegmentationTTAWrapper(model, tta.aliases.d4_transform(), merge_mode='mean')

Classification model wrapping [docstring]:

tta_model = tta.ClassificationTTAWrapper(model, tta.aliases.five_crop_transform())

Keypoints model wrapping [docstring]:

tta_model = tta.KeypointsTTAWrapper(model, tta.aliases.flip_transform(), scaled=True)

Note: the model must return keypoints in the format Tensor([x1, y1, ..., xn, yn])

Predict

We support that you can use the following to test when you have the static model: *.pdmodel、*.pdiparams、*.pdiparams.info.

Load model [docstring]:

import patta as tta
model = tta.load_model(path='output/model')

Segmentation model wrapping [docstring]:

tta_model = tta.SegmentationTTAWrapper(model, tta.aliases.d4_transform(), merge_mode='mean')

Classification model wrapping [docstring]:

tta_model = tta.ClassificationTTAWrapper(model, tta.aliases.five_crop_transform())

Keypoints model wrapping [docstring]:

tta_model = tta.KeypointsTTAWrapper(model, tta.aliases.flip_transform(), scaled=True)

Use-Tools

Segmentation model [docstring]:

We recommend modifying the file seg.py according to your own model.

python seg.py --model_path='output/model' \
                 --batch_size=16 \
                 --test_dataset='test.txt'

Note: Related to paddleseg

Advanced-Examples (DIY Transforms)

Custom transform:

# defined 2 * 2 * 3 * 3 = 36 augmentations !
transforms = tta.Compose(
    [
        tta.HorizontalFlip(),
        tta.Rotate90(angles=[0, 180]),
        tta.Scale(scales=[1, 2, 4]),
        tta.Multiply(factors=[0.9, 1, 1.1]),        
    ]
)

tta_model = tta.SegmentationTTAWrapper(model, transforms)

Custom model (multi-input / multi-output)

# Example how to process ONE batch on images with TTA
# Here `image`/`mask` are 4D tensors (B, C, H, W), `label` is 2D tensor (B, N)

for transformer in transforms: # custom transforms or e.g. tta.aliases.d4_transform() 
    
    # augment image
    augmented_image = transformer.augment_image(image)
    
    # pass to model
    model_output = model(augmented_image, another_input_data)
    
    # reverse augmentation for mask and label
    deaug_mask = transformer.deaugment_mask(model_output['mask'])
    deaug_label = transformer.deaugment_label(model_output['label'])
    
    # save results
    labels.append(deaug_mask)
    masks.append(deaug_label)
    
# reduce results as you want, e.g mean/max/min
label = mean(labels)
mask = mean(masks)

Optional Transforms

Transform	Parameters	Values
HorizontalFlip	-	-
VerticalFlip	-	-
Rotate90	angles	List[0, 90, 180, 270]
Scale	scales interpolation	List[float] "nearest"/"linear"
Resize	sizes original_size interpolation	List[Tuple[int, int]] Tuple[int,int] "nearest"/"linear"
Add	values	List[float]
Multiply	factors	List[float]
FiveCrops	crop_height crop_width	int int

Aliases (Combos)

flip_transform (horizontal + vertical flips)
hflip_transform (horizontal flip)
d4_transform (flips + rotation 0, 90, 180, 270)
multiscale_transform (scale transform, take scales as input parameter)
five_crop_transform (corner crops + center crop)
ten_crop_transform (five crops + five crops on horizontal flip)

Merge-modes

mean
gmean (geometric mean)
sum
max
min
tsharpen (temperature sharpen with t=0.5)

Installation

PyPI:

# After downloading the whole dir
$ git clone https://github.com/AgentMaker/PaTTA.git
$ pip install PaTTA/

# or

$ pip install git+https://github.com/AgentMaker/PaTTA.git

Run tests

# run test_transforms.py and test_base.py for test
python test/test_transforms.py
python test/test_base.py

Comments

preprocess issue

issue 1

当我将crop_size调至(1024,512), 报错

Traceback (most recent call last): File "PaTTA/tools/seg.py", line 41, in main(args.batch_size, imgs_list, args.crop_size) File "PaTTA/tools/seg.py", line 26, in main tensor_img = tta_model(tensor_img) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/layers.py", line 902, in call outputs = self.forward(*inputs, **kwargs) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/patta/wrappers.py", line 39, in forward augmented_output = self.model(augmented_image, *args)[0] File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/layers.py", line 902, in call outputs = self.forward(*inputs, **kwargs) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/io.py", line 1170, in i_m_p_l return _run_dygraph(self, input, program_holder) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/io.py", line 733, in _run_dygraph 'is_test': instance._is_test File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/tracer.py", line 45, in trace_op not stop_gradient) ValueError: (InvalidArgument) Broadcast dimension mismatch. Operands could not be broadcast together with the shape of X = [16, 48, 128, 256] and the shape of Y = [16, 48, 384, 384]. Received [128] in X is not equal to [384] in Y at i:2. [Hint: Expected x_dims_array[i] == y_dims_array[i] || x_dims_array[i] <= 1 || y_dims_array[i] <= 1 == true, but received x_dims_array[i] == y_dims_array[i] || x_dims_array[i] <= 1 || y_dims_array[i] <= 1:0 != true:1.] (at /paddle/paddle/fluid/operators/elementwise/elementwise_op_function.h:160) [operator < elementwise_add > error] [operator < run_program > error]

事实上修改任意crop_size都报错，但是改为1536,1536即数据集的图片尺寸，上述错误解决，但是issue2出现

issue 2

Traceback (most recent call last): File "PaTTA/tools/seg.py", line 41, in main(args.batch_size, imgs_list, args.crop_size) File "PaTTA/tools/seg.py", line 26, in main tensor_img = tta_model(tensor_img) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/layers.py", line 902, in call outputs = self.forward(*inputs, **kwargs) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/patta/wrappers.py", line 39, in forward augmented_output = self.model(augmented_image, *args)[0] File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/layers.py", line 902, in call outputs = self.forward(*inputs, **kwargs) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/io.py", line 1170, in i_m_p_l return _run_dygraph(self, input, program_holder) File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/io.py", line 733, in _run_dygraph 'is_test': instance._is_test File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/tracer.py", line 45, in trace_op not stop_gradient) ValueError: (InvalidArgument) The 'shape' in ReshapeOp is invalid. The input tensor X'size must be equal to the capacity of 'shape'. But received X's shape = [16, 512, 384, 384], X's size = 1207959552, 'shape' is [1, 512, 147456], the capacity of 'shape' is 75497472. [Hint: Expected capacity == in_size, but received capacity:75497472 != in_size:1207959552.] (at /paddle/paddle/fluid/operators/reshape_op.cc:222) [operator < reshape2 > error] [operator < run_program > error]

opened by CoderChen01 10
修复各种测试，并利用 GitHub Actions 自动化测试

貌似原来测试是跑不起来的，有些还是 pytorch 的代码，因此修复了下测试，并添加了 CI 配置以自动化测试。

另外 Resize 代码也是跑不起来的，原因是 paddle 里参数 align_corners 应当只是 bool，不允许是 None，因此对 transform 和 functional 里的代码也做了少许调整。

opened by SigureMo 4
[PaddlePaddle Hackathon] add image augment algorithms
Task: https://github.com/AgentMaker/PaTTA/issues/5

Description: 新增不低于5个图像方向的数据增强算法，并且这些算法能够略微、显著提升推理成绩，以提升 PaTTA 可用性

[x] 算法 * 7

HorizontalShift 水平平移（DualTransform）

VerticalShift 竖直平移（DualTransform）

AdjustContrast 调节图片对比度（ImageOnlyTransform）

AdjustBrightness 调节图片亮度（ImageOnlyTransform）

AverageBlur 均值滤波（ImageOnlyTransform）

GaussianBlur 高斯滤波（ImageOnlyTransform）

Sharpen 锐化（ImageOnlyTransform）

[x] 文档（README + docstring）

[x] 单元测试

[x] AI Studio 自测（部分公开，有效期三天）：https://aistudio.baidu.com/studio/project/partial/verify/2586123/9bf6d33c51e34ff1984273b17488dc8b

以上所有算法均使用批处理方式进行，避免在 Python 中调用低效的 for 循环，其中后三种滤波方式使用 paddle.nn.functional.conv2d 实现，边缘直接使用 pad 0 后卷积，未作 OpenCV 中的那些特殊处理，但非边缘部分处理效果与直接调用 OpenCV 效果一致～
PaddlePaddle Hackathon
opened by SigureMo 2
【PaddlePaddle Hackathon】97 新增图像数据增强算法
（此 ISSUE 为 PaddlePaddle Hackathon 活动的任务 ISSUE，更多详见PaddlePaddle Hackathon）

PaTTA 是一个致力于让模型表现更加稳定的飞桨模型测试增强工具箱，其原理为在测试时对要推理的数据进行增强，通过投票形式选出更稳健的推理结果。

【任务说明】

任务标题：新增图像数据增强算法

技术标签：Python

任务难度：简单

详细描述：数据增强是一种比较有效的模型能力提升方式，更多的组合可使得模型在训练时更加关注目标特征，从而进一步提升模型成绩。目前 PaTTA 中仅具备高频的图像数据增强算法。本这个项目，需要你新增不低于5个图像方向的数据增强算法，并且这些算法能够略微、显著提升推理成绩，以提升 PaTTA 可用性。

【提交内容】

项目 PR 到 PaTTA

技术说明文档

【项目技术要求】

具有基础的 Python 开发能力

有过在深度学习中使用图像增强的经历

PaddlePaddle Hackathon
opened by GT-ZhangAcer 2
【PaddlePaddle Hackathon】AgentMaker 任务合集

Hi，大家好，非常高兴的告诉大家，首届 PaddlePaddle Hackathon 开始啦。PaddlePaddle Hackathon 是面向全球开发者的深度学习领域编程活动，鼓励开发者了解与参与 PaddlePaddle。本次共有四大方向（PaddlePaddle、Paddle Family、Paddle Friends、Paddle Anything）四大方向，共计100个任务共大家完成。详细信息可以参考 PaddlePaddle Hackathon 说明。大家是否已经迫不及待了呢~

本 ISSUE 是 Paddle Friends 专区 AgentMaker 方向任务合集。具体任务列表如下：

| 序号 | 难度 | 任务 ISSUE | | ---- | ---- | --------------------------------------------------------- | | 96 | ⭐️ | 【PaddlePaddle Hackathon】96 图像分类模型解释性可视化探究 | | 97 | ⭐️ | 【PaddlePaddle Hackathon】97 新增图像数据增强算法 | | 98 | ⭐️ | 【PaddlePaddle Hackathon】98 搜索测试图像增强最佳方案探索 | | 99 | ⭐️ | 【PaddlePaddle Hackathon】99 为 AgentOCR 工具适配 JavaScript 环境 | | 100 | ⭐️ ⭐️ | 【PaddlePaddle Hackathon】100 制作 Rubick 深度学习相关小插件 |

若想要认领本次活动任务，请至 PaddlePaddle Hackathon Pinned ISSUE 完成活动报名以及任务认领。

活动官网：PaddlePaddle Hackathon
PaddlePaddle Hackathon

opened by GT-ZhangAcer 0
【PaddlePaddle Hackathon】98 搜索测试图像增强最佳方案探索
（此 ISSUE 为 PaddlePaddle Hackathon 活动的任务 ISSUE，更多详见PaddlePaddle Hackathon）

PaTTA 就是一个致力于让模型表现更加稳定的飞桨模型测试增强工具箱，其原理为在测试时对要推理的数据进行增强，通过投票形式选出更稳健的推理结果。

【任务说明】

任务标题：搜索测试图像增强最佳方案探索

技术标签：Python、PaddlePaddle

任务难度：简单

详细描述：在一般的深度学习赛事中，模型融合、TTA 等策略虽然能有效提升选手成绩，但这些方案在性能上往往难以应用于真实场景。虽然 PaTTA 提供了 TTA 工具，但我们也可以思考是否可以通过统计等方式，在用户预测单张图像时尽可能推荐出一个推理性能均衡点，在较低的速度影响下依旧可以提升模型效果。在这个项目中，需要你在同样环境下，在 Cifar100 数据集上进行推理，做到速度影响在 5% 以内，精度仍可具备至少 0.1% 的提升。

【提交内容】

项目 PR 到 PaTTA

技术说明文档

【技术要求】

可跑通 PaddlePaddle 核心框架下任一图像分类任务

PaddlePaddle Hackathon
opened by GT-ZhangAcer 0
【PaddlePaddle Hackathon】96 图像分类模型解释性可视化探究
（此 ISSUE 为 PaddlePaddle Hackathon 活动的任务 ISSUE，更多详见PaddlePaddle Hackathon）

PaTTA 是一个致力于让模型表现更加稳定的飞桨模型测试增强工具箱。

【任务说明】

任务标题：图像分类模型解释性可视化探究

技术标签：PaTTA、Python、PaddlePaddle

任务难度：简单

详细描述：深度学习模型在结构上很难具备“可解释”能力，然而这并不影响我们通过梯度、噪音等方式去解释模型到底在关注什么，也就意味着我们在一些比赛中也可以从通过该方式来了解模型的“关注点”从而提升比赛成绩。

在这个任务中，你需要从产品设计出发，也可以考虑如何优化可解释型算法，目的是将解释性工具箱 InterpretDL 或者自己实现的可解释性模块加入 PaTTA 工具箱中，为模型分析提供更多可能，使得用户在使用 PaTTA 工具箱进行推理结果增强时，可以通过简单的方式调用可视化解释性功能，向使用者提供解释性分析情况。

PaTTA 主页：https://github.com/AgentMaker/PaTTA

InterpretDL 主页：https://github.com/PaddlePaddle/InterpretDL

【提交内容】

项目 PR 到 PaTTA

技术说明文档

【技术要求】

具有基础的 Python 开发能力

有使用 Matplotlib 或 OpenCV 等任一 Python 图像库的使用经历

PaddlePaddle Hackathon
opened by GT-ZhangAcer 0

Releases(0.0.2)

0.0.2(Mar 19, 2021)

Add pip install method and update whl in pypi.
Source code(tar.gz)
Source code(zip)

Owner

AgentMaker

Mainly focus on reinforcement learning and deep learning for point clouds

GitHub Repository

An open collection of annotated voices in Japanese language

声庭 (Koniwa): オープンな日本語音声とアノテーションのコレクション Koniwa (声庭): An open collection of annotated voices in Japanese language 概要 Koniwa(声庭)は利用・修正・再配布が自由でオープンな音声とアノテ

32 Dec 14, 2022

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Parrot Parrot is a paraphrase based utterance augmentation framework purpose built to accelerate training NLU models. A paraphrase framework is more t

690 Jan 04, 2023

Question and answer retrieval in Turkish with BERT

trfaq Google supported this work by providing Google Cloud credit. Thank you Google for supporting the open source! 🎉 What is this? At this repo, I'm

13 Oct 10, 2022

Official code for "Parser-Free Virtual Try-on via Distilling Appearance Flows", CVPR 2021

Parser-Free Virtual Try-on via Distilling Appearance Flows, CVPR 2021 Official code for CVPR 2021 paper 'Parser-Free Virtual Try-on via Distilling App

395 Jan 03, 2023

TLA - Twitter Linguistic Analysis

TLA - Twitter Linguistic Analysis Tool for linguistic analysis of communities TLA is built using PyTorch, Transformers and several other State-of-the-

47 Aug 14, 2022

Almost State-of-the-art Text Generation library

Ps: we are adding transformer model soon Text Gen 🐐 Almost State-of-the-art Text Generation library Text gen is a python library that allow you build

63 Jun 24, 2022

An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

Extracting OpenAI CLIP (Global/Grid) Features from Image and Text This repo aims at providing an easy to use and efficient code for extracting image &

13 Jan 06, 2023

Image2pcl - Enter the metaverse with 2D image to 3D projections

Image2PCL Enter the metaverse with 2D image to 3D projections! This is an implem

0 Feb 05, 2022

Checking spelling of form elements

Checking spelling of form elements. You can check the source files of external workflows/reports and configuration files

15 Sep 12, 2022

Data preprocessing rosetta parser for python

datapreprocessing_rosetta_parser I've never done any NLP or text data processing before, so I wanted to use this hackathon as a learning opportunity,

2 Nov 28, 2021

A Chinese to English Neural Model Translation Project

ZH-EN NMT Chinese to English Neural Machine Translation This project is inspired by Stanford's CS224N NMT Project Dataset used in this project: News C

29 Nov 26, 2022

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

Japanese-LUW-Tokenizer Japanese Long-Unit-Word (国語研長単位) Tokenizer for Transformers based on 青空文庫 Basic Usage from transformers import RemBertToken

3 Dec 22, 2021

gaiic2021-track3-小布助手对话短文本语义匹配复赛rank3、决赛rank4

决赛答辩已经过去一段时间了，我们队伍ac milan最终获得了复赛第3，决赛第4的成绩。在此首先感谢一些队友的carry～经过2个多月的比赛，学习收获了很多，也认识了很多大佬，在这里记录一下自己的参赛体验和学习收获。

102 Dec 19, 2022

Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields

Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields [project page][paper][cite] Geometry-Consistent Neural Shape Represe

100 Dec 19, 2022

Persian-lexicon - A lexicon of 70K unique Persian (Farsi) words

Persian Lexicon This repo uses Uppsala Persian Corpus (UPC) to construct a lexic

7 Apr 01, 2022

Prompt tuning toolkit for GPT-2 and GPT-Neo

mkultra mkultra is a prompt tuning toolkit for GPT-2 and GPT-Neo. Prompt tuning injects a string of 20-100 special tokens into the context in order to

61 Jan 01, 2023

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

Demo Code for "Talking Head Anime from a Single Image 2: More Expressive" This repository contains demo programs for the Talking Head Anime

901 Jan 06, 2023

:mag: Transformers at scale for question answering & neural search. Using NLP via a modular Retriever-Reader-Pipeline. Supporting DPR, Elasticsearch, HuggingFace's Modelhub...

Haystack is an end-to-end framework that enables you to build powerful and production-ready pipelines for different search use cases. Whether you want

6.4k Jan 09, 2023

YACLC - Yet Another Chinese Learner Corpus

汉语学习者文本多维标注数据集YACLC V1.0 中文 | English 汉语学习者文本多维标注数据集（Yet Another Chinese Learner

47 Dec 15, 2022

Utility for Google Text-To-Speech batch audio files generator. Ideal for prompt files creation with Google voices for application in offline IVRs

Google Text-To-Speech Batch Prompt File Maker Are you in the need of IVR prompts, but you have no voice actors? Let Google talk your prompts like a pr

1 Aug 19, 2021

a test times augmentation toolkit based on paddle2.0.

Related tags

Overview

Patta

Table of Contents

Quick start (Default Transforms)

Test

Segmentation model wrapping [docstring]:

Classification model wrapping [docstring]:

Keypoints model wrapping [docstring]:

Predict

Load model [docstring]:

Segmentation model wrapping [docstring]:

Classification model wrapping [docstring]:

Keypoints model wrapping [docstring]:

Use-Tools

Segmentation model [docstring]:

Advanced-Examples (DIY Transforms)

Custom transform:

Custom model (multi-input / multi-output)

Optional Transforms

Aliases (Combos)

Merge-modes

Installation

Run tests

Comments

preprocess issue

issue 1

当我将crop_size调至(1024,512), 报错

事实上修改任意crop_size都报错，但是改为1536,1536即数据集的图片尺寸，上述错误解决，但是issue2出现

issue 2

修复各种测试，并利用 GitHub Actions 自动化测试

[PaddlePaddle Hackathon] add image augment algorithms

【PaddlePaddle Hackathon】97 新增图像数据增强算法

【PaddlePaddle Hackathon】AgentMaker 任务合集

【PaddlePaddle Hackathon】98 搜索测试图像增强最佳方案探索

【PaddlePaddle Hackathon】96 图像分类模型解释性可视化探究

Releases(0.0.2)

0.0.2(Mar 19, 2021)

Owner

AgentMaker

An open collection of annotated voices in Japanese language

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Question and answer retrieval in Turkish with BERT

Official code for "Parser-Free Virtual Try-on via Distilling Appearance Flows", CVPR 2021

TLA - Twitter Linguistic Analysis

Almost State-of-the-art Text Generation library

An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

Image2pcl - Enter the metaverse with 2D image to 3D projections

Checking spelling of form elements

Data preprocessing rosetta parser for python

A Chinese to English Neural Model Translation Project

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

gaiic2021-track3-小布助手对话短文本语义匹配复赛rank3、决赛rank4

Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields

Persian-lexicon - A lexicon of 70K unique Persian (Farsi) words

Prompt tuning toolkit for GPT-2 and GPT-Neo

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

:mag: Transformers at scale for question answering & neural search. Using NLP via a modular Retriever-Reader-Pipeline. Supporting DPR, Elasticsearch, HuggingFace's Modelhub...

YACLC - Yet Another Chinese Learner Corpus

Utility for Google Text-To-Speech batch audio files generator. Ideal for prompt files creation with Google voices for application in offline IVRs