Code of paper: "DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks"

Last update: Nov 10, 2022

Overview

DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks

Abstract: Adversarial training has been proven to be a powerful regularization method to improve generalization of models. In this work, a novel masked weight adversarial training method, DropAttack, is proposed for improving generalization potential of neural network models. It enhances the coverage and diversity of adversarial attack by intentionally adding worst-case adversarial perturbations to both the input and hidden layers and randomly masking the attack perturbations on a certain proportion weight parameters. It then improves the generalization of neural networks by minimizing the internal adversarial risk generated by exponentially different attack combinations. Further, the method is a general technique that can be adopted to a wide variety of neural networks with different architectures. To validate the effectiveness of the proposed method, five public datasets were used in the fields of natural language processing (NLP) and computer vision (CV) for experimental evaluating. This study compared DropAttack with other adversarial training methods and regularization methods. It was found that the proposed method achieves state-of-the-art performance on all datasets. In addition, the experimental results of this study show that DropAttack method can achieve similar performance when it uses only a half training data required in standard training. Theoretical analysis revealed that DropAttack can perform gradient regularization at random on some of the input and weight parameters of the model. Further, visualization experiments of this study show that DropAttack can push the minimum risk of the neural network model to a lower and flatter loss landscapes.

For technical details and additional experimental results, please refer to our paper:

“DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks”

Experimental results:

DropAttack indeed selects flatter loss landscapes via masked adversarial perturbations.

[The code of loss visualization]

Citation

@article{ni2021dropattack,
  title={DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks},
  author={Ni, Shiwen and Li, Jiawen and Kao, Hung-Yu},
  journal={arXiv preprint arXiv:2108.12805},
  year={2021}
}

Requirements

pytorch
pandas
numpy
nltk
sklearn
torchtext

Please star it, thank you! :）

Code of paper: "DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks"

Related tags

Overview

DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks

For technical details and additional experimental results, please refer to our paper:

Experimental results:

Citation

Requirements

Please star it, thank you! :）

Owner

倪仕文 (Shiwen Ni)

Video-Captioning - A machine Learning project to generate captions for video frames indicating the relationship between the objects in the video

Gym environment for FLIPIT: The Game of "Stealthy Takeover"

Pytorch implementation for "Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion" (NeurIPS 2021)

Like Dirt-Samples, but cleaned up

An all-in-one application to visualize multiple different local path planning algorithms

Using CNN to mimic the driver based on training data from Torcs

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

The implementation our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".

This project uses Template Matching technique for object detecting by detection of template image over base image.

Code release for Local Light Field Fusion at SIGGRAPH 2019

I created My own Virtual Artificial Intelligence named genesis, He can assist with my Tasks and also perform some analysis,,

Code of Adverse Weather Image Translation with Asymmetric and Uncertainty aware GAN

Optimizes image files by converting them to webp while also updating all references.

Python3 / PyTorch implementation of the following paper: Fine-grained Semantics-aware Representation Enhancement for Self-supervisedMonocular Depth Estimation. ICCV 2021 (oral)

PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.

Public repo for the ICCV2021-CVAMD paper "Is it Time to Replace CNNs with Transformers for Medical Images?"

Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification

Pgn2tex - Scripts to convert pgn files to latex document. Useful to build books or pdf from pgn studies

Estimating Example Difficulty using Variance of Gradients

implicit displacement field