Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".

Last update: Nov 26, 2022

Related tags

Overview

GN-Transformer AST

This is the official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".

Data Preparing

Preprocess the dataset by yourself

The code we used to preprocess the Java and Python datasets are under in ./preprocess, please read README.md in /Java and /Python respectively to see how to preprocess the corpus.

The original corpus we used are from here:

Java corpus: https://github.com/xing-hu/TL-CodeSum

Python corpus: https://github.com/EdinburghNLP/code-docstring-corpus

Directly use our preprocessed dataset

You can directly download our preprocessed dataset:

Java: https://drive.google.com/file/d/1hVJaA2JA377Iz3bstHLIGaffUh_ogVnG/view?usp=sharing

Python: https://drive.google.com/file/d/1lQhczrERskISdBcWeS6VWLwCMpBAh-YF/view?usp=sharing

Or you can run the data_prepare.sh in ./data to prepare the dataset.

Training

Enter the script folders and run the gntransformer.sh, the training and testing will start.

#GPU: gpu device ids

#NAME: name of the model

Java:

cd ./scripts/java

bash gntransformer.sh #GPU #NAME

Python:

cd ./scripts/python

bash gntransformer.sh #GPU #NAME

Examples:

bash gntransformer.sh 0 some_name # one gpu

bash gntransformer.sh 0,1 some_name # two gpus

...

Trained models

You can download our trained models here:

Java: https://drive.google.com/file/d/1vnIuGLBNGU_AHDwL7yZIkoaByWiLKYxb/view?usp=sharing

Python: https://drive.google.com/file/d/1tk3Wc4YpSo_oLKCi6h3Kitvsux3vWFUO/view?usp=sharing

Or directly run download_models.sh in ./models to download the trained models.

Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".

Related tags

Overview

GN-Transformer AST

Data Preparing

Preprocess the dataset by yourself

Directly use our preprocessed dataset

Training

Java:

Python:

Examples:

Trained models

Owner

Cheng Jun-Yan

Breaking the Dilemma of Medical Image-to-image Translation

Learning High-Speed Flight in the Wild

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

Learning kernels to maximize the power of MMD tests

Implementation for the EMNLP 2021 paper "Interactive Machine Comprehension with Dynamic Knowledge Graphs".

Multi-Stage Episodic Control for Strategic Exploration in Text Games

Library for machine learning stacking generalization.

🐸STT integration examples

Riemann Noise Injection With PyTorch

Vision Transformer and MLP-Mixer Architectures

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Synthesize photos from PhotoDNA using machine learning 🌱

Stitch it in Time: GAN-Based Facial Editing of Real Videos

A simple python stock Predictor

A self-supervised 3D representation learning framework named viewpoint bottleneck.

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

PyTorch implementation code for the paper MixCo: Mix-up Contrastive Learning for Visual Representation

The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".

PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching