CCCL: Contrastive Cascade Graph Learning.

Last update: Dec 05, 2022

Overview

CCGL: Contrastive Cascade Graph Learning

This repo provides a reference implementation of Contrastive Cascade Graph Learning (CCGL) framework as described in the paper:

CCGL: Contrastive Cascade Graph Learning
Xovee Xu, Fan Zhou, Kunpeng Zhang, and Siyuan Liu
Submitted for review
arXiv:2107.12576

Dataset

You can download all five datasets (Weibo, Twitter, ACM, APS, and DBLP) via any one of the following links:

Google Drive	Dropbox	Onedrive	Tencent Drive	Baidu Netdisk
				`trqg`

Environmental Settings

Our experiments are conducted on Ubuntu 20.04, a single NVIDIA 1080Ti GPU, 48GB RAM, and Intel i7 8700K. CCGL is implemented by Python 3.7, TensorFlow 2.3, Cuda 10.1, and Cudnn 7.6.5.

Create a virtual environment and install GPU-support packages via Anaconda:

# create virtual environment
conda create --name=ccgl python=3.7 cudatoolkit=10.1 cudnn=7.6.5

# activate virtual environment
conda activate ccgl

# install other dependencies
pip install -r requirements.txt

Usage

Here we take Weibo dataset as an example to demonstrate the usage.

Preprocess

Step 1: divide, filter, generate labeled and unlabeled cascades:

cd ccgl
# labeled cascades
python src/gene_cas.py --input=./datasets/weibo/ --unlabel=False
# unlabeled cascades
python src/gene_cas.py --input=./datasets/weibo/ --unlabel=True

Step 2: augment both labeled and unlabeled cascades (here we use the AugSIM strategy):

python src/augmentor.py --input=./datasets/weibo/ --aug_strategy=AugSIM

Step 3: generate cascade embeddings:

python src/gene_emb.py --input=./datasets/weibo/

Pre-training

python src/pre_training.py --name=weibo-0 --input=./datasets/weibo/ --projection_head=4-1

The saved pre-training model is named as weibo-0.

Fine-tuning

python src/fine_tuning.py --name=weibo-0 --num=0 --input=./datasets/weibo/ --projection_head=4-1

Here we load the pre-trained model weibo-0 and save the teacher network as weibo-0-0.

Distillation

python src/distilling.py --name=weibo-0-0 --num=0 --input=./datasets/weibo/ --projection_head=4-1

Here we load the teacher network weibo-0-0 and save the student network as weibo-0-0-student-0.

(Optional) Run the Base model

python src/base_model.py --input=./datasets/weibo/

CCGL model weights

We provide pre-trained, fine-tuned, and distilled CCGL model weights. Please see details in the following table.

Model	Dataset	Label Fraction	Projection Head	MSLE	Weights
Pre-trained CCGL model	Weibo	100%	4-1	-	Download
Pre-trained CCGL model	Weibo	10%	4-4	-	Download
Pre-trained CCGL model	Weibo	1%	4-3	-	Download
Fine-tuned CCGL model	Weibo	100%	4-1	2.70	Download
Fine-tuned CCGL model	Weibo	10%	4-4	2.87	Download
Fine-tuned CCGL model	Weibo	1%	4-3	3.30	Download

Load weights into the model:

# construct model, carefully check projection head designs:
# use different number of Dense layers
...
# load weights for fine-tuning, distillation, or evaluation
model.load_weights(weight_path)

Check src/fine_tuning.py and src/distilling.py for weights loading examples.

Default hyper-parameter settings

Unless otherwise specified, we use following default hyper-parameter settings.

Param	Value	Param	Value
Augmentation strength	0.1	Pre-training epochs	30
Augmentation strategy	AugSIM	Projection Head (100%)	4-1
Batch size	64	Projection Head (10%)	4-4
Early stopping patience	20	Projection Head (1%)	4-3
Embedding dimension	64	Model size	128 (4x)
Learning rate	5e-4	Temperature	0.1

Change Logs

Jul 21, 2021: fix a bug and some annotations

Cite

If you find our paper & code are useful for your research, please consider citing us 😘 :

@article{xu2021ccgl, 
  author = {Xovee Xu and Fan Zhou and Kunpeng Zhang and Siyuan Liu}, 
  title = {{CCGL}: Contrastive Cascade Graph Learning}, 
  journal = {arXiv:2107.12576},
  year = {2021}, 
}

We also have a survey paper you might be interested:

@article{zhou2021survey,
  author = {Fan Zhou and Xovee Xu and Goce Trajcevski and Kunpeng Zhang}, 
  title = {A Survey of Information Cascade Analysis: Models, Predictions, and Recent Advances}, 
  journal = {ACM Computing Surveys (CSUR)}, 
  volume = {54},
  number = {2},
  year = {2021},
  articleno = {27},
  numpages = {36},
  doi = {10.1145/3433000},
}

Acknowledgment

We would like to thank Xiuxiu Qi, Ce Li, Qing Yang, and Wenxiong Li for sharing their computing resources and help us to test the codes. We would also like to show our gratitude to the authors of SimCLR (and Sayak Paul), node2vec, DeepHawkes, and others, for sharing their codes and datasets.

Contact

For any questions please open an issue or drop an email to: xovee at ieee.org

CCCL: Contrastive Cascade Graph Learning.

Related tags

Overview

CCGL: Contrastive Cascade Graph Learning

Dataset

Environmental Settings

Usage

Preprocess

Pre-training

Fine-tuning

Distillation

(Optional) Run the Base model

CCGL model weights

Default hyper-parameter settings

Change Logs

Cite

Acknowledgment

Contact

Owner

Xovee Xu

Language models are open knowledge graphs ( non official implementation )

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition (NeurIPS 2019)

A multilingual version of MS MARCO passage ranking dataset

Rank1 Conversation Emotion Detection Task

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition (CVPR 2021)

Code and data for "TURL: Table Understanding through Representation Learning"

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Official implementation for (Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching, AAAI-2021)

School of Artificial Intelligence at the Nanjing University (NJU)School of Artificial Intelligence at the Nanjing University (NJU)

A framework for using LSTMs to detect anomalies in multivariate time series data. Includes spacecraft anomaly data and experiments from the Mars Science Laboratory and SMAP missions.

This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8.

Dilated Convolution for Semantic Image Segmentation

DeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences (PacBio) Circular Consensus Sequencing (CCS) data.

[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation

Deep Learning agent of Starcraft2, similar to AlphaStar of DeepMind except size of network.

code and models for "Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation"