Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

Last update: Dec 20, 2022

Overview

LMSOC: An Approach for Socially Sensitive Pretraining

Code for reproducing the paper LMSOC: An Approach for Socially Sensitive Pretraining to appear at 2021 Conference on Empirical Methods in Natural Language Processing: Findings.

Abstract

While large-scale pretrained language models have been shown to learn effective linguistic representations for many NLP tasks, there remain many real-world contextual aspects of language that current approaches do not capture. For instance, consider a cloze-test "I enjoyed the ____ game this weekend": the correct answer depends heavily on where the speaker is from, when the utterance occurred, and the speaker's broader social milieu and preferences. Although language depends heavily on the geographical, temporal, and other social contexts of the speaker, these elements have not been incorporated into modern transformer-based language models. We propose a simple but effective approach to incorporate speaker social context into the learned representations of large-scale language models. Our method first learns dense representations of social contexts using graph representation learning algorithms and then primes language model pretraining with these social context representations. We evaluate our approach on geographically-sensitive language-modeling tasks and show a substantial improvement (more than 100% relative lift on MRR) compared to baselines.

Citation

Please cite as:

Kulkarni, V., Mishra, S., & Haghighi, A. (2021). LMSOC: An Approach for Socially Sensitive Pretraining. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: Findings. arXiv

@inproceedings{kulkarni2021lmsoc,
  title={LMSOC: An Approach for Socially Sensitive Pretraining},
  author={Kulkarni, Vivek and Mishra, Shubhanshu and Haghighi, Aria},
  booktitle={Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: Findings},
  year={2021}
  address={Online},
  publisher={Association for Computational Linguistics},
  pages={1--9},
  eprint={2110.10319},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}

Reproducibility

NOTE: Dependencies are specified in the notebooks. But we have also encluded an requirements.txt and environment.yml files to install dependencies using pip or conda.

Create Social Context Embeddings via the example notebook embed_time_toy_task.ipynb which contains the implementation of how to embed time for Task 1 in the paper.
Upload the files in data/ to the location where you will run the next notebook.
The notebook lmsoc_train_and_eval_toy_task.ipynb contains the LMSOC training code.
- NOTE: This notebook assumes you have already trained social context embeddings for the data you have (for example, here the social context is time).
- It is a runnable colab notebook which demonstrates the entire process of training and evaluating LMSOC as described in the paper.
- If run, it will reproduce the experimental setup for Task 1 and ultimately yield Figure 2.
- In order to run this notebook in colab, open this notebook in Google Colab and upload the files in "data" directory to your colab workspace.

Security Issues?

Please report sensitive security issues via Twitter's bug-bounty program (https://hackerone.com/twitter) rather than GitHub.

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

Related tags

Overview

LMSOC: An Approach for Socially Sensitive Pretraining

Abstract

Citation

Reproducibility

Security Issues?

Owner

Twitter Research

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Website which uses Deep Learning to generate horror stories.

CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability

A python module for scientific analysis of 3D objects based on VTK and Numpy

Multimodal Descriptions of Social Concepts: Automatic Modeling and Detection of (Highly Abstract) Social Concepts evoked by Art Images

An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment.

This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

Tracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)

Neural Radiance Fields Using PyTorch

Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks

Some code of the implements of Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network

KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch

(JMLR' 19) A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples

The code succinctly shows how our ensemble learning based on deep learning CNN is used for LAM-avulsion-diagnosis.

Fast sparse deep learning on CPUs

Easy genetic ancestry predictions in Python

Official implementation of Deep Burst Super-Resolution

Genetic Programming in Python, with a scikit-learn inspired API

[CVPR 2022] Pytorch implementation of "Templates for 3D Object Pose Estimation Revisited: Generalization to New objects and Robustness to Occlusions" paper