Behavioral Testing of Clinical NLP Models

This repository contains code for testing the behavior of clinical prediction models that take patient letters as input. For a detailed description of the testing framework, see our paper "What Do You See in this Patient? Behavioral Testing of Clinical NLP Models".

Usage

Install requirements: pip install -r requirements.txt

Run main.py, e.g. to test a diagnosis prediction model on gender, age and ethnicity:

python main.py \
    --test_set_path ./path_to_test_set \
    --model_path bvanaken/CORe-clinical-diagnosis-prediction \
    --task diagnosis \
    --shift_keys gender,age,ethnicity \
    --save_dir ./results \
    --gpu False

Parameter	Description
test_set_path	Path to original test set file
model_path	Path to a local model or a Hugging Face model hub checkpoint
task	Current options: diagnosis, mortality
shift_keys	Which patient characteristics to test. Current options: age, gender, ethnicity, weight, intersectional (gender + ethnicity)
save_dir	Directory to save results, default: "./results"
gpu	Whether to use a GPU during inference, default: False

Using Non-Transformer models

The framework currently focuses on testing Transformer-based models, but it can be extended to any other prediction model. To do so, create a new class that implements the Predictor interface and add it to the TASK_MAP in main.py.
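The extension step can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code. The `Predictor` base class shown here, its `predict` method and signature, the `KeywordPredictor` class, the `diagnosis_keyword` key, and the shape of `TASK_MAP` are all assumptions made for the example. Check the real interface and the real `TASK_MAP` in main.py before adapting it.

```python
from abc import ABC, abstractmethod
from typing import Dict, List


class Predictor(ABC):
    """Hypothetical stand-in for the repository's Predictor interface."""

    @abstractmethod
    def predict(self, texts: List[str]) -> List[Dict[str, float]]:
        """Return one {label: score} dict per input text."""


class KeywordPredictor(Predictor):
    """Toy non-Transformer model: scores each label by keyword hits."""

    def __init__(self, keywords: Dict[str, List[str]]):
        self.keywords = keywords

    def predict(self, texts: List[str]) -> List[Dict[str, float]]:
        results = []
        for text in texts:
            lowered = text.lower()
            scores = {}
            for label, words in self.keywords.items():
                # Fraction of the label's keywords that occur in the text.
                hits = sum(1 for word in words if word in lowered)
                scores[label] = hits / len(words) if words else 0.0
            results.append(scores)
        return results


# Hypothetical registry; the real TASK_MAP in main.py may be structured differently.
TASK_MAP = {"diagnosis_keyword": KeywordPredictor}

predictor = TASK_MAP["diagnosis_keyword"]({"diabetes": ["insulin", "glucose"]})
scores = predictor.predict(["Patient started on insulin."])
print(scores)
```

Keeping the model behind a single prediction method means the test logic only needs texts in and scores out, so any model that can provide that mapping could be plugged in the same way.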

Cite

@inproceedings{vanAken2021,
  author    = {Betty van Aken and
               Sebastian Herrmann and
               Alexander Löser},
  title     = {What Do You See in this Patient? Behavioral Testing of Clinical NLP Models},
  booktitle = {Bridging the Gap: From Machine Learning Research to Clinical Practice, 
               Research2Clinics Workshop @ NeurIPS 2021},
  year      = {2021}
}

Owner

Betty van Aken