JUSTICE: A Benchmark Dataset for Supreme Court’s Judgment Prediction

CSCI 544 Final Project done by: Mohammed Alsayed, Shaayan Syed, Mohammad Alali, Smit Patel, Hemanth Bodala

Abstract

Artificial intelligence is being utilized in many domains as of late, and the legal system is no exception. However, as it stands now, the number of well-annotated datasets pertaining to legal documents from the Supreme Court of the United States (SCOTUS) is very limited for public use. Even though the Supreme Court rulings are public domain knowledge, trying to do meaningful work with them becomes a much greater task due to the need to manually gather and process that data from scratch each time. Hence, our goal is to create a high-quality dataset of SCOTUS court cases so that they may be readily used in natural language processing (NLP) research and other data-driven applications. Additionally, recent advances in NLP provide us with the tools to build predictive models that can be used to reveal patterns that influence court decisions. By using advanced NLP algorithms to analyze previous court cases, the trained models are able to predict and classify a court's judgment given the case's facts from the plaintiff and the defendant in textual format; in other words, the model is emulating a human jury by generating a final verdict.

Links

arXiv Link: https://arxiv.org/abs/2112.03414

YouTube Link: https://youtu.be/vJ6NQ_UAcVo

Dataset Links:

Minimal JSON compact form (216MB):

https://www.dropbox.com/s/9kyk0dr2gf3ls23/oyez.json?dl=0

Prettified JSON human-readable form (431 MB):

https://www.dropbox.com/s/52a58aac8iujupv/oyez_pretty.json?dl=0

JUSTICE: A Benchmark Dataset for Supreme Court’s Judgment Prediction

Related tags

Overview

JUSTICE: A Benchmark Dataset for Supreme Court’s Judgment Prediction

Abstract

Links

Owner

Smit Patel

Do Neural Networks for Segmentation Understand Insideness?

Code for the Lovász-Softmax loss (CVPR 2018)

SysWhispers Shellcode Loader

EgGateWayGetShell py脚本

A method that utilized Generative Adversarial Network (GAN) to interpret the black-box deep image classifier models by PyTorch.

A GridMixup augmentation, inspired by GridMask and CutMix

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Package for extracting emotions from social media text. Tailored for financial data.

Score refinement for confidence-based 3D multi-object tracking

Project ArXiv Citation Network

This is the official implementation of "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval".

MLOps will help you to understand how to build a Continuous Integration and Continuous Delivery pipeline for an ML/AI project.

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

DANA paper supplementary materials

ICCV2021 Oral SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

Neural Articulated Radiance Field

This repo includes the supplementary of our paper "CEMENT: Incomplete Multi-View Weak-Label Learning with Long-Tailed Labels"

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

Streamlit component for TensorBoard, TensorFlow's visualization toolkit