University Challenge 2021

This repository contains:

The TeX file of the technical write-up describing the University / HYPER Challenge 2021 under latex-doc/
The Python starter-kit for the competition
The Docker starter-kit for the competition with the Python starter-kit inside

Option 1: Hypergraph partitioning using Python

The Python starter-kit is located under hg_tools/. Please see the README.md file within that folder for further instructions. The partition output file is written under hg_tools/output/.

Option 2: Hypergraph partitioning using Docker

The following instructions show a reproducible execution of the Docker starter-kit.

Dependencies

You must first have installed docker and docker-compose.

Datasets

You need to copy your datasets under hg_tools/data/ folder.

Build and run within a docker container

To build, type

docker-compose build

To run, type

docker-compose run hg_tools data/sample.mtx 2 0.01
# docker-compose run hg_tools data/CurlCurl_4.mtx.gz 10 0.01
# docker-compose run hg_tools data/wikipedia-20070206.mtx.gz 10 0.01

The partition output file is written under docker-output/.

University Challenge 2021 With Python

Related tags

Overview

University Challenge 2021

Option 1: Hypergraph partitioning using Python

Option 2: Hypergraph partitioning using Docker

Dependencies

Datasets

Build and run within a docker container

Owner

Python data processing, analysis, visualization, and data operations

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

Basis Set Format Converter

Detecting Underwater Objects (DUO)

TheMachineScraper 🐱‍👤 is an Information Grabber built for Machine Analysis

An ETL Pipeline of a large data set from a fictitious music streaming service named Sparkify.

Data Intelligence Applications - Online Product Advertising and Pricing with Context Generation

DaCe is a parallel programming framework that takes code in Python/NumPy and other programming languages

BAyesian Model-Building Interface (Bambi) in Python.

DenseClus is a Python module for clustering mixed type data using UMAP and HDBSCAN

Integrate bus data from a variety of sources (batch processing and real time processing).

scikit-survival is a Python module for survival analysis built on top of scikit-learn.

A highly efficient and modular implementation of Gaussian Processes in PyTorch

Data Scientist in Simple Stock Analysis of PT Bukalapak.com Tbk for Long Term Investment

signac-flow - manage workflows with signac

Pandas and Spark DataFrame comparison for humans

Predictive Modeling & Analytics on Home Equity Line of Credit

Fast, flexible and easy to use probabilistic modelling in Python.

This is a tool for speculation of ancestral allel, calculation of sfs and drawing its bar plot.

Data and code accompanying the paper Politics and Virality in the Time of Twitter