COVID19_detection

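Background

The world is currently in the grip of the global COVID19 pandemic. Billions of people have been affected, and there have been millions of casualties. Identifying individuals who are infected with the SARS-CoV-2 virus, or who have previously been exposed to it, is therefore essential. Such identification helps public health organizations and governments draw up action plans to reduce the impact of the pandemic. In this context, Hilab is a remote laboratory company that performs dozens of types of blood tests, including serological tests for COVID19, and it has already carried out millions of tests in Brazil. To improve detection of the virus, machine learning methods can be used to support laboratory specialists in their decisions. This project therefore addresses the problem of building a machine learning model that detects COVID19 with high confidence and accuracy.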

Methods

Decision tree
Random forest
Support vector machine (SVM)
Principal component analysis (PCA)
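As a rough illustration of how the four methods listed above could fit together, the sketch below reduces the features with PCA and then trains each of the three classifiers on the reduced data. It assumes scikit-learn, which this README does not name, and it uses synthetic data from `make_classification` as a stand-in for the blood-test features. The number of components and all hyperparameters are placeholders, not the project's settings.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the blood-test features (assumption: the real
# data would come from the CSV files under dataset/train/ instead).
X, y = make_classification(
    n_samples=400, n_features=20, n_informative=8, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# The three classifiers from the list above, with placeholder settings.
models = {
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "svm": SVC(kernel="rbf"),
}

scores = {}
for name, clf in models.items():
    # Standardize, project onto 10 principal components, then classify.
    pipe = make_pipeline(StandardScaler(), PCA(n_components=10), clf)
    pipe.fit(X_train, y_train)
    scores[name] = pipe.score(X_test, y_test)  # accuracy on held-out data

for name, acc in scores.items():
    print(f"{name}: accuracy = {acc:.3f}")
```

Wrapping the scaler and PCA in a pipeline keeps both fitted on the training split only, so no information from the held-out split leaks into the projection.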

Dataset

Dataset URL: https://drive.google.com/drive/folders/1FfIx5WmEc_C7d3Ai7ONIQE4s-o2xQZz5?usp=sharing

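Project structure

/
-dataset/		# dataset directory
--test/			# test set directory
---test.csv		# test set file
--train/  		# training set directory
---train_1.csv	# training set file 1 (identical to the test set, not used by default)
---train_2.csv	# training set file 2
.......
---train_7.csv	# training set file 7

-data_preprocess.py	# dataset extraction and preprocessing
-pca.py				# experiments with PCA dimensionality reduction
-decision_tree.py	# decision tree
-random_forest.py	# random forest
-SVM.py				# SVM
-README.md			# this README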
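Since train_1.csv duplicates the test set and is not used by default, any loading step has to leave it out. The sketch below shows one way to collect the remaining training files using only the standard library. The function names `list_training_files` and `load_rows` are hypothetical and are not taken from data_preprocess.py, and the CSV files are assumed to have a header row.

```python
import csv
from pathlib import Path


def list_training_files(train_dir, skip=("train_1.csv",)):
    """Return the train_*.csv files in train_dir, sorted, minus skipped names.

    train_1.csv is skipped by default because it is the same as the test set.
    """
    train_dir = Path(train_dir)
    return sorted(
        p for p in train_dir.glob("train_*.csv") if p.name not in skip
    )


def load_rows(paths):
    """Read every file in paths and return all rows as a list of dicts.

    Assumes each file starts with a header row; values stay as strings.
    """
    rows = []
    for path in paths:
        with open(path, newline="", encoding="utf-8") as fh:
            rows.extend(csv.DictReader(fh))
    return rows


# Example usage (path taken from the layout above):
# files = list_training_files("dataset/train")
# rows = load_rows(files)
```

Keeping the skip list as a parameter makes the exclusion explicit and easy to override if train_1.csv is ever wanted.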

A set of programs for detecting the COVID19 virus from blood test data.

Owner

Nuyoah-xlh
