Garbage classification using structured data.

Last update: Dec 10, 2021

Overview

Garbage Classification Model: Usage Guide

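1. Data files included

File	Description
data/MaterialMapping.csv	Objects and their category assignments
data/TestRecords	Raw spectral test data (CSV files)
data/TestRecordDesc.zip	Description files for the CSV files
data/Boundaries.csv	Object contour information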

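2. Model files included

Folder	Description
output/Category/	Classification model that predicts the major category
output/Material/	Classification model that predicts the major category (4 classes)
output/Backgroud/	Classification model that predicts the minor category (50 classes)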

3. Environment setup

Change into the garbage directory and run pip install -r requirements.txt from the Anaconda prompt.

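4. Data preprocessing

Run python data_preprocess.py from the Anaconda prompt to generate AllEmbracingDataset.csv in the data folder. If the data is updated later, save it in the data folder using the same format and paths as before; data_preprocess.py will then generate the updated dataset.

The preprocessing script consolidates the information from the data files listed above into a single data file: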

python code/data_preprocess.py -data_dir D:/datasets/garbage \
                        -test \
                        -groupbyObjID

The script produces the dataset file datasets/AllEmbracingDataset.csv.

5. Model training Python script

python code/train_gbdt_lr.py -data_dir D:/datasets/garbage/ \
                    -use_groupbyID True \
                    -output_dir output/ \
                    -skip_data_preprocess

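Other Python scripts:

feature_engineering.py: feature engineering code
ref.py: configuration needed for data processing and model inference
utils.py: helper functions for data processing
gbdt_feature.py: generates features with a GBDT model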
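
For readers unfamiliar with the GBDT+LR pattern that gbdt_feature.py and train_gbdt_lr.py are named after, the following is a minimal sketch of one common formulation: a gradient-boosted tree model maps each sample to the leaf it reaches in every tree, the leaf indices are one-hot encoded, and a logistic regression is trained on those encoded features. This is not the repository's code. It assumes scikit-learn, runs on synthetic data, and every name in it (leaf_features, the hyperparameters, the data shapes) is hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import OneHotEncoder

# Synthetic stand-in for the spectral features; NOT the real dataset.
X, y = make_classification(
    n_samples=600, n_features=20, n_informative=8, n_classes=3, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Stage 1: fit the GBDT that will act as a feature generator.
gbdt = GradientBoostingClassifier(n_estimators=30, max_depth=3, random_state=0)
gbdt.fit(X_train, y_train)


def leaf_features(model, features):
    """Return the leaf index reached in every tree, flattened to 2-D."""
    # apply() yields shape (n_samples, n_estimators, trees_per_stage).
    leaves = model.apply(features)
    return leaves.reshape(leaves.shape[0], -1)


# Stage 2: one-hot encode the leaf indices and fit a logistic regression.
encoder = OneHotEncoder(handle_unknown="ignore")
lr = LogisticRegression(max_iter=1000)
lr.fit(encoder.fit_transform(leaf_features(gbdt, X_train)), y_train)

# Inference reuses the fitted GBDT and encoder before calling the LR model.
pred = lr.predict(encoder.transform(leaf_features(gbdt, X_test)))
accuracy = float(np.mean(pred == y_test))
print(f"held-out accuracy on synthetic data: {accuracy:.3f}")
```

handle_unknown="ignore" keeps inference from failing when a test sample lands in a leaf that no training sample reached. Fitting the GBDT and the LR on disjoint subsets of the training data is a frequent variation that reduces overfitting; it is omitted here to keep the sketch short.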

6. Model inference Python script

python code/predict_gbdt_lr.py -data_dir D:/datasets/garbage/ \
                    -use_groupbyID True \
                    -output_dir output/ \
                    -skip_data_preprocess \
                    -save_dir output/

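Note 1: If any one of the records belonging to the same ObjID is predicted as something other than background (0), the final prediction for that object is not background.

Note 2: A predicted Material is always one of the unique identifiers that appeared in the training data. The current data contains 148 distinct identifiers; see LabelEncoder.classes in output/log/log.txt for the full list.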
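
The restriction in Note 2 follows from how a label encoder works: the classifier is trained on integer codes, and only codes seen at fit time can be mapped back to identifiers. The sketch below illustrates this with scikit-learn's LabelEncoder. The identifiers are made up for illustration, and whether the repository's scripts use the encoder in exactly this way is an assumption.

```python
from sklearn.preprocessing import LabelEncoder

# Hypothetical Material identifiers; the real ones are listed in the training log.
train_materials = ["M-101", "M-205", "M-101", "M-330"]

encoder = LabelEncoder().fit(train_materials)
known_classes = list(encoder.classes_)  # sorted distinct training labels
print(known_classes)

# Labels seen during fitting round-trip through integer codes.
codes = encoder.transform(["M-205", "M-101"])
decoded = list(encoder.inverse_transform(codes))
print(list(codes), decoded)

# A label that never appeared in training cannot be encoded,
# so a model trained on the codes can never predict it.
unseen_error = None
try:
    encoder.transform(["M-999"])
except ValueError as exc:
    unseen_error = str(exc)
print("unseen label rejected:", unseen_error is not None)
```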

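Prediction output file (predictions.csv): for each object (that is, each ObjID, which usually corresponds to multiple test records), the file gives one prediction aggregated from the per-record predictions.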

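#	Field	Meaning
1	ObjID	Unique identifier of the tested object. One object corresponds to multiple test records
2	Category	Object category, taken from the training data
3	Material	Unique identifier of the object, taken from the training data
4	pred_Category	Object category predicted by the model
5	pred_Material	Object unique identifier predicted by the model
6	pred_background	Background-versus-object prediction (background is labeled 0, object is labeled 1)
7	pred_Category_final	Final object category predicted by the model
8	pred_Material_final	Final object material class predicted by the model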
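
The background rule from Note 1 can be sketched in a few lines: an object counts as background only if every one of its records is predicted as background. This is an illustration of the stated rule and not the code in predict_gbdt_lr.py. The function name and the toy records are hypothetical, and how the scripts aggregate Category and Material across records is not covered here.

```python
from collections import defaultdict


def aggregate_background(records):
    """Collapse per-record background predictions into one flag per ObjID.

    records: iterable of (obj_id, pred_background) pairs,
             where 0 means background and 1 means object.
    Returns a dict mapping obj_id to 0 (background) or 1 (object).
    """
    flags = defaultdict(int)
    for obj_id, pred_background in records:
        # A single non-background record is enough to mark the object as 1.
        flags[obj_id] = max(flags[obj_id], int(pred_background != 0))
    return dict(flags)


# Toy records: object "A" has one non-background record, object "B" has none.
toy_records = [("A", 0), ("A", 1), ("A", 0), ("B", 0), ("B", 0)]
final_flags = aggregate_background(toy_records)
print(final_flags)
```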

7. Model accuracy

We use a GBDT+LR model for all three prediction scenarios: Category, Material, and Background. We tried SVM, XGBoost, LightGBM, and GBDT+LR; of these, GBDT+LR performed best. Accuracy on the test set:

Scenario	Accuracy
Category	0.7583130575831306
Material	0.6042173560421735
Background	0.996044825313118
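
Accuracy here is the fraction of rows whose predicted label equals the true label. A minimal sketch of that computation over a predictions.csv-style file is shown below. It assumes the comparison is between the Category and pred_Category_final columns; the source does not say which column pair produced the figures above, and the rows are toy data, not real results.

```python
import csv
import io


def column_accuracy(rows, true_col, pred_col):
    """Fraction of rows where the predicted column equals the true column."""
    rows = list(rows)
    if not rows:
        raise ValueError("no rows to score")
    hits = sum(1 for row in rows if row[true_col] == row[pred_col])
    return hits / len(rows)


# Toy stand-in for predictions.csv (made-up labels, three of four correct).
toy_csv = io.StringIO(
    "ObjID,Category,pred_Category_final\n"
    "1,A,A\n"
    "2,B,B\n"
    "3,C,A\n"
    "4,D,D\n"
)
toy_rows = list(csv.DictReader(toy_csv))
toy_accuracy = column_accuracy(toy_rows, "Category", "pred_Category_final")
print(toy_accuracy)
```

To score a real file, replace the io.StringIO object with an opened predictions.csv and pass the column pair of interest, for example Material and pred_Material_final.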

Owner

wenqi
