YOLOv5 implemented in TF2 Keras

Last update: Jan 08, 2023

Overview

YOLOv5 in tensorflow2.x-keras

Model testing

Training on COCO2017 (val 5k)

Detection results

Precision/recall

Requirements

pip3 install -r requirements.txt

Getting started

Training

python3 train.py

tensorboard

tensorboard --host 0.0.0.0 --logdir ./logs/ --port 8053 --samples_per_plugin=images=40

View

http://127.0.0.1:8053

Testing: edit input_image and model_path in detect.py, then run

python3 detect.py

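Training on your own data

Label your data with labelme.
Open the data/labelme2coco.py script and change the following:

input_dir = 'directory where labelme saved the JSON annotation files'
output_dir = 'directory for the COCO-format output; an empty new directory is recommended'
labels = "all class names used during labeling; a plain txt file is enough, one class per line, no quotes around the names"

Running data/labelme2coco.py generates the corresponding JSON file and images in output_dir.
In train.py, set coco_annotation_file and num_class. Class names can be obtained via CoCoDataGenrator(*).coco.cats[label_id]['name']. Because COCO category IDs are not contiguous, looking up a class in coco.cats by array position may return the wrong class.
Start training: python3 train.py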
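
The note about non-contiguous category IDs can be illustrated with a small sketch. It assumes a dictionary shaped like the cats attribute of a pycocotools COCO object (category ID mapped to a record with a 'name' field). The helper name build_class_index and the toy categories are made up for illustration and are not part of this repository.

```python
def build_class_index(cats):
    """Map possibly non-contiguous category IDs to contiguous indices 0..N-1."""
    sorted_ids = sorted(cats)
    id_to_index = {cat_id: idx for idx, cat_id in enumerate(sorted_ids)}
    index_to_name = [cats[cat_id]["name"] for cat_id in sorted_ids]
    return id_to_index, index_to_name


# Toy data with a gap in the IDs (there is no category 3).
cats = {
    1: {"id": 1, "name": "cat"},
    2: {"id": 2, "name": "dog"},
    4: {"id": 4, "name": "bird"},
}

id_to_index, index_to_name = build_class_index(cats)
print(id_to_index)       # {1: 0, 2: 1, 4: 2}
print(index_to_name[2])  # bird
```

With a mapping like this, the model can train on contiguous class indices while names are still looked up through the original category IDs, instead of relying on positions in coco.cats.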

Comments

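Question about the class loss calculation

Hello, I don't fully understand this part of the loss. https://github.com/yyccR/yolov5_in_tf2_keras/blob/3e6645cbf94d2a1e11c33663e80113daa4590321/loss.py#L142-L152 Are the last two entries of targets the confidence (1) and the index of the best anchor? https://github.com/yyccR/yolov5_in_tf2_keras/blob/3e6645cbf94d2a1e11c33663e80113daa4590321/loss.py#L288-L293 If so, the true_obj and true_cls produced by this split should be that confidence of 1 and the best anchor index. In that case, isn't the class loss at https://github.com/yyccR/yolov5_in_tf2_keras/blob/3e6645cbf94d2a1e11c33663e80113daa4590321/loss.py#L356 being computed against the best anchor index? Or does it have something to do with obj_mask?

opened by whalefa1I 5

NaN results when training with sparse_categorical_crossentropy

Some data produces NaN on this line: https://github.com/yyccR/yolov5_in_tf2_keras/blob/033a1156c1481f4258bf24a4a8215af39682da94/loss.py#L357 I checked the inputs with is_nan and they all look normal. Replacing sparse_categorical_crossentropy with binary_crossentropy makes the problem go away. Do the two compute the loss differently here, and is it acceptable to substitute one for the other?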

opened by whalefa1I 3
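
On the question of whether the two losses differ: the sketch below is a plain-Python illustration, not the repository's loss code and not a diagnosis of the reported NaN. It assumes the class scores come from independent sigmoids (as the v1.1 release notes describe for YoloHead). The clipping constant EPS and the function names are choices made for this sketch.

```python
import math

EPS = 1e-7  # clipping constant; the value is an assumption for this sketch


def _clip(p):
    return min(max(p, EPS), 1.0 - EPS)


def sparse_categorical_ce(probs, label):
    """-log(p[label]): only the true class's score contributes."""
    return -math.log(_clip(probs[label]))


def binary_ce(probs, label):
    """Mean per-class binary cross-entropy against a one-hot target."""
    total = 0.0
    for i, p in enumerate(probs):
        t = 1.0 if i == label else 0.0
        p = _clip(p)
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(probs)


# Independent sigmoid scores: they do not have to sum to 1.
scores = [0.9, 0.8, 0.0]
print(sparse_categorical_ce(scores, 0))  # ignores the high score on class 1
print(binary_ce(scores, 0))              # also penalises the high score on class 1
print(sparse_categorical_ce(scores, 2))  # large but finite thanks to clipping
```

The two functions are different quantities: the sparse form treats the scores as one distribution over classes and reads a single entry, while the binary form scores every class independently, which matches sigmoid outputs. Without the clipping step, a score of exactly 0 for the true class would make the logarithm undefined. Separately, TensorFlow's documentation for tf.nn.sparse_softmax_cross_entropy_with_logits notes that out-of-range labels return NaN on GPU, so the label range may be worth checking; whether that applies to this report is not established.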

Incorrect handling logic in labelme2coco

While using your code to train on my own dataset, I found that labelme2coco.py seems to be missing handling for shape_type == "rectangle", which left the annotations field of my generated JSON file empty. Here are lines 100 to 124 of labelme2coco.py:

    if shape_type == "polygon":
        mask = labelme.utils.shape_to_mask(
            img.shape[:2], points, shape_type
        )
        # cv2.imshow("",np.array(mask, dtype=np.uint8)*255)
        # cv2.waitKey(0)

        if group_id is None:
            group_id = uuid.uuid1()

        instance = (label, group_id)
        # print(instance)

        if instance in masks:
            masks[instance] = masks[instance] | mask
        else:
            masks[instance] = mask
        # print(masks[instance].shape)

        if shape_type == "rectangle":
            (x1, y1), (x2, y2) = points
            x1, x2 = sorted([x1, x2])
            y1, y2 = sorted([y1, y2])
            points = [x1, y1, x2, y1, x2, y2, x1, y2]
        if shape_type == "circle":
        ....

Because these checks sit inside the polygon branch, the code never reaches shape_type == "rectangle" or shape_type == "circle".

opened by aijialin 2
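
One way to avoid the nesting problem described in this report is to dispatch on shape_type at a single level. The sketch below is only an illustration under that assumption, not the repository's fix. The function names are hypothetical, and it works on plain point lists rather than calling labelme.

```python
def rectangle_to_polygon(points):
    """Turn two opposite corner points into a flat four-corner polygon."""
    (x1, y1), (x2, y2) = points
    x1, x2 = sorted([x1, x2])
    y1, y2 = sorted([y1, y2])
    return [x1, y1, x2, y1, x2, y2, x1, y2]


def polygon_to_bbox(flat_points):
    """COCO-style [x, y, width, height] box enclosing a flat polygon."""
    xs = flat_points[0::2]
    ys = flat_points[1::2]
    return [min(xs), min(ys), max(xs) - min(xs), max(ys) - min(ys)]


def shape_to_segmentation(shape_type, points):
    """Check each shape type at the same level so no branch hides another."""
    if shape_type == "rectangle":
        return rectangle_to_polygon(points)
    if shape_type == "polygon":
        return [coord for point in points for coord in point]
    raise ValueError(f"unsupported shape_type: {shape_type}")


seg = shape_to_segmentation("rectangle", [(30, 40), (10, 20)])
print(seg)                   # [10, 20, 30, 20, 30, 40, 10, 40]
print(polygon_to_bbox(seg))  # [10, 20, 20, 20]
```

Raising on unknown shape types makes a silently empty annotations list less likely, since unsupported shapes fail loudly instead of being skipped.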

layers.py

According to ultralytics/yolov5:

https://github.com/ultralytics/yolov5/blob/63ddb6f0d06f6309aa42bababd08c859197a27af/models/common.py#L70-L73

should this code:

https://github.com/yyccR/yolov5_in_tf2_keras/blob/46298d7c98073750176d64896ee9dc01b55c5aca/layers.py#L127-L132

be rewritten as:

    def call(self, inputs, *args, **kwargs):
        y = self.multiheadAttention(self.q(inputs), self.v(inputs), self.k(inputs)) + inputs
        x = self.fc1(x)
        x = self.fc2(x)
        x = x + y
        return x

opened by AugustusHsu 1
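
The structure this proposal describes is a block with two residual connections: one around the attention step and one around the two fully connected layers. In the snippet as posted, fc1 is applied to x before x is assigned; the sketch below assumes the attention output y was meant. It uses plain Python lists and stand-in callables instead of Keras layers, so every name here is hypothetical.

```python
def add(a, b):
    """Element-wise sum of two equal-length vectors."""
    return [x + y for x, y in zip(a, b)]


def transformer_block(inputs, attention, fc1, fc2):
    """Two residual connections: around attention, then around the MLP."""
    y = add(attention(inputs), inputs)  # attention output plus skip connection
    x = fc2(fc1(y))                     # feed-forward applied to y
    return add(x, y)                    # second skip connection


# Stand-in sublayers, chosen only so the arithmetic is easy to follow.
attention = lambda v: [0.5 * e for e in v]
fc1 = lambda v: [2.0 * e for e in v]
fc2 = lambda v: [e + 1.0 for e in v]

print(transformer_block([1.0, 2.0], attention, fc1, fc2))  # [5.5, 10.0]
```

In a real Keras layer the stand-ins would be the attention and Dense sublayers; the point of the sketch is only the order of the two additions.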
What is the mAP on COCO17 val?

Hi @yyccR, thanks for your repo. I want to know whether you can reach the same mAP as the original YOLOv5 (trained on COCO17 train and tested on COCO17 val). Do you also plan to release any pretrained checkpoints?

opened by Tyler-D 1

Releases(v1.1)

v1.1(Jun 24, 2022)
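v1.1 summary:

[1]. Adjusted training=True in the __call__ method of tf.keras.layers.BatchNormalization

[2]. Added TFLite/ONNX export and validation; see /data/h5_to_tflite.py and /data/h5_to_onnx.py

[3]. Changed batch_size in the backbone network; it must now be specified for both training and testing, to avoid FlexOps problems when exporting to TFLite

[4]. YoloHead no longer applies softmax to the classes and uses sigmoid directly, which supports multi-class output

[5]. yolov5s-best.h5 in the release is a retrained weight file for the Kaggle cat and dog face dataset, with a train:test split of 8:2. Approximate val accuracy:

| class | mAP@0.5 | mAP@0.5:0.95 | precision | recall |
| :-: | :-: | :-: | :-: | :-: |
| cat | 0.962680 | 0.672483 | 0.721003 | 0.958333 |
| dog | 0.934285 | 0.546893 | 0.770701 | 0.923664 |
| total | 0.948482 | 0.609688 | 0.745852 | 0.940999 |

[6]. yolov5s-best.tflite in the release is the quantized TFLite model of the yolov5s-best.h5 above; opening it in Netron is recommended for inspecting its inputs and outputs

[7]. yolov5s-best.onnx in the release is the ONNX model of the yolov5s-best.h5 above; opening it in Netron is recommended for inspecting its inputs and outputs

[8]. Android model test results (image not preserved)

That's all for now, keep going! 💪🏻💪🏻💪🏻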
Source code(tar.gz)
Source code(zip)
yolov5s-best.h5(27.51 MB)
yolov5s-best.onnx(27.25 MB)
yolov5s-best.tflite(6.95 MB)
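
The release notes do not say how the "total" row of the v1.1 accuracy table is computed. The numbers appear consistent with an unweighted mean over the two classes, which the short check below illustrates. The first two metric columns are assumed to be mAP@0.5 and mAP@0.5:0.95, and the helper name macro_average is a choice made for this sketch.

```python
# Per-class rows from the v1.1 table: (mAP@0.5, mAP@0.5:0.95, precision, recall)
per_class = {
    "cat": (0.962680, 0.672483, 0.721003, 0.958333),
    "dog": (0.934285, 0.546893, 0.770701, 0.923664),
}
reported_total = (0.948482, 0.609688, 0.745852, 0.940999)
metric_names = ("mAP@0.5", "mAP@0.5:0.95", "precision", "recall")


def macro_average(rows):
    """Unweighted mean of each metric column across classes."""
    columns = zip(*rows.values())
    return tuple(sum(col) / len(rows) for col in columns)


computed = macro_average(per_class)
for name, got, want in zip(metric_names, computed, reported_total):
    print(f"{name}: computed {got:.6f}, reported {want:.6f}")
```

If the classes had very different numbers of test images, a weighted average could differ from these figures, so the agreement here only suggests, and does not prove, how the row was produced.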
v1.0(Jun 21, 2022)
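v1.0 summary:

[1]. The overall model structure matches ultralytics/yolov5 v6.0

[2]. In the Conv layers, swish is replaced with ReLU

[3]. The data augmentation as a whole matches ultralytics/yolov5

[4]. The dataset needed for the training described in the readme is the public Kaggle cat and dog face detection dataset, which has been added to the release assets

[5]. Why not train on COCO? No resources: a single COCO run takes a long time, and the server always has jobs running, so there has been no free time for it - . -

[6]. yolov5s-best.h5 in the release is the weight file trained on the Kaggle cat and dog face dataset above, with a train:test split of 8:2. Approximate val accuracy:

| class | mAP@0.5 | mAP@0.5:0.95 | precision | recall |
| :-: | :-: | :-: | :-: | :-: |
| cat | 0.905156 | 0.584378 | 0.682848 | 0.886555 |
| dog | 0.940633 | 0.513005 | 0.724036 | 0.934866 |
| total | 0.922895 | 0.548692 | 0.703442 | 0.910710 |

That's all for now, keep going! 💪🏻💪🏻💪🏻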
Source code(tar.gz)
Source code(zip)
JPEGImages.zip(260.17 MB)
yolov5s-best.h5(27.51 MB)
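
Item [2] of the v1.0 notes says the Conv layers use ReLU instead of swish. As a rough illustration of what that swap changes, the sketch below evaluates both activations on a few scalar inputs, using the standard definitions swish(x) = x * sigmoid(x) and relu(x) = max(0, x). It is a plain-Python comparison, not code from the repository.

```python
import math


def swish(x):
    """Swish (also called SiLU): x * sigmoid(x)."""
    return x / (1.0 + math.exp(-x))


def relu(x):
    """Rectified linear unit: negative inputs become zero."""
    return max(0.0, x)


for x in (-2.0, -0.5, 0.0, 0.5, 2.0):
    print(f"x={x:+.1f}  swish={swish(x):+.4f}  relu={relu(x):+.4f}")
```

Swish lets small negative values through and is smooth around zero, while ReLU clips everything below zero. ReLU is also a simpler operation, though the release notes do not state the reason for the swap.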

Owner

yangcheng
