YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

Last update: Dec 30, 2022

Overview

YOLOv5_DOTA_OBB

YOLOv5 in DOTA_OBB dataset with CSL_label.(Oriented Object Detection)

Datasets and pretrained checkpoint

Datasets : DOTA
Pretrained Checkpoint or Demo Files :
- train,detect_and_evaluate_demo_files.(6666)
- yolov5x.pt.(6666)
- yolov5l.pt.(6666)
- yolov5m.pt.(6666)
- yolov5s.pt.(6666)
- YOLOv5_DOTAv1.5_OBB.pt.(6666)

Fuction

train.py. Train.
detect.py. Detect and visualize the detection result. Get the detection result txt.
evaluation.py. Merge the detection result and visualize it. Finally evaluate the detector

Installation (Linux Recommend, Windows not Recommend)

1. Python 3.8 or later with all requirements.txt dependencies installed, including torch>=1.7. To install run:

$   pip install -r requirements.txt

2. Install swig

$   cd  \.....\yolov5_DOTA_OBB\utils
$   sudo apt-get install swig

3. Create the c++ extension for python

$   swig -c++ -python polyiou.i
$   python setup.py build_ext --inplace

More detailed explanation

想要了解相关实现的细节和原理可以看我的知乎文章:
YOLOv5_DOTAv1.5(遥感旋转目标检测，全踩坑记录);

Usage Example

1. 'Get Dataset'

Split the DOTA_OBB image and labels. Trans DOTA format to YOLO longside format.
You can refer to hukaixuan19970627/DOTA_devkit_YOLO.
The Oriented YOLO Longside Format is:

$  classid    x_c   y_c   longside   shortside    Θ    Θ∈[0, 180)


* longside: The longest side of the oriented rectangle.

* shortside: The other side of the oriented rectangle.

* Θ: The angle between the longside and the x-axis(The x-axis rotates clockwise).x轴顺时针旋转遇到最长边所经过的角度

WARNING: IMAGE SIZE MUST MEETS 'HEIGHT = WIDTH'

2. 'train.py'

All same as ultralytics/yolov5. You better train demo files first before train your custom dataset.
Single GPU training:

$ python train.py  --batch-size 4 --device 0

Multi GPU training: DistributedDataParallel Mode

python -m torch.distributed.launch --nproc_per_node 4 train.py --sync-bn --device 0,1,2,3

3. 'detect.py'

Download the demo files.
Then run the demo. Visualize the detection result and get the result txt files.

$  python detect.py

4. 'evaluation.py'

Run the detect.py demo first. Then change the path with yours:

evaluation
(
        detoutput=r'/....../DOTA_demo_view/detection',
        imageset=r'/....../DOTA_demo_view/row_images',
        annopath=r'/....../DOTA_demo_view/row_DOTA_labels/{:s}.txt'
)
draw_DOTA_image
(
        imgsrcpath=r'/...../DOTA_demo_view/row_images',
        imglabelspath=r'/....../DOTA_demo_view/detection/result_txt/result_merged',
        dstpath=r'/....../DOTA_demo_view/detection/merged_drawed'
)

Run the evaluation.py demo. Get the evaluation result and visualize the detection result which after merged.

$  python evaluation.py

有问题反馈

在使用中有任何问题，欢迎反馈给我，可以用以下联系方式跟我交流

知乎（@略略略）
代码问题提issues,其他问题请知乎上联系

感激

感谢以下的项目,排名不分先后

关于作者

  Name  : "胡凯旋"
  describe myself："咸鱼一枚"

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

Related tags

Overview

YOLOv5_DOTA_OBB

Datasets and pretrained checkpoint

Fuction

Installation (Linux Recommend, Windows not Recommend)

More detailed explanation

Usage Example

有问题反馈

感激

关于作者

Owner

governance proposal to make fei redeemable for eth

This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.

基于Paddle框架的PSENet复现

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

A machine learning software for extracting information from scholarly documents

Document Layout Analysis

Image augmentation for machine learning experiments.

Python Computer Vision application that allows users to draw/erase on the screen using their webcam.

Some bits of javascript to transcribe scanned pages using PageXML

A simple component to display annotated text in Streamlit apps.

Creating a virtual tv using opencv in python3.

A curated list of resources dedicated to scene text localization and recognition

Image processing in Python

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Morphological edge detection or object's boundary detection using erosion and dialation in OpenCV python

SRA's seminar on Introduction to Computer Vision Fundamentals

Generate text images for training deep learning ocr model

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

Official PyTorch implementation for "Mixed supervision for surface-defect detection: from weakly to fully supervised learning"

Sign Language Recognition service utilizing a deep learning model with Long Short-Term Memory to perform sign language recognition.