This repository summarized computer vision theories.

Last update: Feb 04, 2022

Related tags

Computer Vision CV_theory

Overview

CV_theory

Basic Overview

This repository summarized computer vision theories.

PSNR

mse = np.mean((img1 - img2) ** 2)
# MSE 구하는 식

PLXEL_MAX = 255.0
# 8bit MAX는 255의 값을 가짐

return 20 * math.log10(PLXEL_MAX/math.sqrt(mse))
#PSNR 구하는 식

[output]
openCV를 이용한 PSNR : 52.37698680492553
주어진 수식을 이용한 함수구현 : 52.37698680492553

Color transform

for i in range(height):
    for j in range(width):
        y2[i][j] = 0.299 * r[i][j] + 0.587 * g[i][j] + 0.114 * b[i][j]
        cb2[i][j] = (-0.172*r[i][j]) - (0.339*g[i][j]) + (0.511*b[i][j]) + 128
        cr2[i][j] = (0.511*r[i][j])- (0.428*g[i][j]) - (0.083*b[i][j]) + 128
# RGB 영상을 YCbCr로 변환 수식

for i in range(height):
    for j in range(width):
        r[i][j] = y2[i][j] + 1.371*(cr2[i][j] - 128)
        g[i][j] = y2[i][j] - 0.698*(cr2[i][j] - 128) - 0.336*(cb2[i][j] - 128)
        b[i][j] = y2[i][j] + 1.732*(cb2[i][j] - 128)
# yCbCr을 RGB 변환 수식

Filterring Smoothing

After converting the original image to Ycrcb, only the Y value was filtered with 3*3 kernels and smoothing was performed.

kernel = np.ones((3, 3), np.float32) / 9
# 3*3 커널값 저장

for i in range(5):
    Y = cv2.filter2D(Y, -1, kernel)
# 5번 필터링

Histogram equalization

height, width, channel = src.shape


hist, bins = np.histogram(Y.flatten(), 256, [0, 256])
# 이미지 히스토그램 구해주기

cdf = hist.cumsum()
# 각 멤버값을 누적하여 더한 1차원 배열 생성

cdf_m = np.ma.masked_equal(cdf, 0)
# cdf에서 값이 0인 부분  mask 처리


cdf_m = (cdf_m - cdf_m.min()) * 255 / (cdf_m.max() - cdf_m.min())
#  균일화 방정식 코드

cdf = np.ma.filled(cdf_m, 0). astype("uint8")
# mask처리된 부분을 o으로 다시 리턴

out = (np.dstack((Y, cr, cb)))
out_rgb = cv2.cvtColor(out, cv2.COLOR_YCrCb2RGB)

img2 = cdf[out_rgb]

dst -> function in cv2 , dst2 -> Self-made function

Hough Line Detection

Contributing

Let's connect 👨‍💻 and forge the future together. 😁 ✌

Check the Repositories and don't forget to give a star. 👇

⭐ From S-jooyoung

This repository summarized computer vision theories.

Related tags

Overview

CV_theory

Basic Overview

PSNR

Color transform

Filterring Smoothing

Histogram equalization

Hough Line Detection

Contributing

Owner

Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.

Course material for the Multi-agents and computer graphics course

A simple component to display annotated text in Streamlit apps.

An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

This tool will help you convert your text to handwriting xD

A facial recognition program that plays a alarm (mp3 file) when a person i seen in the room. A basic theif using Python and OpenCV

Fine tuning keras-ocr python package with custom synthetic dataset from scratch

Ackermann Line Follower Robot Simulation.

Optical character recognition for Japanese text, with the main focus being Japanese manga

Automatically download multiple papers by keywords in CVPR

Semantic-based Patch Detection for Binary Programs

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

FastOCR is a desktop application for OCR API.

2 telegram-bots: for image recognition and for text generation

A fastai/PyTorch package for unpaired image-to-image translation.

Markup for note taking

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

TedEval: A Fair Evaluation Metric for Scene Text Detectors

Tesseract Open Source OCR Engine (main repository)

Source code of RRPN ---- Arbitrary-Oriented Scene Text Detection via Rotation Proposals