SRA's seminar on Introduction to Computer Vision Fundamentals

Overview

Introduction to Computer Vision

This repository includes basics to :

  1. Python
  2. Numpy: A python library
  3. Git
  4. Computer Vision.

The aim of this repository is to provide:

  • A brief idea of algorithms involved in Computer Vision .
  • Introduction to Version Control System: Git and GitHub.
  • Computer Vision and Image Processing basics, idea of implementation of various algorithms involved using numpy (instead of any dedicated image processing library like OpenCV.)
  • Introduction to a commonly used Image Processing Library: OpenCV

Demonstration

Table Of Contents

Comments
  • Add suboptimal 2D convolution

    Add suboptimal 2D convolution

    This pull request intends to add a suboptimal implementation of generic 2D convolution. This is done for the purpose of giving a rough idea to Fys about how to work with python arrays/loops, etc. Fys will be asked to improve this implementation and complete tasks related to convolution on top of it.

    opened by meshtag 5
  • Morphology notes updated.

    Morphology notes updated.

    I have added images for dilation and erosion, replaced the previous gif of dilation and erosion with new ones and added a few lines explaining morphology.

    opened by Aryaman22102002 2
  • Updated cv-basics/

    Updated cv-basics/

    Optimised code and flow as discussed in:

    1. cv-basics/5_opencv_overview.ipynb
    2. python-numpy-basics/7_classes_and_objects.ipynb

    Added an image :

    1. cv-basics/image/bcci.png
    opened by dhairyashah1 1
  • Port to C++ : Assignments related to PIXELS seminar

    Port to C++ : Assignments related to PIXELS seminar

    Is your feature request related to a problem? Please describe. This feature request is created to keep a record of porting and potential addition of new assignments related to the seminar in C++ as discussed in this thread.

    Describe the solution you'd like

    • Create a separate main folder for containing all assignments. Individual assignments related to specific topics might be grouped together inside the main parent folder of assignments.
    • You might chose to add reference links in individual questions, which may provide additional material on a related topic for that question (this is suggested solely for the purpose of providing more (potentially real world) info related to the topic asked in original question and hence, should not in any way lead to the solution).
    enhancement 
    opened by meshtag 0
  • Add Content: Interpolations.

    Add Content: Interpolations.

    Is your feature request related to a problem? Please describe. As discussed in the thread, concepts of interpolation can also be added.

    Describe the solution you'd like

    • Create a implementations of interpolation from scratch using necessary OpenCV C++ API.
    • Add a Makefile to compile and build executables.
    • Add a .md file to explain the theory of interpolations and instructions to build and run the executables.

    Additional context Reference: Ancient Secrets of computer vision.

    Note: Content is not finalised and open for discussion

    enhancement 
    opened by amanchhaparia 0
  • Add Content: Image Storing Formats.

    Add Content: Image Storing Formats.

    Is your feature request related to a problem? Please describe. As discussed in the thread, It is important to have a familiarity of how images are store.

    Describe the solution you'd like

    • Add the theory of basic image storing formats such as .bmp, .tiff, .jpg, png etc.
    • Implement a .cpp file on how image can be read from the bmp format.
      • Consider only 8 bit grayscale BitMap image (Since they are easy to read and contains only 2D form of data).
      • Use simple posix read() api to read the image bitmap file.
      • Directly storing the values of various attributes of image in struct is suggested.
    • A similar example can be added to demonstrate how to edit/write a grayscale bitmap image.
    • Add a Makefile to compile and build the executable.
    • Add a .md file explaining the theory and instructions to build and run the executables.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 2
  • Add Content: Build Systems

    Add Content: Build Systems

    Is your feature request related to a problem? Please describe. As discussed in the thread, Concepts of Build System should be added.

    Describe the solution you'd like

    • Content should be added for manual creating and linking the object files.
    • Importance of build systems.
    • Add the contents for Makefile.
    • Add contents for Cmake.

    Additional context Can refer from here: Embedded Study Group Week 2.

    Note: Content is not finalised and open for discussion.

    enhancement Build-Systems 
    opened by amanchhaparia 0
  • Add Content: C++ basic concepts for seminar.

    Add Content: C++ basic concepts for seminar.

    Is your feature request related to a problem? Please describe. Since the seminar is being ported to C++ as discussed in this thread, it is important to teach some important C++ concepts.

    Describe the solution you'd like

    • Some advance concepts of C++ like handling 2D arrays/vector, pointer etc.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 1
Releases(v1.0)
  • v1.0(Sep 7, 2022)

    This release contains the 1st version of the PIXELS Seminar conducted in 2021. The content of this release is implemented in Python and uses numpy and OpenCV Python API.

    • This release can be used as a reference to basic Image Processing using Python.
    • Contains a tutorial for necessary numpy methods.
    • Tutorials on commonly used OpenCV functions in Python.
    • Implementation of blob detection a very commonly used algorithm in Python.
    Source code(tar.gz)
    Source code(zip)
Owner
Society of Robotics and Automation
The Society of Robotics and Automation is a society for VJTI students. As the name suggests, we deal with Robotics, Machine Vision and Automation .
Society of Robotics and Automation
~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.

cosc428-structor I had an open-ended Computer Vision assignment to complete, and an out-of-copyright book that I wanted to turn into an ebook. Convent

Chad Oliver 45 Dec 06, 2022
Automatically resolve RidderMaster based on TensorFlow & OpenCV

AutoRiddleMaster Automatically resolve RidderMaster based on TensorFlow & OpenCV 基于 TensorFlow 和 OpenCV 实现的全自动化解御迷士小马谜题 Demo How to use Deploy the ser

神龙章轩 5 Nov 19, 2021
A small C++ implementation of LSTM networks, focused on OCR.

clstm CLSTM is an implementation of the LSTM recurrent neural network model in C++, using the Eigen library for numerical computations. Status and sco

Tom 794 Dec 30, 2022
ocroseg - This is a deep learning model for page layout analysis / segmentation.

ocroseg This is a deep learning model for page layout analysis / segmentation. There are many different ways in which you can train and run it, but by

NVIDIA Research Projects 71 Dec 06, 2022
Usando o Amazon Textract como OCR para Extração de Dados no DynamoDB

dio-live-textract2 Repositório de código para o live coding do dia 05/10/2021 sobre extração de dados estruturados e gravação em banco de dados a part

hugoportela 0 Jan 19, 2022
A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

The project is based on older versions of tesseract and other tools, and is now superseded by another project which allows for more granular control o

Maxim 32 Jul 24, 2022
A fastai/PyTorch package for unpaired image-to-image translation.

Unpaired image-to-image translation A fastai/PyTorch package for unpaired image-to-image translation currently with CycleGAN implementation. This is a

Tanishq Abraham 120 Dec 02, 2022
Fun program to overlay a mask to yourself using a webcam

Superhero Mask Overlay Description Simple project made for fun. It consists of placing a mask (a PNG image with transparent background) on your face.

KB Kwan 10 Dec 01, 2022
✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.

Hand Gesture Volume Controller ✋ Hand recognition 👆 Finger recognition 🔊 you can decrease and increase volume Demo Code Firstly I have created a Mod

Abbas Ataei 19 Nov 17, 2022
Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation, CVPR 2020 (Oral)

SEAM The implementation of Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentaion. You can also download the repos

Hibercraft 459 Dec 26, 2022
Localization of thoracic abnormalities model based on VinBigData (top 1%)

Repository contains the code for 2nd place solution of VinBigData Chest X-ray Abnormalities Detection competition. The goal of competition was to auto

33 May 24, 2022
Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)

Detecting Text in Natural Image with Connectionist Text Proposal Network The codes are used for implementing CTPN for scene text detection, described

Tian Zhi 1.3k Dec 22, 2022
TableBank: A Benchmark Dataset for Table Detection and Recognition

TableBank TableBank is a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on th

844 Jan 04, 2023
Opencv-image-filters - A camera to capture videos in real time by placing filters using Python with the help of the Tkinter and OpenCV libraries

Opencv-image-filters - A camera to capture videos in real time by placing filters using Python with the help of the Tkinter and OpenCV libraries

Sergio Díaz Fernández 1 Jan 13, 2022
Text to QR-CODE

QR CODE GENERATO USING PYTHON Author : RAFIK BOUDALIA. Installation Use the package manager pip to install foobar. pip install pyqrcode Usage from tki

Rafik Boudalia 2 Oct 13, 2021
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.

Total-Text-Dataset (Official site) Updated on April 29, 2020 (Detection leaderboard is updated - highlighted E2E methods. Thank you shine-lcy.) Update

Chee Seng Chan 671 Dec 27, 2022
Python-based tools for document analysis and OCR

ocropy OCRopus is a collection of document analysis programs, not a turn-key OCR system. In order to apply it to your documents, you may need to do so

OCRopus 3.2k Dec 31, 2022
Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

fernanda rodríguez 85 Jan 02, 2023
python ocr using tesseract/ with EAST opencv detector

pytextractor python ocr using tesseract/ with EAST opencv text detector Uses the EAST opencv detector defined here with pytesseract to extract text(de

Danny Crasto 38 Dec 05, 2022
Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

Convolutional Recurrent Neural Network This software implements the Convolutional Recurrent Neural Network (CRNN), a combination of CNN, RNN and CTC l

Baoguang Shi 2k Dec 31, 2022