Combining Stereo Vision and Deep Learning Techniques for Object Detection in the 3D World

Author : Andrea Gimeno I Jovés
Release : 2020
OCLC : 1224073658

Book Synopsis: Combining Stereo Vision and Deep Learning Techniques for Object Detection in the 3D World by Andrea Gimeno I Jovés

Written by Andrea Gimeno I Jovés and released in 2020. Book excerpt: The objective of this project is to develop a deep learning algorithm that, together with a stereo camera, can detect a person and locate them in the 3D world. The person's location in the x-y plane is obtained from the object detector model, a convolutional neural network (specifically the U-Net) that outputs heat maps. The person's location in depth (z) is obtained from the depth map given by the ZED stereo camera. The document begins by presenting the techniques used today for object detection with heat maps. This is followed by an explanation of the key theory behind neural networks, from the simplest neural networks to convolutional neural networks. The theoretical part of the project closes with a presentation of the hardware and software used.

To develop and implement the deep learning algorithm, the first step is the creation of the dataset. Images are selected and prepared as input to the network, and the model is trained (using PyTorch) and adapted to the needs of this task. Eight different combinations of parameters are used, yielding eight different models. The metric used to evaluate and compare the models, and to choose the one that best suits this application, is defined beforehand. Once the final model is chosen, it is stored on the Jetson AGX Xavier and tested using ZED camera images. In this test, the model is verified to detect people accurately, and the cases where the algorithm fails are identified.
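As an illustration of how a heat-map output can be turned into an (x, y) location, here is a minimal Python sketch. It is not taken from the thesis: the function name `heatmap_peak`, the 0.5 threshold, and the choice of a simple global maximum are all assumptions, and the heat map is a hand-made toy array rather than a real U-Net output.

```python
import numpy as np

def heatmap_peak(heatmap, threshold=0.5):
    """Return (x, y, score) of the strongest response, or None if it is too weak.

    `threshold` is a hypothetical confidence cut-off, not a value from the source.
    """
    heatmap = np.asarray(heatmap, dtype=float)
    # argmax works on the flattened array; unravel_index maps it back to (row, col).
    row, col = np.unravel_index(np.argmax(heatmap), heatmap.shape)
    score = float(heatmap[row, col])
    if score < threshold:
        return None
    # Image convention: x is the column index, y is the row index.
    return int(col), int(row), score

# Toy 5x5 heat map with a single strong response at row 3, column 1.
demo = np.zeros((5, 5))
demo[3, 1] = 0.9
peak = heatmap_peak(demo)
print(peak)
```

Taking the global maximum finds at most one person per frame; handling several people would need a different peak search, which this sketch does not attempt.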
The next step of this project consists of applying stereo vision techniques to extract the distance to the detected person. A ROS node is created to connect the ZED camera to the deep learning algorithm. Once the node is ready, it is executed to test the whole program in real time. The ZED color images are passed through the network to detect the person (x, y), and the distance (z) is obtained from the ZED depth map. From the results obtained, both for person detection and for distance extraction, the existing errors in the designed algorithm are identified, and improvements are made by applying filters and code modifications. Thanks to these improvements, a sufficiently precise algorithm is obtained, capable of detecting a person within a distance range in real time.
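The depth lookup described above can be sketched in Python as follows. The excerpt does not say which filters the author applied, so the median over a small window is an assumed stand-in, as are the function names, the window size, and the camera intrinsics (`fx`, `fy`, `cx`, `cy`). The sketch also assumes that invalid depth pixels show up as NaN, infinite, or non-positive values, and it adds a standard pinhole back-projection to metric coordinates that the excerpt does not mention.

```python
import numpy as np

def median_depth(depth_map, x, y, half_window=2):
    """Median of the valid depth values in a window centred on pixel (x, y).

    Returns None when the window holds no valid measurement.
    """
    depth_map = np.asarray(depth_map, dtype=float)
    height, width = depth_map.shape
    # Clamp the window to the image borders.
    r0, r1 = max(0, y - half_window), min(height, y + half_window + 1)
    c0, c1 = max(0, x - half_window), min(width, x + half_window + 1)
    patch = depth_map[r0:r1, c0:c1]
    # Assumption: NaN, infinite, and non-positive values mark invalid pixels.
    valid = patch[np.isfinite(patch) & (patch > 0)]
    if valid.size == 0:
        return None
    return float(np.median(valid))

def pixel_to_camera(x, y, z, fx, fy, cx, cy):
    """Back-project pixel (x, y) at depth z with the pinhole camera model."""
    return ((x - cx) * z / fx, (y - cy) * z / fy, z)

# Toy depth map: a flat surface 2.0 units away, with a hole at the detection pixel.
depth = np.full((9, 9), 2.0)
depth[4, 4] = np.nan
z = median_depth(depth, 4, 4)
# Hypothetical intrinsics; real values come from the camera calibration.
point = pixel_to_camera(4, 4, z, fx=700.0, fy=700.0, cx=4.0, cy=4.0)
print(z, point)
```

Reading a window instead of a single pixel keeps one missing or noisy depth value at the detected location from corrupting the distance estimate.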


Combining Stereo Vision and Deep Learning Techniques for Object Detection in the 3D World Related Books

Combining Stereo Vision and Deep Learning Techniques for Object Detection in the 3D World
Language: en
Authors: Andrea Gimeno I Jovés
Type: BOOK - Published: 2020
Object Detection by Stereo Vision Images
Language: en
Pages: 293
Authors: R. Arokia Priya
Categories: Computers
Type: BOOK - Published: 2022-09-14 - Publisher: John Wiley & Sons

OBJECT DETECTION BY STEREO VISION IMAGES Since both theoretical and practical aspects of the developments in this field of research are explored, including rece
Object Detection with Deep Learning Models
Language: en
Pages: 345
Authors: S Poonkuntran
Categories: Computers
Type: BOOK - Published: 2022-11-01 - Publisher: CRC Press

Object Detection with Deep Learning Models discusses recent advances in object detection and recognition using deep learning methods, which have achieved great
Representations and Techniques for 3D Object Recognition and Scene Interpretation
Language: en
Pages: 172
Authors: Derek Hoiem
Categories: Computers
Type: BOOK - Published: 2011 - Publisher: Morgan & Claypool Publishers

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduce
Visual Object Tracking with Deep Neural Networks
Language: en
Pages: 208
Authors: Pier Luigi Mazzeo
Categories: Computers
Type: BOOK - Published: 2019-12-18 - Publisher: BoD – Books on Demand

Visual object tracking (VOT) and face recognition (FR) are essential tasks in computer vision with various real-world applications including human-computer inte