Deep Learning in Object Recognition, Detection, and Segmentation

Deep Learning in Object Recognition, Detection, and Segmentation
Author :
Publisher :
Total Pages : 165
Release :
ISBN-10 : 1680831178
ISBN-13 : 9781680831177
Rating : 4/5 (78 Downloads)

Book Synopsis Deep Learning in Object Recognition, Detection, and Segmentation by : Xiaogang Wang

Download or read book Deep Learning in Object Recognition, Detection, and Segmentation written by Xiaogang Wang and published by . This book was released on 2016 with total page 165 pages. Available in PDF, EPUB and Kindle. Book excerpt: As a major breakthrough in artificial intelligence, deep learning has achieved very impressive success in solving grand challenges in many fields including speech recognition, natural language processing, computer vision, image and video processing, and multimedia. This article provides a historical overview of deep learning and focus on its applications in object recognition, detection, and segmentation, which are key challenges of computer vision and have numerous applications to images and videos. The discussed research topics on object recognition include image classification on ImageNet, face recognition, and video classification. The detection part covers general object detection on ImageNet, pedestrian detection, face landmark detection (face alignment), and human landmark detection (pose estimation). On the segmentation side, the article discusses the most recent progress on scene labeling, semantic segmentation, face parsing, human parsing and saliency detection. Object recognition is considered as whole-image classification, while detection and segmentation are pixelwise classification tasks. Their fundamental differences will be discussed in this article. Fully convolutional neural networks and highly efficient forward and backward propagation algorithms specially designed for pixelwise classification task will be introduced. The covered application domains are also much diversified. Human and face images have regular structures, while general object and scene images have much more complex variations in geometric structures and layout. Videos include the temporal dimension. Therefore, they need to be processed with different deep models. All the selected domain applications have received tremendous attentions in the computer vision and multimedia communities. Through concrete examples of these applications, we explain the key points which make deep learning outperform conventional computer vision systems. (1) Different than traditional pattern recognition systems, which heavily rely on manually designed features, deep learning automatically learns hierarchical feature representations from massive training data and disentangles hidden factors of input data through multi-level nonlinear mappings. (2) Different than existing pattern recognition systems which sequentially design or train their key components, deep learning is able to jointly optimize all the components and crate synergy through close interactions among them. (3) While most machine learning models can be approximated with neural networks with shallow structures, for some tasks, the expressive power of deep models increases exponentially as their architectures go deep. Deep models are especially good at learning global contextual feature representation with their deep structures. (4) Benefitting from the large learning capacity of deep models, some classical computer vision challenges can be recast as high-dimensional data transform problems and can be solved from new perspectives. Finally, some open questions and future works regarding to deep learning in object recognition, detection, and segmentation will be discussed.


Deep Learning in Object Recognition, Detection, and Segmentation Related Books

Deep Learning in Object Recognition, Detection, and Segmentation
Language: en
Pages: 165
Authors: Xiaogang Wang
Categories: Machine learning
Type: BOOK - Published: 2016 - Publisher:

DOWNLOAD EBOOK

As a major breakthrough in artificial intelligence, deep learning has achieved very impressive success in solving grand challenges in many fields including spee
Deep Learning in Object Recognition, Detection, and Segmentation
Language: en
Pages: 186
Authors: Xiaogang Wang
Categories:
Type: BOOK - Published: 2016-07-14 - Publisher: Foundations and Trends (R) in Signal Processing

DOWNLOAD EBOOK

Deep Learning in Object Recognition, Detection, and Segmentation provides a comprehensive introductory overview of a topic that is having major impact on many a
Practical Machine Learning for Computer Vision
Language: en
Pages: 481
Authors: Valliappa Lakshmanan
Categories: Computers
Type: BOOK - Published: 2021-07-21 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

This practical book shows you how to employ machine learning models to extract information from images. ML engineers and data scientists will learn how to solve
Deep Learning for Computer Vision
Language: en
Pages: 564
Authors: Jason Brownlee
Categories: Computers
Type: BOOK - Published: 2019-04-04 - Publisher: Machine Learning Mastery

DOWNLOAD EBOOK

Step-by-step tutorials on deep learning neural networks for computer vision in python with Keras.
Visual Object Recognition
Language: en
Pages: 184
Authors: Kristen Grauman
Categories: Computers
Type: BOOK - Published: 2011 - Publisher: Morgan & Claypool Publishers

DOWNLOAD EBOOK

The visual recognition problem is central to computer vision research. From robotics to information retrieval, many desired applications demand the ability to i