Modern Computer Vision with PyTorch: Concepts and hands-on Implementations of over 50 real-world image applications of deep learning

by V Kishore Ayyadevara, Yeshwanth Reddy

Length: 647 pages
Edition: 1
Language: English
Publisher: Packt Publishing
Publication Date: 2020-12-09
ISBN-10: 1839213477
ISBN-13: 9781839213472

Description

This book is packed with hands-on implementations of deep learning techniques for building image processing applications using PyTorch. Each chapter is accompanied by a GitHub folder with code notebooks and questions to cement your understanding.

Key Features

Implement solutions to 50 real-world computer vision applications using PyTorch
Understand the theory and working details of NN architectures before implementing them
Get acquainted with the best practices by using a custom library we have created solely for this book

Book Description

Deep learning for computer vision (CV) has had a considerable positive impact on several applications.

First, you will learn to implement a neural network (NN) from scratch using both NumPy and PyTorch, and then learn the best practices for tuning an NN’s hyperparameters.

As we progress, you will learn about CNNs and transfer learning, with a focus on classifying images. You will also learn about the practical aspects to take care of when building an NN model.

Next, you will learn about multi-object detection and segmentation, and implement them using the R-CNN family, SSD, YOLO, U-Net, and Mask R-CNN architectures. You will then learn to use the Detectron2 framework to simplify the process of building an NN for object detection and human pose estimation. Finally, you will implement 3D object detection.

Subsequently, you will learn about autoencoders and GANs, with a strong focus on image manipulation and generation. Here, you will implement VAE, DCGAN, CGAN, Pix2Pix, CycleGAN, StyleGAN2, SRGAN, and style transfer.

You will then learn to combine NLP and CV techniques while performing OCR, image captioning, and object detection with transformers. Next, you will learn to combine RL with CV techniques to implement a self-driving car agent.

Finally, you’ll wrap up by moving an NN model to production and learning conventional CV techniques using the OpenCV library.

What you will learn

Train a neural network from scratch in NumPy and then in PyTorch
Implement 2D and 3D multi-object detection and segmentation
Generate digits, deepfakes, and HD faces with autoencoders and advanced GANs
Manipulate images using CycleGAN, Pix2PixGAN, StyleGAN2, and SRGAN
Combine CV and NLP to perform OCR, image captioning, and object detection
Combine CV and RL to build agents that play Pong and self-drive a car
Deploy a deep learning model on an AWS server using FastAPI and Docker
Dive deep and implement over 35 NN architectures and common OpenCV utilities

Who This Book Is For

This book is for beginners to PyTorch and intermediate-level machine learning practitioners who want to become well-versed in computer vision techniques using deep learning and PyTorch. Those who are just getting started with neural networks will also find this book useful. Basic knowledge of the Python programming language and machine learning is all you need to get started.
