AI Powered Computer Vision (AIARC-103)

AI Powered Computer Vision is a comprehensive 3-day course that prepares students for obtaining and organizing data for fine-tuning a pre-trained ViT (Vision Transformer)! Select your own ViT Image Classification model, and fine-tune it with datasets you’ve prepared, or gathered. This course offers a combination of lectures and hands-on labs, providing participants with a solid understanding of AI concepts and the skills to design and implement AI solutions.


Throughout the course, you will learn about ViT transformer-based architectures, the fundamentals of Python programming for AI models, and the deployment of open source Transformer models. You will begin with data preparation by gathering, cleaning, labeling, and organizing data for the purpose of fine-tuning a ViT.


You will gain practical experience through hands-on exercises with open source ViT frameworks, allowing you to work with fine-tuned models and run workloads on different models to understand their strengths and weaknesses. Additionally, the course covers model formats and provides in-depth exploration of AI programming environments like PyTorch.


Towards the end of the course, you will delve into advanced topics such as context extension through fine-tuning and quantization for a specific application target environment: Road Surface Image Classification. By the completion of the course, you will have the opportunity to earn an AI certification from Alta3 Research, further enhancing your credentials in the field of Artificial Intelligence. This course is ideal for Python Developers, DevSecOps Engineers, and Managers or Directors seeking a overview of AI and its practical application in the enterprise.


Course Information

Price: $2,195.00
Duration: 3 days
Certification: 
Exam: 
Learning Credits:
Course Delivery Options

Check out our full list of training locations and learning formats. Please note that the location you choose may be an Established HD-ILT location with a virtual live instructor.

Train face-to-face with the live instructor.

Access to on-demand training content anytime, anywhere.

Attend the live class from the comfort of your home or office.

Interact with a live, remote instructor from a specialized, HD-equipped classroom near you. An SLI sales rep will confirm location availability prior to registration confirmation.

All Sunset Learning dates are guaranteed to run!

Register

Prerequisites:

  • Python – PCEP Certification or Equivalent Experience
  • Familiarity with Linux
  • Familiarity with Natural Language Processing


 

Target Audience:

  • Project Managers
  • Architects
  • Developers
  • Data Acquisition Specialists


 

Course Objectives:

  • Understand Visual Transformer Model architecture.
  • Describe the role of patches and patch embeddings in ViT.
  • Install and use ViT frameworks like Google ViT, Microsoft BEiT, and Swin.
  • Prepare Dataset for ViT.
  • Establish guidelines for obtaining RAW Data for ViT Fine-Tuning.
  • Clean and Curate Data for ViT Fine-Tuning.
  • Fine-Tune ViT Models with PyTorch.
  • Deploy and Maximize ViT Model Performance.
  • Perform End-to-End procedure for Fine-Tuning a Road Surface ViT.


 

Course Outline:

The Visual Transformer Model

  • Lecture: What is Intelligence?
  • Lecture: Generative AI
  • Lecture: The Transformer Model
  • Lecture: Feed Forward Neural Networks

Computer Vision

  • Lecture: Introduction to Computer Vision
  • Lecture: NLP to ViT: Key Modifications
  • Lecture + Lab: Patch Embedding
  • Lecture + Lab: Positional Encoding in Vison Transformer
  • Lecture: CNN vs ViT – A Comparison

Pre-trained ViT

  • Lecture: Preparing A100 for Server Operations
  • Lecture: Selecting a Pre-Trained ViT Model
  • Lecture + Lab: Operating Google ViT Model for Face Recognition 
  • Lecture + Lab: Operating Microsoft BEiT Model for Scene Segmentation 

Data Curation for Road Surface ViT

  • Lecture: Curating Data for ViT
  • Lecture + Lab: Gathering Raw Data
  • Lecture + Lab: Data Cleaning and Preparation
  • Lecture + Lab: Data Labeling
  • Lecture + Lab: Data Organization
  • Lecture: Premade Datasets for Fine Tuning
  • Lecture + Lab: Obtain and Prepare Premade Datasets

Fine Tuning for Road Surface Image Classification

  • Lecture: Fine-Tuning a Pre-Trained ViT
  • Lecture: PyTorch
  • Lecture + Lab: Fine Tuning ViT with PyTorch
  • Lecture + Lab: Operating our Road Surface Image Classification ViT Model