Computer Vision
Definition
A field of AI that enables machines to interpret and understand visual information from images, videos, and the real world.
Computer Vision (CV) gives machines the ability to see and understand visual content. Core tasks include image classification, object detection, image segmentation, and pose estimation. The field was revolutionized by convolutional neural networks (CNNs), starting with AlexNet in 2012, and has since advanced through architectures like ResNet, YOLO, and Vision Transformers. Applications span autonomous driving, medical imaging, facial recognition, augmented reality, and industrial quality inspection. Modern multimodal models combine computer vision with language understanding, enabling systems to describe images, answer visual questions, and generate images from text.
Related Terms
Deep Learning
A subset of machine learning that uses multi-layered neural networks to learn hierarchical represent...
CNN (Convolutional Neural Network)
A neural network architecture designed for processing grid-structured data like images, using convol...
Text-to-Image
AI systems that generate images from natural language descriptions, typically using diffusion models...