AI Glossary
AI moves fast. Here's a glossary to help you keep up. Plain-English definitions for every major AI term.
161 terms · 24 letters covered · Always free
Accuracy
Accuracy is the proportion of correct predictions out of total predictions made by a classification model, cal…
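As a quick illustration, accuracy can be computed in a few lines (the labels below are made-up example data):

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

print(accuracy([1, 0, 1, 1], [1, 0, 0, 1]))  # 3 of 4 correct -> 0.75
```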
Activation Function
Activation Function is a mathematical function applied to the output of a neural network node that introduces…
Adversarial Attack
Adversarial Attack is a technique where carefully crafted inputs are designed to deceive AI models into making…
Agentic AI
Agentic AI refers to AI systems that can autonomously plan, reason, and take actions to achieve goals without constan…

AI Agent
AI Agent is an autonomous software system that perceives its environment, makes decisions, and takes actions t…
AI Alignment
AI Alignment is the challenge of ensuring AI systems pursue goals that are consistent with human values and in…
AI Ethics
AI Ethics is an interdisciplinary field examining the moral implications of artificial intelligence, addressin…
AI Governance
AI Governance is the frameworks, regulations, and organizational structures that guide the development and dep…
AI Safety
AI Safety is the interdisciplinary field focused on ensuring AI systems operate reliably, ethically, and witho…
AI-as-a-Service
AI-as-a-Service (AIaaS) delivers artificial intelligence capabilities through cloud-based APIs and platforms,…
Alignment Tax
Alignment Tax is the performance cost incurred when making AI models safer and more aligned with human values.…
Annotation
Annotation is the process of labeling data with metadata that AI models can learn from during supervised train…
API
API (Application Programming Interface) — in the AI context, a standardized way for developers to send requests…
Artificial Intelligence
Artificial Intelligence (AI) is a field of computer science focused on building systems capable of performing…
Attention Mechanism
Attention Mechanism is a technique that allows neural networks to focus on the most relevant parts of the inpu…
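A minimal sketch of the scaled dot-product form of attention for a single query vector, using plain Python lists (the vectors are illustrative toy data, not real model weights):

```python
import math

def softmax(xs):
    # Numerically stable softmax: subtract the max before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(query, keys, values):
    """Score each key against the query, softmax the scores into
    weights, and return the weighted sum of the value vectors."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    weights = softmax(scores)
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(len(values[0]))]

# The query matches the first key, so the output leans toward the first value.
out = attention([1.0, 0.0], keys=[[1.0, 0.0], [0.0, 1.0]], values=[[10.0, 0.0], [0.0, 10.0]])
```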
AUC-ROC
AUC-ROC (Area Under the Receiver Operating Characteristic Curve) is a classification performance metric that m…
Autoregressive Model
Autoregressive Model is a type of generative model that produces output one element at a time, with each new e…
Backpropagation
Backpropagation is the algorithm that computes gradients of the loss function with respect to each weight in a…
Batch Size
Batch Size is the number of training examples processed simultaneously in one forward and backward pass of a n…
Benchmark
Benchmark is a standardized test or dataset used to evaluate and compare the performance of AI models. Common…
Bias
Bias in AI refers to systematic errors in model predictions that arise from skewed training data, flawed assum…
BLEU Score
BLEU Score (Bilingual Evaluation Understudy) is a metric for evaluating the quality of machine-translated…
Catastrophic Forgetting
Catastrophic Forgetting is a phenomenon where neural networks lose previously learned knowledge when trained o…
Chain of Thought
Chain of Thought is a prompting technique where AI models show their reasoning step-by-step before arriving at…
Chatbot
A Chatbot is an AI-powered software application that simulates human conversation through text or voice interf…
Chinchilla Scaling
Chinchilla Scaling is a training methodology derived from DeepMind's Chinchilla paper showing that many large…
CLIP
CLIP (Contrastive Language-Image Pre-training) is a model developed by OpenAI that learns visual concepts from…
Cloud AI
Cloud AI refers to artificial intelligence services delivered through cloud computing platforms — including pr…
CNN (Convolutional Neural Network)
CNN (Convolutional Neural Network) is a deep learning architecture designed for processing grid-structured dat…
Code Generation
Code Generation is an AI capability where language models produce functional source code from natural language…
Compute
Compute is the computational resources (GPUs, TPUs) required to train and run AI models. Compute costs are the…
Computer Vision
Computer Vision is an AI discipline that trains machines to interpret and understand visual information from i…
Constitutional AI
Constitutional AI is an AI alignment technique developed by Anthropic where AI systems are trained to follow a…
Context Window
Context Window is the maximum amount of text an AI model can process in a single interaction, measured in toke…
Contrastive Learning
Contrastive Learning is a self-supervised learning technique where models learn by comparing similar and dissi…
CUDA
CUDA (Compute Unified Device Architecture) is NVIDIA's proprietary parallel computing platform and programming m…
Data Augmentation
Data Augmentation is a technique that artificially expands training datasets by applying transformations to ex…
Data Flywheel
Data Flywheel is a self-reinforcing cycle where an AI product generates more user data, which improves the mod…
Data Labeling
Data Labeling is the process of annotating raw data with meaningful tags or categories that enable supervised…
Data Poisoning
Data Poisoning is an attack where malicious data is injected into a training dataset to compromise an AI model…
Dataset
Dataset is a structured collection of data used for training, validating, and testing machine learning models,…
Decoder
Decoder is a neural network component that generates output sequences from encoded representations. In transfo…
Deep Learning
Deep Learning is a machine learning technique that uses multi-layered neural networks (deep neural networks) t…
Depthwise Separable Convolution
Depthwise Separable Convolution is an efficient neural network operation that factorizes a standard convolutio…
Diffusion Model
Diffusion Model is a generative AI architecture that learns to create data by reversing a gradual noise-additi…
Diffusion Models
Diffusion Models are a class of generative AI models that create images by gradually denoising random noise. St…
Direct Preference Optimization (DPO)
Direct Preference Optimization (DPO) is a simplified alternative to RLHF for aligning language models with hum…
Distillation
Distillation is a model compression technique that transfers knowledge from a large teacher model to a smaller…
Distributed Training
Distributed Training is the practice of training AI models across multiple GPUs or machines simultaneously to…
Edge AI
Edge AI is AI processing performed locally on devices (phones, IoT sensors, cars) rather than in the cloud. Ed…
Embedding
Embedding is a dense numerical representation of data (text, images, audio) in a continuous vector space where…
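Similarity between embeddings is usually measured with cosine similarity. A minimal sketch with plain Python vectors (the two-dimensional vectors here are toy examples; real embeddings have hundreds or thousands of dimensions):

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors:
    1.0 = same direction, 0.0 = unrelated, -1.0 = opposite."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))  # identical direction -> 1.0
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))  # orthogonal -> 0.0
```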
Emergent Abilities
Emergent Abilities are capabilities that appear in large language models only at sufficient scale, such as arit…
Encoder
Encoder is a neural network component that processes input data and produces a compressed representation captu…
Encoder-Decoder
Encoder-Decoder is a neural network architecture where an encoder compresses input into a dense representation…
Epoch
Epoch is one complete pass through the entire training dataset during model training. Training typically invol…
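How epochs and batches relate can be sketched as a bare training loop (the dataset size, batch size, and epoch count below are arbitrary examples):

```python
def batches(dataset, batch_size):
    """Yield the dataset in consecutive slices of batch_size."""
    for i in range(0, len(dataset), batch_size):
        yield dataset[i:i + batch_size]

dataset = list(range(1000))
steps = 0
for epoch in range(3):                          # one epoch = one full pass over the data
    for batch in batches(dataset, batch_size=32):
        steps += 1                              # one optimization step per batch

# 1000 examples at batch size 32 -> 32 steps per epoch (the last batch has 8
# examples), so 3 epochs take 96 optimization steps in total.
```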
Explainability
Explainability is the degree to which an AI model's decision-making process can be understood by humans. Expla…
F1 Score
F1 Score is the harmonic mean of precision and recall, providing a single metric that balances both false posi…
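Precision, recall, and F1 all fall out of the confusion-matrix counts. A minimal sketch (the counts below are made-up example data):

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall, and their harmonic mean (F1)
    from true-positive, false-positive, and false-negative counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

p, r, f1 = precision_recall_f1(tp=8, fp=2, fn=8)  # p=0.8, r=0.5
```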
Feature Store
Feature Store is a centralized repository for storing, managing, and serving machine learning features across…
Federated Learning
Federated Learning is a machine learning approach where models are trained across multiple decentralized devic…
Few-Shot Learning
Few-Shot Learning is the ability of AI models to learn new tasks from just a handful of examples, rather than…
Fine-Tuning
Fine-Tuning is the process of further training a pre-trained model on a smaller, task-specific dataset to adap…
Flash Attention
Flash Attention is an optimized attention algorithm that dramatically reduces the memory requirements and spee…
Foundation Model
Foundation Model is a large AI model trained on broad data that can be adapted to many downstream tasks throug…
Frontier Models
Frontier Models are the most capable and advanced AI models available at any given time. As of 2026, frontier m…
GAN (Generative Adversarial Network)
GAN (Generative Adversarial Network) is a neural network architecture consisting of two networks — a generator…
Generative AI
Generative AI refers to artificial intelligence systems that create new content — text, images, video, audio,…
GPT (Generative Pre-trained Transformer)
GPT (Generative Pre-trained Transformer) is OpenAI's family of autoregressive language models that predict the n…
GPU
GPU (Graphics Processing Unit) is a specialized processor originally designed for rendering graphics but now e…
GPU Cloud
GPU Cloud refers to cloud computing services that provide on-demand access to GPU hardware for AI training and infere…
Gradient Descent
Gradient Descent is the fundamental optimization algorithm used to train neural networks, iteratively adjustin…
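The core update rule is tiny. A minimal one-dimensional sketch, minimizing an example function f(x) = (x - 3)^2 whose gradient is 2(x - 3):

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient to walk downhill
    toward a minimum of the function."""
    x = x0
    for _ in range(steps):
        x = x - learning_rate * grad(x)
    return x

# Minimize f(x) = (x - 3)^2; the iterates converge to x = 3.
minimum = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
```

Real training does the same thing simultaneously over millions or billions of weights, with the gradients supplied by backpropagation.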
Grounding
Grounding is the process of connecting AI model outputs to verified external data sources to reduce hallucinat…
Guardrails
Guardrails are safety mechanisms built into AI systems to prevent harmful, biased, or inappropriate outputs. Gu…
Hallucination
Hallucination is when AI models generate information that is factually incorrect or fabricated but presented w…
HumanEval
HumanEval is a code generation benchmark created by OpenAI containing 164 hand-written programming problems wi…
Hyperparameter
Hyperparameter is a configuration setting for model training that is set before the learning process begins, a…
In-Context Learning
In-Context Learning is the ability of large language models to adapt their behavior based on examples provided…
Inference
Inference is the process of running a trained AI model to generate predictions or outputs. Inference costs oft…
Inference Cost
Inference Cost is the computational expense of running a trained AI model to generate predictions, measured in…
Inference Endpoint
Inference Endpoint is a deployed API server that hosts a trained AI model and accepts requests to generate pre…
Large Language Model
Large Language Model (LLM) is a neural network with billions or trillions of parameters trained on massive tex…
Latency
Latency in AI systems measures the time delay between sending a request and receiving a response, typically re…
Latent Space
Latent Space is the abstract, lower-dimensional representation space learned by neural networks to encode the…
Learning Rate
Learning Rate is a hyperparameter that controls how much a model adjusts its weights in response to each batch…
LLM (Large Language Model)
LLM (Large Language Model) is an AI model trained on vast amounts of text data, capable of understanding and g…
LoRA (Low-Rank Adaptation)
LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning technique that adds small trainable matrices t…
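The key idea is that the weight update is the product of two small matrices: instead of training a full d×d delta, LoRA trains B (d×r) and A (r×d) with rank r much smaller than d. A toy rank-1 sketch with plain Python lists (the matrices here are illustrative, not real model weights):

```python
def matmul(A, B):
    """Plain-list matrix multiply."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

# Frozen 4x4 weight matrix (identity here, purely for illustration).
W = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]

# Rank-1 trainable factors: B is 4x1 and A is 1x4, so B @ A is a full
# 4x4 update, but only 8 numbers are trained instead of 16.
B = [[0.5], [0.0], [0.0], [0.0]]
A = [[0.0, 1.0, 0.0, 0.0]]
delta = matmul(B, A)

# Adapted weights: W + B @ A, while W itself stays frozen.
W_adapted = [[w + d for w, d in zip(w_row, d_row)] for w_row, d_row in zip(W, delta)]
```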
Loss Function
Loss Function is a mathematical function that quantifies the difference between a model's predictions and actual…
LSTM
LSTM (Long Short-Term Memory) is a specialized recurrent neural network architecture that uses gating mechanis…
Machine Learning
Machine Learning is a subset of artificial intelligence where algorithms learn patterns from data to make pred…
Machine Translation
Machine Translation is an NLP application that automatically translates text or speech from one natural langua…
Mixture of Agents
Mixture of Agents is an AI system architecture where multiple specialized AI agents collaborate to solve compl…
Mixture of Experts
Mixture of Experts is an architecture where multiple specialized sub-networks (experts) are combined, with a r…
MLOps
MLOps is the set of practices for deploying, monitoring, and maintaining machine learning models in production…
MMLU
MMLU (Massive Multitask Language Understanding) is a benchmark comprising 15,908 multiple-choice questions acr…
Model Card
Model Card is a standardized document that accompanies an AI model describing its intended use cases, training…
Model Collapse
Model Collapse is a phenomenon where AI models trained on data generated by other AI models progressively degr…
Model Serving
Model Serving is the infrastructure and process of deploying trained AI models to production environments wher…
Multi-Head Attention
Multi-Head Attention is a transformer architecture mechanism that runs multiple attention computations in para…
Multimodal AI
Multimodal AI refers to AI models that can process and generate multiple types of data — text, images, audio, video —…
Named Entity Recognition
Named Entity Recognition (NER) is an NLP task that identifies and classifies named entities in text — such as…
Natural Language Processing
Natural Language Processing (NLP) is a branch of artificial intelligence that enables computers to understand,…
Neural Architecture Search
Neural Architecture Search is an automated process where AI is used to design optimal neural network architect…
Neural Network
Neural Network is a computing system inspired by biological brain structure, composed of interconnected nodes…
Perplexity
Perplexity is a metric that evaluates language model quality by measuring how well the model predicts a sample…
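Concretely, perplexity is the exponential of the average negative log-probability the model assigned to each token. A minimal sketch (the probabilities below are made-up example values):

```python
import math

def perplexity(token_probs):
    """exp of the mean negative log-probability over the tokens;
    lower is better, and a uniform guess over N options gives N."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# A model that assigns probability 0.25 to every correct token is as
# uncertain as a uniform guess over 4 options: perplexity 4.0.
print(perplexity([0.25, 0.25, 0.25]))
```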
Positional Encoding
Positional Encoding adds information about token position in a sequence to transformer models, which lack inhe…
Pre-training
Pre-training is the initial phase of training a foundation model on massive amounts of unlabeled data to learn…
Precision
Precision is a classification metric that measures the proportion of positive predictions that are actually co…
Prompt Engineering
Prompt Engineering is the art and science of crafting effective inputs to AI models to elicit desired outputs.…
Prompt Injection
Prompt Injection is a security vulnerability where malicious instructions embedded in user input or external d…
RAG (Retrieval-Augmented Generation)
RAG (Retrieval-Augmented Generation) is a technique that enhances LLM responses by retrieving relevant documen…
Reasoning Models
Reasoning Models are AI models specifically designed for complex logical and mathematical reasoning, such as Op…
Recall
Recall is a classification metric measuring the proportion of actual positive cases that the model correctly i…
Red Teaming
Red Teaming is the practice of deliberately probing AI systems for vulnerabilities, biases, and failure modes…
Regularization
Regularization encompasses techniques that prevent neural networks from overfitting training data by adding co…
Reinforcement Learning
Reinforcement Learning is a machine learning paradigm where an AI agent learns optimal behavior through trial…
Reinforcement Learning from Human Feedback (RLHF)
Reinforcement Learning from Human Feedback (RLHF) is a training technique where AI models are fine-tuned using…
Responsible AI
Responsible AI is the practice of developing and deploying AI systems that are fair, transparent, accountable,…
Retrieval
Retrieval is the process of searching and fetching relevant information from external knowledge sources to aug…
RNN (Recurrent Neural Network)
RNN (Recurrent Neural Network) is a neural network architecture designed for sequential data, where the output…
SaaS AI
SaaS AI refers to Software-as-a-Service products that embed AI capabilities as core features. SaaS AI includes tools…
Scaling Laws
Scaling Laws are empirical observations that AI model performance improves predictably as compute, data, and pa…
Self-Supervised Learning
Self-Supervised Learning is a training paradigm where models learn from unlabeled data by predicting missing p…
Sentiment Analysis
Sentiment Analysis is an NLP technique that identifies and classifies the emotional tone expressed in text — p…
Sovereign AI
Sovereign AI is the concept that nations should develop their own AI capabilities, models, and infrastructure…
Speculative Decoding
Speculative Decoding is an inference optimization technique where a smaller, faster draft model generates cand…
Speech-to-Text
Speech-to-Text (STT), also called automatic speech recognition (ASR), is an AI technology that converts spoken…
State Space Model
State Space Model is an alternative to transformer architecture that processes sequences using principles from…
Summarization
Summarization is an NLP task where AI models condense long documents into shorter versions while preserving ke…
Superalignment
Superalignment is OpenAI's research initiative focused on ensuring superintelligent AI systems remain aligned…
Supervised Learning
Supervised Learning is a machine learning paradigm where models are trained on labeled datasets — input-output…
Synthetic Data
Synthetic Data is artificially generated data used to train AI models when real-world data is scarce, expensiv…
System Prompt
A System Prompt is a set of hidden instructions given to an AI model that define its behavior, personality, and constrain…
Temperature
Temperature is a parameter that controls the randomness of AI model outputs. Low temperature (near 0) produces…
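Mechanically, temperature divides the model's logits before the softmax: low values sharpen the distribution toward the top token, high values flatten it. A minimal sketch (the logits below are toy example values):

```python
import math

def softmax_with_temperature(logits, temperature):
    """Scale logits by 1/temperature, then softmax into probabilities."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.0]
cold = softmax_with_temperature(logits, temperature=0.1)  # near-deterministic
hot = softmax_with_temperature(logits, temperature=10.0)  # near-uniform
```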
Test-Time Compute
Test-Time Compute is additional computational resources allocated during inference to improve model performanc…
Text-to-Image
Text-to-Image generation is an AI capability that creates visual images from natural language descriptions, po…
Text-to-Speech
Text-to-Speech (TTS) is an AI technology that converts written text into natural-sounding spoken audio, using…
Text-to-Video
Text-to-Video generation is an AI capability that creates video content from natural language descriptions, wi…
Throughput
Throughput in AI systems measures the rate of data processing, typically reported as tokens per second for lan…
Token
Token is the basic unit of text processed by language models. A token is roughly 3/4 of a word in English. Mod…
Tokenizer
Tokenizer is a component that converts raw text into a sequence of tokens that a language model can process. T…
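A toy greedy longest-match tokenizer illustrates the idea; real tokenizers use learned subword schemes such as BPE, and the vocabulary below is invented for the example:

```python
def tokenize(text, vocab):
    """Greedily match the longest vocabulary entry at each position,
    falling back to single characters for unknown input."""
    tokens = []
    i = 0
    while i < len(text):
        for j in range(len(text), i, -1):  # try longest substrings first
            if text[i:j] in vocab:
                tokens.append(text[i:j])
                i = j
                break
        else:
            tokens.append(text[i])  # unknown character fallback
            i += 1
    return tokens

vocab = {"token", "iz", "ation"}
print(tokenize("tokenization", vocab))  # ['token', 'iz', 'ation']
```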
Tool Use
Tool Use is the ability of an AI model to interact with external tools and APIs — such as web search, code int…
Tool Use (Function Calling)
Tool Use (Function Calling) is the ability of AI models to invoke external tools, APIs, or databases during co…
TPU
TPU (Tensor Processing Unit) is a custom-designed AI accelerator chip developed by Google specifically for mac…
Training Data
Training Data is the dataset used to teach machine learning models patterns and relationships, comprising inpu…
Transfer Learning
Transfer Learning is the practice of applying knowledge learned from one task or domain to improve performance…
Transformer
Transformer is a neural network architecture introduced in 2017 that uses self-attention mechanisms to process…
Underfitting
Underfitting occurs when a machine learning model is too simple to capture the underlying patterns in training…
Unicorn
Unicorn is a privately held startup valued at over $1 billion. The AI sector has produced more unicorns than a…
Unsupervised Learning
Unsupervised Learning is a machine learning approach where models discover hidden patterns, groupings, or stru…
VAE (Variational Autoencoder)
VAE (Variational Autoencoder) is a generative model that learns a compressed latent representation of input da…
Vector Database
Vector Database is a specialized database optimized for storing and querying high-dimensional vector embedding…
Vision-Language Model
Vision-Language Models are AI models that can understand and reason about both images and text simultaneously. V…
Watermarking
Watermarking is a technique for embedding invisible statistical patterns in AI-generated content to enable det…
Weight
Weight is a numerical parameter in a neural network that determines the strength of the connection between neu…
World Model
World Model is an AI system's internal representation of how the world works, enabling it to predict future st…
Know the terms. Now explore the companies.
Awaira tracks 400+ AI companies -- valuations, funding, founders, and more. All free.
Browse Companies