API
Definition
Application Programming Interface — in the AI context, a standardized way for developers to send requests to AI models and receive responses, enabling integration of AI capabilities into applications.
AI APIs have democratized access to frontier models, allowing any developer to integrate world-class AI into their applications with a few lines of code. Major AI APIs include OpenAI's API (GPT models), Anthropic's API (Claude models), Google's Gemini API, and numerous specialized APIs for vision, speech, and other tasks. APIs abstract away the complexity of model deployment, scaling, and hardware management. They typically accept structured requests (JSON with parameters like model name, input text, temperature) and return structured responses. Key API concepts include rate limits, authentication (API keys), streaming (receiving tokens as they're generated), and webhooks. The standardization of AI APIs through specifications like the OpenAI-compatible format has created an ecosystem where applications can easily switch between providers.
Related Terms
Inference
The process of using a trained AI model to generate predictions or outputs on new data, as opposed t...
Model Serving
The infrastructure and systems for deploying trained AI models in production to handle real-time req...
AI-as-a-Service
A cloud-based business model where companies provide access to AI capabilities through APIs or platf...
Token
The basic unit of text that language models process, typically representing a word, subword, or char...