Agentic RAG
Definition
A retrieval workflow where an agent plans searches, uses tools, evaluates evidence, and iterates before answering.
RAG & Vector Retrieval Terms terms and explanations from the Agentic AI Glossary.
Definition
A retrieval workflow where an agent plans searches, uses tools, evaluates evidence, and iterates before answering.
Definition
How well an answer addresses the user's question or task.
Definition
A fast search method that finds near-best vector matches efficiently at scale.
Definition
A classic keyword ranking algorithm used in sparse retrieval and hybrid search.
Definition
Repeated text shared between adjacent chunks to preserve context across boundaries.
Definition
The amount of text placed into each retrieval unit during document processing.
Definition
Splitting documents into smaller passages so retrieval can return useful, focused context.
Definition
Producing references or source links that show where an answer was grounded.
Definition
Reducing retrieved content to the most relevant facts before sending it to the model.
Definition
The share of retrieved context that is actually useful for answering the question.
Definition
How much of the needed evidence the retrieval system successfully found.
Definition
How relevant retrieved context is to the query and final answer.
Definition
Retrieval that uses surrounding context, metadata, or rewritten queries to improve relevance.
Definition
A RAG pattern that detects weak retrieval or unsupported answers and corrects the retrieval or response path.
Definition
A measure of vector similarity based on the angle between vectors.
Definition
A reranker that jointly reads the query and candidate passage to produce a relevance score.
Definition
The process of loading, parsing, cleaning, and preparing source documents for retrieval.
Definition
Extracting text, tables, metadata, and structure from source files or webpages.
Definition
A similarity score calculated by multiplying and summing vector components.
Definition
A vector representation of data used for semantic search and similarity matching.
Definition
The degree to which an answer is supported by the provided context.
Definition
A RAG approach that uses graph relationships among entities, documents, and facts to improve reasoning.
Definition
An answer supported by retrieved evidence, source-of-truth data, or citations.
Definition
Methods for identifying unsupported or fabricated model claims.
Definition
Combining keyword and vector retrieval to improve recall and relevance.
Definition
Search based on exact words, phrases, or lexical matching.
Definition
A retrieval approach that combines large context windows with selected external evidence.
Definition
Restricting retrieval by document attributes such as source, date, owner, product, or permission.
Definition
Retrieving with several query variations to improve coverage of relevant context.
Definition
Adding related terms or variants to a query to improve retrieval coverage.
Definition
Transforming a user question into a better search query or multiple focused queries.
Definition
The end-to-end flow for ingestion, chunking, embedding, retrieval, reranking, and answer generation.
Definition
Reordering retrieved results using a stronger relevance model after initial search.
Definition
A pattern that retrieves relevant external knowledge before generation so responses can be grounded in sources.
Definition
Measuring whether the retrieval system returns useful and complete context.
Definition
Search based on meaning rather than exact keyword matching.
Definition
Finding items closest to a query representation in a vector or feature space.
Definition
A numeric representation of meaning, features, or model state.
Definition
A database optimized for storing embeddings and retrieving semantically similar content.
Definition
An approximate-nearest-neighbor index that finds similar vectors quickly without comparing every vector exactly.
Definition
An open-source vector database commonly used for local RAG prototypes and lightweight embedding search.
Definition
Dense Retrieval is a retrieval method focused on dense. It finds useful information before generation, tool use, grounding, or answer verification.
Definition
A distributed search engine used for keyword search, logs, analytics, and hybrid retrieval systems.
Definition
The term Embedding Drift means an embedding drift concept used in vector database and search terms for practical AI engineering work.
Definition
The term Embedding Space means an embedding space concept used in vector database and search terms for practical AI engineering work.
Definition
A library from Meta for efficient vector similarity search and clustering.
Definition
Hierarchical Navigable Small World, a graph-based index commonly used for fast vector search.
Definition
Hybrid Retrieval is a retrieval method focused on hybrid. It finds useful information before generation, tool use, grounding, or answer verification.
Definition
Inverted file indexing, a vector search method that partitions embeddings into clusters for faster lookup.
Definition
A storage system for metadata data that an AI application can save, query, or retrieve during execution.
Definition
An open-source vector database for storing, indexing, and searching large-scale embedding collections.
Definition
Maximal Marginal Relevance, a retrieval method that balances relevance with diversity in selected results.
Definition
An open-source search and analytics engine used for logs, keyword search, and hybrid retrieval.
Definition
A PostgreSQL extension that stores embeddings and supports vector similarity search inside Postgres.
Definition
A managed vector database service used to store and search embeddings at scale.
Definition
Product quantization, a compression method that reduces vector storage size for faster large-scale search.
Definition
The term Precision@k means a precision at k concept used in vector database and search terms for practical AI engineering work.
Definition
A vector database focused on similarity search, filtering, and production-ready embedding retrieval.
Definition
The term Recall@k means a recall at k concept used in vector database and search terms for practical AI engineering work.
Definition
Semantic Cache stores reusable semantic results. It reduces repeated work, lowers cost, and improves speed when similar requests appear again.
Definition
The term Similarity Threshold means a similarity threshold concept used in vector database and search terms for practical AI engineering work.
Definition
Sparse Retrieval is a retrieval method focused on sparse. It finds useful information before generation, tool use, grounding, or answer verification.
Definition
Top-k Retrieval is a retrieval method focused on top-k. It finds useful information before generation, tool use, grounding, or answer verification.
Definition
An index structure for vector that speeds up lookup, retrieval, or similarity matching.
Definition
A storage system for vector data that an AI application can save, query, or retrieve during execution.
Definition
A vector database that combines semantic search, metadata filtering, and knowledge-oriented data modeling.
Explore more chapters or test your knowledge with quizzes.