ScyllaDB University Live | Free Virtual Training Event
Learn more
ScyllaDB Documentation Logo Documentation
  • Deployments
    • Cloud
    • Server
  • Tools
    • ScyllaDB Manager
    • ScyllaDB Monitoring Stack
    • ScyllaDB Operator
  • Drivers
    • CQL Drivers
    • DynamoDB Drivers
    • Supported Driver Versions
  • Resources
    • ScyllaDB University
    • Community Forum
    • Tutorials
Install
Search Ask AI
ScyllaDB Docs ScyllaDB Cloud Vector Search

Vector Search¶

Vector Search in ScyllaDB¶

Vector Search is a powerful method for efficiently searching and retrieving high-dimensional data based on similarity rather than exact matches. It is particularly useful in AI and machine learning applications, where data is often represented as vectors — mathematical representations of objects such as text, images, audio, or video. In these applications, you typically need to retrieve data that is similar to a given query, rather than relying on keyword-based search or exact matches.

ScyllaDB’s Vector Search feature allows you to store, index, and query high-dimensional vector data at scale. Vector Search is built to work within your existing ScyllaDB infrastructure, taking advantage of its high-performance and highly available architecture.

Common Use Cases¶

  • Semantic search — Find documents or passages that match the meaning of a query, not just the keywords.

  • Retrieval-Augmented Generation (RAG) — Provide relevant context to an LLM by retrieving similar documents from a vector store.

  • Recommendation systems — Find items similar to those a user has interacted with.

  • Image and audio search — Find visually or acoustically similar media.

  • Anomaly detection — Identify outliers far from all clusters in vector space.

  • Deduplication — Find near-duplicate records by identifying vectors that are very close together.

See Common Use Cases in the Concepts page for more details.

Getting Started¶

  • Quick Start Guide
    Walk through setting up a vector-enabled table, inserting data, and running your first similarity search in minutes.

Understanding Vector Search¶

  • Vector Search Concepts
    Architecture overview, HNSW algorithm, CDC-based indexing, and data flow between storage and vector search nodes.
  • Glossary
    Definitions of key terms including ANN, HNSW, embeddings, similarity functions, quantization, filtering, and more.

Deployment and Operations¶

  • Vector Search Deployments
    Create, enable, resize, disable, and monitor Vector Search clusters in ScyllaDB Cloud via the UI or API.
  • Sizing and Capacity Planning
    Estimate memory requirements, understand quantization impact, and choose instance types for your workload.
  • Security
    Authentication, authorization, service-level isolation, and network security for vector search.

Working with Vectors¶

  • Working with Vector Search
    CQL usage guide covering the vector data type, vector indexes, similarity functions, index tuning, ANN queries, and driver integration.
  • Filtering Vector Search Results
    Combine similarity search with metadata constraints using global and local vector indexes.
  • Quantization and Rescoring
    Reduce index memory usage with quantization (f16, i8, b1) and recover precision with oversampling and rescoring.

Troubleshooting and Reference¶

  • Troubleshooting
    Common issues and solutions for index creation, query results, data insertion, performance, and connectivity.
  • FAQ
    Frequently asked questions about similarity functions, dimensions, latency, filtering, quantization, and more.
  • Reference
    Technical reference for instance types, CQL syntax, index options, and Cloud API endpoints.

Examples¶

  • Example Applications
    Learn how to use ScyllaDB Vector Search to build RAG applications, semantic caching layers, and how it integrates with popular LLM libraries like LlamaIndex and LangChain.

Was this page helpful?

PREVIOUS
Immutable (WORM) Backups
NEXT
Quick Start Guide to Vector Search
  • Create an issue

On this page

  • Vector Search
    • Vector Search in ScyllaDB
      • Common Use Cases
    • Getting Started
    • Understanding Vector Search
    • Deployment and Operations
    • Working with Vectors
    • Troubleshooting and Reference
    • Examples
ScyllaDB Cloud
Search Ask AI
  • Get Started
    • What Is ScyllaDB Cloud?
    • Free Trial
    • Quick Start Guide
    • Billing and Pricing
  • Create & Connect to Your Cluster
    • Deployment Overview
    • Choose Your Cluster Type
      • Cluster Types Overview
      • X Cloud Clusters
      • X Cloud Autoscaling Behavior and Best Practices
      • Standard Clusters
    • Deploy to Your Own AWS Account (BYOA)
    • Deploy to Your Own GCP Account (BYOA)
    • Configure Availability Zones
    • Connect to Your Cluster
    • Cluster Setup Best Practices
  • Configure Network Access
    • Network Access Options
    • Configure AWS Transit Gateway (TGW) VPC Attachment Connection
    • Configure Virtual Private Cloud (VPC) Peering with AWS
    • Configure Virtual Private Cloud (VPC) Peering with GCP
    • Migrate a Cluster Connection
    • Check Cluster Availability
    • Glossary for Cluster Connections
  • Operate and Manage Clusters
    • Resize a Cluster
    • Add a Datacenter
    • Delete a Cluster
    • Configure Maintenance Windows
    • Configure Notifications
    • Track Resource Usage
    • Monitor Clusters
    • Monitor with Prometheus
    • Backups
  • Use ScyllaDB
    • Application Best Practices
    • Apache Cassandra Query Language (CQL)
    • ScyllaDB Drivers
    • Data Modeling
    • Tracing
    • Change Data Capture (CDC)
    • Role Based Access Control (RBAC)
    • ScyllaDB Alternator (DynamoDB-compatible API)
    • Lightweight Transactions (LWT)
    • ScyllaDB Integrations
  • Security
    • Security Best Practices
    • Security Concepts
    • Database-level Encryption
    • Storage-level Encryption
    • Client-to-node Encryption
    • Service Users
    • User Management
    • SAML Single Sign-On (SSO)
    • Immutable (WORM) Backups
    • Data Privacy and Compliance
  • Vector Search
    • Quick Start Guide
    • Vector Search Concepts
    • Vector Search Deployments
    • Sizing and Capacity Planning
    • Working with Vector Search
    • Filtering
    • Quantization and Rescoring
    • Security
    • Troubleshooting
    • FAQ
    • Glossary
    • Reference
    • Example Project
  • Cost Optimization
    • Cost Optimization Overview
    • Advanced Internode (RPC) Compression
    • Datacenter Placement and Data Transfer Costs
  • Automate with the ScyllaDB Cloud API
    • Programmatic Access Overview
    • Create a Personal Token for Authentication
    • API Reference
    • API Error Codes
    • Terraform Provider for ScyllaDB Cloud
    • ScyllaDB Cloud MCP Server
  • Get Help
    • FAQ
    • Tutorials
    • Getting Help
Docs Tutorials University Contact Us About Us
© 2026, ScyllaDB. All rights reserved. | Terms of Service | Privacy Policy | ScyllaDB, and ScyllaDB Cloud, are registered trademarks of ScyllaDB, Inc.
Last updated on 29 Jun 2026.
Powered by Sphinx 9.1.0 & ScyllaDB Theme 1.9.2