Snowflake Cortex Search 101: Set Up Fuzzy Search on Snowflake (2025)

Searching through piles of raw, unstructured text data manually is virtually impossible. Building custom search systems from scratch can take days or even months of development, requiring you to manage complex vector databases, fine-tune embedding models, and maintain infrastructure that often fails when you need it most. Traditional keyword-based search systems struggle with semantic understanding and context, while pure vector search can miss exact matches. But if you use Snowflake, you can make use of Snowflake Cortex Search. It delivers fast, "fuzzy" keyword and semantic search across all your Snowflake data with almost no setup or tuning required. In minutes, you have a hybrid (vector + keyword) search index in Snowflake that keeps itself fresh; no servers to manage or algorithms to tweak.

In this article, we'll cover everything you need to know about Snowflake Cortex Search, from understanding its features and use cases, to how the search pipeline works, supported embedding models, and a full step-by-step guide (SQL and UI) to create and deploy your first search service and build real applications.

What is Snowflake Cortex Search?

Snowflake Cortex Search is a fully managed search service built natively into the Snowflake Data Cloud platform. It delivers low-latency, high-quality fuzzy search over your text data stored in Snowflake tables and views. The service supports both natural language queries and traditional keyword searches, returning ranked results based on semantic relevance and lexical matching.

Cortex Search addresses two primary use cases: Retrieval-Augmented Generation (RAG) and enterprise search. We’ll cover these in detail shortly.

Because Snowflake Cortex Search is built into Snowflake, you don’t need separate servers or external infra. It scales with your existing Snowflake account and leverages the same security and governance controls.

Snowflake Cortex Search uses a hybrid search architecture. It combines vector-based embeddings with traditional keyword lookup. There's also a final step that fine-tunes the results using semantic reranking. Each query runs through vector search, keyword search, and semantic reranking to deliver the best possible results without manual tuning. You get high-quality search right away. You don’t need to worry about infrastructure, model serving, or manual fine-tuning.

Key Features and Capabilities of Snowflake Cortex Search

Here are some key features of Snowflake Cortex Search.

Feature 1: Hybrid Search Architecture - Vector Embeddings + Keyword Matching + Neural Reranking

Every query generates two sets of results: semantically similar ones (based on meaning) and lexically similar ones (based on keywords). These results are then intelligently reranked by a neural network to deliver the most relevant outcomes.

Feature 2: Zero Infrastructure Management - Fully Managed Search Service

You simply point Snowflake to your data. Snowflake automatically handles all index building, updates, and serving, eliminating the need to maintain any servers or clusters.

Feature 3: Real-Time Index Updates with Dynamic Refresh & TARGET_LAG

Snowflake Cortex Search services use Snowflake’s change tracking and Snowflake Dynamic Table under the hood. You set a TARGET_LAG (like, 1 hour or 5 minutes) and Snowflake will incrementally update the index automatically as your data changes.

Feature 4: Native Snowflake Platform Integration - SQL, Python SDK & Snowsight Support

Snowflake Cortex Search is fully integrated into Snowflake. You can easily create Snowflake Cortex Search services with SQL or Snowsight, query them via SQL or the Snowflake Python SDK, and use Snowflake’s security model (roles/privileges) natively.

Feature 5: Automatic Data Ingestion & Index Synchronization

When the source tables or query change, the service refreshes itself. You don’t have to manually re-index; just define your target lag, and Snowflake does the rest.

TL;DR: Snowflake Cortex Search provides a powerful, managed search backend directly within your Snowflake account. You focus on your data and queries, and Snowflake manages the underlying search infrastructure.

Snowflake Cortex Search Use Cases

Snowflake Cortex Search has two main uses: retrieval augmented generation, or RAG for short, and enterprise search.

Use Case 1: Retrieval-Augmented Generation (RAG) for LLM Chatbots
- Use Snowflake Cortex Search as the retrieval engine that feeds context to your large language model. It combines vector and keyword search to surface semantically relevant documents for chatbots or AI assistants; no extra embedding pipeline required.
Use Case 2: Enterprise Search & Knowledge Base Applications
- Use Snowflake Cortex Search as a backend for a high-quality search bar embedded in your application.

Want to take Chaos Genius for a spin?

It takes less than 5 minutes.

Enter your work email

How Snowflake Cortex Search Works: Architecture, Pipeline & Processing Stages

Snowflake Cortex Search is designed to be high-quality right away. Under the hood it uses an ensemble of retrieval and ranking models to provide you with a high level of search quality with little to no tuning required. The magic happens through a three-stage process:

Stage 1: Vector Search - Embedding-Based Semantic Similarity Matching

First the query text is embedded into a vector (using your chosen embedding model) and compared to the indexed document vectors. This finds semantically similar results beyond exact keyword matches.

Stage 2: Keyword Search - Traditional Full-Text Lexical Matching

In parallel, a classic full-text keyword match is performed on the raw text. This checks that no direct term matches are missed.

Stage 3: Semantic Reranking - Neural Network-Based Result Optimization

Finally, the union of those results is then passed through a neural re-ranker that scores the top candidates for relevance to the query.

Since it combines meaning and keywords, Snowflake Cortex Search usually beats out traditional keyword engines on fuzzy searches and still catches exact matches. Yet you don’t have to tinker with model parameters or train new models – Snowflake provides the model stack and tuning.

Snowflake Cortex Search Embedding Models

At the vector stage, Snowflake Cortex Search turns text into embeddings. An embedding is simply a numeric vector representation of text that captures meaning.

Snowflake Cortex Search supports several embedding models, each optimized for different use cases:

Model 1: Snowflake-arctic-embed-m-v1.5 (default)

Dimensions: 768
Context Window: 512 tokens
Language Support: English only
Parameters: 110M
Use Case: General-purpose English text with balanced performance and cost

Model 2: Snowflake-arctic-embed-l-v2.0

Dimensions: 1024
Context Window: 512 tokens
Language Support: Multilingual
Parameters: 568M
Use Case: Higher accuracy requirements and non-English content

Model 3: Snowflake-arctic-embed-l-v2.0-8k

Dimensions: 1024
Context Window: 8192 tokens
Language Support: Multilingual
Parameters: 568M
Use Case: Long documents like research papers, books, or extensive documentation

Model 4: Voyage-multilingual-2

Dimensions: 1024
Context Window: 512 tokens
Language Support: Multilingual
Provider: VoyageAI
Use Case: Strong multilingual performance across diverse languages

Note: The embedding model cannot be changed after service creation.

Your choice of model affects indexing speed, query quality, and credit costs (Snowflake charges per token embedded). Note that not all models are available in every Snowflake region. Always pick a model that fits your language needs and double-check the costs in the Service Consumption table.

🔮 How to Create Snowflake Cortex Search Service: Complete Step-by-Step Tutorial

Now, let’s build a Snowflake Cortex Search service from scratch using SQL commands. This will create an index over your data in Snowflake. We’ll assume you have some text data to search.

Prerequisites and Setup

You need a Snowflake account and a role granted privileges to create:

Database
Table and view
Cortex Search service
Warehouse for compute

You'll also need:

A Snowflake table containing the text data you want to search
Snowflake Change tracking enabled on that table (if you want real-time updates)
The SNOWFLAKE.CORTEX_USER role or equivalent permissions

Once all prerequisites are in place, carry out these steps:

Step 1—Log In to Snowflake - Snowsight or SnowSQL Command Line

Sign in to your Snowflake account (you can use Snowsight or SnowSQL command line). Make sure you're using a role with sufficient privileges.

Step 2—Create Database, Warehouse, Schema

Set up your environment with the basic Snowflake objects. Create a dedicated Warehouse (for indexing compute) and a database/schema to hold your data and service.


CREATE DATABASE IF NOT EXISTS snowflake_cortex_search_demo;


CREATE WAREHOUSE IF NOT EXISTS snowflake_cortex_search_wh 
WITH WAREHOUSE_SIZE = 'SMALL' 
AUTO_SUSPEND = 60 
AUTO_RESUME = TRUE;


CREATE SCHEMA IF NOT EXISTS snowflake_cortex_search_demo.search_data;


USE DATABASE snowflake_cortex_search_demo;
USE SCHEMA search_data;
USE WAREHOUSE snowflake_cortex_search_wh;

Creating Snowflake Warehouse, Database and Schema

Step 3—Create Snowflake Table & Populate with Sample Data

Create a table with sample content to search. For this example, we'll use company documents:

CREATE TABLE company_docs (
    doc_id INT AUTOINCREMENT,
    title VARCHAR(500),
    content TEXT,
    category VARCHAR(100),
    created_date TIMESTAMP_LTZ DEFAULT CURRENT_TIMESTAMP(),
    updated_date TIMESTAMP_LTZ DEFAULT CURRENT_TIMESTAMP()
);

Creating and populating Snowflake table with data

Let's now insert some data into company_documents table:

INSERT INTO company_docs (title, content, category) VALUES
('Employee Handbook 2025', 'This handbook covers company policies which includes remote work guidelines, performance review processes, and benefits enrollment procedures. All employees must familiarize themselves with these policies.', 'HR'),
('Q4 Financial Report', 'Revenue increased 15% year-over-year, driven by strong product sales in North America. Operating cost remained controlled at 23% of revenue. Cash flow from operations exceeded $100M.', 'Finance'),
('Product Roadmap Update', 'Our platform will launch three new features this quarter: Autonomous AI Agent for Snowflake, Autonomous AI Agent for Databricks, and Autonomous AI Agent for AWS EMR.', 'Product'),
('Security Incident Response Plan', 'In case of data breach or security incident, follow these procedures: isolate affected systems, notify security team within 15 minutes, document all actions taken, and communicate with stakeholders.', 'Security'),
('Marketing Campaign Results', 'The social media campaign generated 50% more leads than the previous quarter. Email open rates improved to 100%. Conversion rate from lead to customer increased to 60%.', 'Marketing');

Inserting data into Snowflake table - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Inserting data into Snowflake table - Snowflake Cortex Search

(Adapt the table columns and data to your use case.)

Step 4—Enable Change Tracking on Source Table for Incremental Updates (If Needed)

If you want Snowflake Cortex Search to refresh as data changes, enable Snowflake change tracking on the base table. Change tracking is required for incremental index updates.

ALTER TABLE company_docs SET CHANGE_TRACKING = TRUE;

Enabling Snowflake Change Tracking - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Enabling Snowflake Change Tracking - Snowflake Cortex Search

Snowflake will now efficiently detect row-level changes. If your role creating the service doesn’t own the table, you need change tracking to allow refresh.

Step 5—Create Cortex Search Service with SQL CREATE Statement

Now create the search service itself with a SQL command. You define: the search column, any filterable attributes, the Warehouse, target lag, embedding model, and the source query.

-- Create the Snowflake Cortex Search service
CREATE CORTEX SEARCH SERVICE company_docs_search
ON content  -- Column to search within
ATTRIBUTES title, category, created_date  -- Additional columns to return in results
WAREHOUSE = snowflake_cortex_search_wh
TARGET_LAG = '1 hour'  -- How fresh results need to be
AS (
    SELECT doc_id, title, content, category, created_date, updated_date
    FROM company_docs
);

Creating Snowflake Cortex Search Service - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Creating Snowflake Cortex Search Service

ON content tells it which column to index for full-text search.
ATTRIBUTES title, category, created_date means those columns will be indexed for filtering (faceting).
WAREHOUSE = snowflake_cortex_search_wh and TARGET_LAG='1 hour' specify indexing, compute, and freshness.
AS (SELECT ...) query defines the source data. You can even transform or concatenate fields here if needed.

Snowflake will then build the index (this may take a bit of time for larger data).

Note: By default, the snowflake-arctic-embed-m-v1.5 model is used for embeddings. If you want another model, add EMBEDDING_MODEL = 'model_name' in the CREATE statement. Note the embedding model cannot be changed after creation – you’d have to recreate the service.

Step 6—Verify Service Creation & Query Cortex Search Service

Check that your service was created successfully.

To show all Snowflake Cortex Search services:

SHOW CORTEX SEARCH SERVICES;

Displaying all Snowflake Cortex Search Services - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Displaying all Snowflake Cortex Search Services

Finally, you can test the service. One way is the SEARCH_PREVIEW function, which takes a JSON query.

SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    'company_docs_search',
    '{
       "query": "security incident response",
       "columns": ["title", "content", "category"],
       "limit": 1
     }'
  )
)['results'] AS results;

Querying Snowflake Cortex Search Service - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Querying Snowflake Cortex Search Service

This should return the most relevant to “security incident response” in JSON format. Also, you can use the table function CORTEX_SEARCH_DATA_SCAN to inspect indexed rows:

SELECT
  *
FROM
  TABLE (
    CORTEX_SEARCH_DATA_SCAN (
      SERVICE_NAME => 'company_docs_search'
    )
  );

As you can see, it will show the indexed data and even a generated embedding column. If you see your data and reasonable results, your Snowflake Cortex Search service is working perfectly.

🔮 How to Create Snowflake Cortex Search Service Using Snowsight UI: Complete Visual Guide

If you prefer a graphical interface, Snowsight provides a streamlined wizard for creating Cortex Search services.

Prerequisites for Snowsight Setup:

You need:

Snowflake account with Snowsight enabled (it's enabled by default in most accounts)
Role with SNOWFLAKE.CORTEX_USER privileges plus USAGE on your target database and schema. (By default, CORTEX_USER is granted to PUBLIC)
Active Warehouse
Database and schema containing your source table or view
Table with the data you want to index

Step 1—Log In to Snowsight

Navigate to your Snowflake account URL and log in. Snowflake Snowsight is usually the default interface, but you can always access it at 👇

https://app.snowflake.com/

Step 2—Switch the Role

Click your profile icon in the bottom left, then switch to a role that is granted the SNOWFLAKE.CORTEX_USER database role.

In the left navigation menu, look for "AI and ML". And then click on “Studio”.

Navigating to Snowflake AI and ML Studio - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Navigating to Snowflake AI and ML Studio - Snowflake Cortex Search

This section houses all Snowflake Cortex services, including Snowflake Cortex Search.

Navigating to Snowflake Cortex Search - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Navigating to Snowflake Cortex Search

Step 4—Access Cortex Search Section & Launch Setup Wizard

Find the "Cortex Search" section and click the "▶ Try" button. This launches the setup wizard.

Step 5—Select Role & Warehouse for Indexing Compute

Choose the role and the Warehouse for indexing.

Step 6—Choose Target Database & Schema for Service Creation

Select where you want to create the search service. This should be the same database and schema that contains your source table, though it doesn't have to be.

Step 7—Enter Descriptive Service Name

Give your search service a descriptive name. Use something that indicates what you're searching for. Avoid spaces and special characters.

Entering Snowflake Cortex Search Service name - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Entering Snowflake Cortex Search Service name

Step 8—Select Source Table or View Containing Searchable Content

Select the table containing your searchable content. The interface shows all tables in your chosen schema. You can also select a view if you need to join data from multiple tables or filter rows.

Picking Snowflake Source Table - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Picking Snowflake Source Table - Snowflake Cortex Search

Step 9—Select Search Columns

This is the text column that Cortex Search will actually search within. Pick the column with the main content. You can only choose one column here.

Selecting Search Colum - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Selecting Search Colum - Snowflake Cortex Search

Step 10—Select Attribute Column(s) for Filtering & Faceting

Select a set of columns that you want to use as filters when querying the service.

Selecting attribute column - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Selecting attribute column - Snowflake Cortex Search

Step 11—Set Result Columns to Include in Search Index & Query Results

Select the columns that you want to include in the search index. The data in these columns will be included in the search service and will be available for querying.

Selecting column to include in the Snowflake cortex search service - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Selecting column to include in the Snowflake cortex search service

Step 12—Configure your Search Service

Now, finally, select your desired configuration parameters for the service. The first one is Target Lag; it controls how quickly the search index updates when your source data changes. Second, select Embedding Models. Each model may incur a different cost per million input tokens processed. Refer to the Snowflake Service Consumption Table. And finally, select the Warehouse for indexing.

Configuring Snowflake Cortex Search Service - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Configuring Snowflake Cortex Search Service

Click "Create Search Service" and wait. Depending on your data volume, initial indexing takes a few minutes to several hours. You can monitor progress in the Snowsight interface.

Snowflake Snowsight will confirm creation. Behind the scenes it will run similar SQL to build your search index.

You can grant other roles permission to use the service (for example via GRANT USAGE ON CORTEX SEARCH SERVICE support_search TO ROLE analysts;). Once created, the service appears in your <db>.<schema> and you can query it just like the SQL example above.

🔮 Complete Example: Build a Movie Search Application Using Snowflake Cortex Search & Streamlit

Let's now build a complete search application that demonstrates Snowflake Cortex Search in action.

Prerequisites and Setup

You need everything from the SQL guide plus:

Access to create Snowflake Streamlit apps in your Snowflake account
A dataset with meaningful, searchable content

Step 1—Log In to Snowflake with Appropriate Role

(Same as before). Log in to Snowflake. Make sure your role is set to the one that created the service.

Step 2—Create Database, Warehouse, Tables

Run the SQL statements below to create a database and Virtual Warehouse for this tutorial.

CREATE DATABASE IF NOT EXISTS snowflake_cortex_search_demo_db;

CREATE OR REPLACE WAREHOUSE snowflake_cortex_search_demo_wh WITH
     WAREHOUSE_SIZE='X-SMALL'
     AUTO_SUSPEND = 60
     AUTO_RESUME = TRUE
     INITIALLY_SUSPENDED=TRUE;

Creating Snowflake Warehouse, Database and Schema - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Creating Snowflake Warehouse, Database and Schema - Snowflake Cortex Search

Step 3—Download Sample Movie Dataset

Grab a sample dataset from Huggingface (or any source you prefer) and save it as a single CSV (or JSON) file. To follow along with our example, download it directly in your browser using this link:

Movies Dataset

Step 4—Load Movie Dataset into Snowflake Table (Snowsight or SQL)

Now, load the example data into Snowflake before creating a Snowflake cortex search service. You can use Snowflake Snowsight or run SQL.

In Snowflake Snowsight:

Click + Create above the left nav bar, then choose Table › From File.

Creating and populating a dataset in Snowflake table - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Creating and populating a dataset in Snowflake table - Snowflake Cortex Search

Pick your tutorial Warehouse from the top-right drop-down.

Picking Snowflake Warehouse - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Picking Snowflake Warehouse - Snowflake Cortex Search

Drag your CSV or JSON file into the upload dialog.

Uploading Movie Dataset - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Uploading Movie Dataset - Snowflake Cortex Search

Set the database you created and choose the PUBLIC schema.

Selecting and creating Snowflake database and schema - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Selecting and creating Snowflake database and schema - Snowflake Cortex Search

Name the new table movies_data, then click Next.

Creating Snowflake Table - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Creating Snowflake Table - Snowflake Cortex Search

In Load Data into Table, uncheck anything that you do not want to add to the table.

Populating data into Snowflake table - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Populating data into Snowflake table - Snowflake Cortex Search

Click Load.

When loading finishes, click Query Data to open a new worksheet.

Querying Loaded Data - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Querying Loaded Data - Snowflake Cortex Search

Via SQL:

Run a PUT command to stage the file and a COPY INTO statement to load it into movies_data.

Step 5—Enable Change Tracking on Movies Table

Enable change tracking on your new table:

ALTER TABLE MOVIES_DATA SET CHANGE_TRACKING = TRUE;

Step 6—Create Cortex Search Service for Movie Descriptions

Now create a Snowflake search service for the movies.

CREATE CORTEX SEARCH SERVICE movie_search
ON description
ATTRIBUTES title, synopsis
WAREHOUSE = snowflake_cortex_search_demo_wh
TARGET_LAG = '1 hour'
AS (
    SELECT title, description, synopsis
    FROM movies_data
);

Check that your service was created successfully. To show all Snowflake Cortex Search services:

SHOW CORTEX SEARCH SERVICES;

Displaying all Snowflake Cortex Search Services - Snowflake Cortex Search — Displaying all Snowflake Cortex Search Services

You can test the service. You can utilize the SEARCH_PREVIEW function, which takes a JSON query.

SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    'movie_search',
    '{
       "query": "space exploration wormhole future planet",
       "columns": ["title", "description", "synopsis"],
       "limit": 1
     }'
  )
)['results'] AS results;

This should return the most relevant to “space exploration wormhole future planet” in JSON format. Also, you can use the table function CORTEX_SEARCH_DATA_SCAN to inspect indexed rows:

SELECT
  *
FROM
  TABLE (
    CORTEX_SEARCH_DATA_SCAN (
      SERVICE_NAME => 'movie_search'
    )
  );

It will list out all the indexed data and even a generated embedding column. If you see your data and reasonable results, your Snowflake Cortex Search service is working perfectly.

Step 7—Create Streamlit Application in Snowflake

In Snowsight, navigate to "Streamlit" in the left menu and click "+ Streamlit App":

Creating a Streamlit application - Snowflake Cortex – Snowflake Cortex Search – Cortex Search – Cortex Search Service – CREATE CORTEX SEARCH SERVICE – Large Language Models – LLMs – LLM Chatbot – Retrieval Augmented Generation – RAG – RAG Engine – Keyword Search – Semantic search – Snowflake AI – Snowflake AI ML – Fuzzy Search – Vector Search – Snowflake Vector Search – Snowflake Dynamic Table – Snowsight – Snowflake Snowsight – Embedding Models – Snowflake Embedding Models – Snowflake Streamlit – Streamlit – Enterprise Search – Snowflake Change Tracking – Snowflake Enable Change Tracking – Cortex LLM Functions – Snowflake Arctic Embed – Hybrid Search Engine – Snowflake AI & ML Studio – Snowflake AI and ML – Model Context Window – Semantic Reranking – Snowflake AI Search – EMBEDDING_MODEL – Cortex Search Tutorial – Snowflake Service Consumption Table – Materialized Query – Voyage Multilingual — Creating a Streamlit application - Snowflake Cortex Search

Application Title: Enter the name for your Streamlit application.

Select Database and Schema: Choose your snowflake_cortex_search_demo_db database and schema.

Select Warehouse: Choose your snowflake_cortex_search_demo_wh Warehouse.

Write the Application Code:

Now replace the default example application code with the following Streamlit app code:

import streamlit as st
from snowflake.core import Root
from snowflake.snowpark.context import get_active_session

# Constants
DB      = "SNOWFLAKE_CORTEX_SEARCH_DEMO_DB"
SCHEMA  = "PUBLIC"
SERVICE = "MOVIE_SEARCH"

def get_column_specification():
    """
    Retrieves the search_column and all columns for the Cortex search service
    """
    session = get_active_session()
    desc = session.sql(
        f"DESC CORTEX SEARCH SERVICE {DB}.{SCHEMA}.{SERVICE}"
    ).collect()[0]
    st.session_state.search_column = desc.search_column
    st.session_state.columns       = desc.columns.split(",")

def query_cortex_search(query: str, limit: int):
    """
    Run the Cortex search and return results
    """
    session = get_active_session()
    svc = (
        Root(session)
        .databases[DB]
        .schemas[SCHEMA]
        .cortex_search_services[SERVICE]
    )
    docs = svc.search(query, columns=st.session_state.columns, limit=limit)
    return docs.results

def display_results(results):
    st.header(f"🎉 Found {len(results)} Results")
    for i, row in enumerate(results, start=1):
        data = dict(row)

        # figure out which key is the title (case‐insensitive)
        title_key = next(
            (k for k in data.keys() if k.lower() == "title"),
            None
        )
        movie_title = data.get(title_key, "Untitled")

        with st.expander(f"🎬 Result #{i}: {movie_title}", expanded=False):
            # Movie title
            st.subheader(f"🎥 {movie_title}")

            # Description (if present)
            for field, emoji, label in [
                ("description", "📝", "Description"),
                ("synopsis",    "📖", "Synopsis")
            ]:
                # find the actual key for each
                key = next((k for k in data if k.lower() == field), None)
                if key and data.get(key):
                    st.markdown(f"**{emoji} {label}:** {data[key]}")

            # Any other fields
            other = {
                k: v
                for k, v in data.items()
                if k not in {title_key}
                and k.lower() not in {"description", "synopsis", st.session_state.search_column.lower()}
            }
            if other:
                st.markdown("**🔍 Other Details:**")
                for col, val in other.items():
                    st.markdown(f"- **{col}**: {val}")

def main():
    st.set_page_config(page_title="🎬 Cortex Movie Search", layout="wide")
    st.title("🍿 Movie Search App")
    st.markdown("Search through your movie dataset (powered by Snowflake Cortex Search) ✨")

    # Sidebar for inputs
    with st.sidebar:
        st.header("⚙️ Controls")
        query = st.text_input("🔍 Enter your search query", placeholder="e.g. Oppenheimer")
        limit = st.slider("📊 Max Results", min_value=1, max_value=20, value=5)
        st.markdown("---")
        st.caption("Powered by Snowflake Cortex Search")

    # Load service metadata
    get_column_specification()

    # Only run search when the user enters something
    if query:
        results = query_cortex_search(query, limit)
        display_results(results)
    else:
        st.info("🔎 Enter a search term in the sidebar to get started.")

if __name__ == "__main__":
    main()

Here’s a quick overview of the main components in the Streamlit-in-Snowflake code above:

get_column_specification() runs DESC CORTEX SEARCH SERVICE to retrieve and stash the search_column and available columns in st.session_state.
query_cortex_search(query, limit) opens a Snowflake session via get_active_session() and Root, then calls .search(query, columns, limit) on your Cortex service and returns the raw rows.
display_results(results) shows a hit count, locates the “title” field case-insensitively, and for each row renders an expander with the title, optional description/synopsis, and any extra fields.
main() sets up page config, places “Search Query” and “Max Results” controls in a sidebar, invokes metadata fetch once, then on each query runs the search and renders results (or prompts if no query).

Step 8—Run Streamlit Application & Test Semantic Search

Click "Run" to deploy your Streamlit movie search app.

Running a Streamlit application - Snowflake Cortex Search

Now, search for movies with queries like “space movies about saving humanity” and get relevant results. As you can see, the app demonstrates how Snowflake Cortex Search handles semantic matching.

Let's take another example: if you search for “dark superhero stories”, it will show you “The Dark Knight”, even without exact keyword matches.

Similarly, if you search for “dream heist movie”, it will show “Inception” based on conceptual understanding rather than literal terms.

Required Snowflake Privileges for Snowflake Cortex Search

Make sure users have the right grants to use Snowflake Cortex Search:

> Creating a Service

The role creating the service needs the SNOWFLAKE.CORTEX_USER database role. This special role (shared with Cortex LLM functions) allows the CREATE CORTEX SEARCH SERVICE command.

> Querying a Service

Any user querying the service must have USAGE privilege on the service itself and on its database and schema. For example:

GRANT USAGE ON DATABASE <db> TO ROLE analysts;
GRANT USAGE ON SCHEMA <schema> TO ROLE analysts;
GRANT USAGE ON CORTEX SEARCH SERVICE <service_name> TO ROLE analysts;

> Altering/Suspending

If you want to suspend or resume a service (via ALTER CORTEX SEARCH SERVICE ... SUSPEND), the role needs the OPERATE privilege on that service.

Note that, by default, the CORTEX_USER role is granted to PUBLIC, so new users can use it unless you revoke it for tighter security. Always follow your organization's RBAC policies when granting privileges.

Snowflake Cortex Search Cost Breakdown: Complete Pricing Guide & Optimization

Snowflake Cortex Search Service incurs cost via Warehouse compute for indexing, token embedding charges, and GB-month fees for indexed data. Here are the main cost components (for detailed pricing see the Snowflake Service Consumption Table):

Cost Component 1: Warehouse Compute for Index Building & Refreshing

Each time the service initializes or refreshes, it runs the source query on a Warehouse. Those credits depend on your Warehouse size and run time. (If no new data, no indexing is triggered, so no cost that cycle).

Cost Component 2: Token Embedding Charges (EMBED_TEXT Usage)

Every new/changed document’s text gets embedded, and Snowflake charges per token embedded. You pay for EMBED_TEXT_768 or EMBED_TEXT_1024 usage. These costs only occur for rows inserted/updated (incrementally). See Vector Embeddings for more on vector embedding costs.

Cost Component 3: Multi-Tenant Serving Layer (GB-Month Charges)

Snowflake Cortex Search service runs on Snowflake’s multi-tenant serving layer. You pay a monthly rate per GB of indexed (uncompressed) data. This runs continuously while the service is active, even if idle.

Cost Component 4: Snowflake Storage for Indexed Data & Materialized Tables

The indexed search data (and any materialized table) is stored in your Snowflake account, billed at flat $/TB rate.

Cost Component 5: Cloud Services for Change Detection & Refresh Scheduling

Snowflake’s background compute (to detect table changes and schedule refreshes) is billed via the Cloud Services credit bucket. (Snowflake only bills this if it exceeds 10% of your Warehouse costs).

Cost Optimization Strategies for Snowflake Cortex Search:

Select a modest Warehouse size (XSMALL or SMALL often suffices)
Choose embedding models based on your accuracy vs cost requirements
Set reasonable TARGET_LAG values; shorter lags mean more frequent updates and higher costs
Index only the text columns you actually need to search
Monitor usage through Snowflake's cost dashboards

Snowflake Cortex Search Limitations & Constraints: What You Need to Know

Snowflake Cortex Search is powerful, but it has some constraints to be aware of:

Limitation 1: Base Table Size Limit - Maximum 100 Million Rows

The materialized result of the service’s source query must be under 100 million rows. If your query would yield more, the CREATE CORTEX SEARCH SERVICE will fail. (To index very large datasets, contact Snowflake support to discuss limit increases.)

Limitation 2: Query Restrictions in CREATE Statement (Dynamic Table Rules)

The SELECT used in CREATE CORTEX SEARCH SERVICE AS (<query>) follows the same rules as Snowflake Dynamic Tables. That means certain expressions (non-deterministic functions, for example) aren’t allowed. See Snowflake Dynamic Table documentation for details on what’s permitted.

Limitation 3: Token Context Window - Only First 512 Tokens Embedded (Default Models)

When indexing, only the first 512 tokens of each text entry are embedded for vector search. If a document is longer, Cortex Search still includes the full text for keyword search, but anything past 512 tokens won’t affect the semantic vector match. (Snowflake recommends splitting very long texts into smaller chunks for best results).

Limitation 4: No Cloning or Replication Support

Currently, Snowflake Cortex Search Services cannot be cloned or replicated via Snowflake’s cloning/replication features.

Limitation 5: Model & Regional Availability Varies

Not all embedding models are supported in every region. Also, Cortex Search must run in a region that supports the feature. Check the Regional availability chart to be sure your region has the desired models.

Limitation 6: Default 512-Token Context Window for Standard Models

As mentioned above by default, each model supports 512 tokens except the specialized 8k model. So if you have huge documents, consider pre-chunking or use the -8k model. Even then, Snowflake suggests 512-token chunks for best retrieval accuracy

Limitation 7: Language Support Varies by Embedding Model

The default model (snowflake-arctic-embed-m-v1.5) is English-only. If you index non-English text with it, results may suffer. Use a multilingual model for mixed-language data.

Limitation 8: No Custom Embedding Model Support

You're limited to Snowflake's provided embedding models. Can't bring your own fine-tuned models or use embeddings from external services.

What Is the Difference Between Snowflake Cortex Search and Analyst?

Snowflake Cortex Search and Snowflake Cortex Analyst solve different problems, though they're both part of Snowflake's AI toolkit:

Snowflake Cortex Search	🔮	Snowflake Cortex Analyst
Full-text search over unstructured text data (documents, logs, pdfs etc.)	Primary Function	Natural language question-answering over structured data (SQL)
Text fields in tables or views	Data Type	Columns in relational tables (structured data)
Keywords or free-form text; returns relevant document rows	Query Input	Natural language questions ("What were sales last month?"); returns answers
List of matching records (text snippets, documents, plus attributes)	Output	Direct answers/insights (with possibly supporting data)
Vector embeddings + full-text index + semantic reranker	Tech Stack	LLM-based text-to-SQL engine using semantic models, with Snowflake execution
RAG chatbots, enterprise search bars, document retrieval	Usage	Self-service analytics, BI chat assistants, NL-to-SQL queries
SQL/JSON API within Snowflake (in-database)	Deployment	REST API integrated into apps; uses LLMs (Mistral/Llama or OpenAI)
Credit usage for indexing (Warehouse + Embedding) and serving (GB-mo)	Cost	Credits for LLM inference per query, plus underlying Warehouse usage

When Should You Use Snowflake Cortex Search or Snowflake Cortex Analyst?

Choose Cortex Search for:

Document retrieval and search applications
RAG implementation for chatbots
Enterprise knowledge base search
Content discovery and exploration

Choose Cortex Analyst for:

Business intelligence and reporting
Natural language data analysis
Self-service analytics platforms
Automated insight generation

TL:DR; Cortex Search is your go-to when you want to search text fields quickly and return data; Cortex Analyst is the service to ask business questions (it generates SQL under the hood) and get answers. Cortex Search does retrieval, while Cortex Analyst does analytics via LLM

Save up to 30% on your Snowflake spend in a few minutes!

Enter your work email

Conclusion

And that’s a wrap! Snowflake Cortex Search brings powerful, high-quality search capabilities directly to your data in Snowflake. It combines vector embeddings and keyword lookup and delivers robust search over even messy text without the heavy lifting of building a custom search stack. So if you’re building a RAG-powered chatbot or adding a smart search bar to your app, Snowflake Cortex Search can handle pretty much everything. In the right scenarios, Snowflake Cortex Search can save you weeks of development and tuning while giving you modern search quality in minutes.

In this article, we have covered:

What is Snowflake Cortex Search?
How Snowflake Cortex Search Works Under the Hood
Which Embedding Models Does Snowflake Cortex Search Support?
Step-by-Step Guide to Create Snowflake Cortex Search Service
Step-by-Step Guide to Create Snowflake Cortex Search Service via Snowflake Snowsight
Example: Create a Simple Search App Using Snowflake Cortex Search Service and Streamlit
Required Privileges for Cortex Search
Snowflake Cortex Search Cost Breakdown
Limitations of Snowflake Cortex Search
What Is the Difference Between Snowflake Cortex Search and Analyst?

FAQs

What exactly is Snowflake Cortex Search?

A built-in search engine in Snowflake that combines vector embeddings and keyword matching to let you run fast, semantic queries over text stored in your tables.

How do I enable change tracking on my table?

Run ALTER TABLE your_table SET CHANGE_TRACKING = TRUE;. This flags new or updated rows so Cortex can refresh its index incrementally.

What’s the basic SQL to create a Cortex Search service?

CREATE OR REPLACE CORTEX SEARCH SERVICE <service_name>
  ON text_column
  ATTRIBUTES attr1, attr2
  WAREHOUSE = wh_name
  TARGET_LAG = '1 day'
  EMBEDDING_MODEL = 'model_name'
  AS SELECT text_column, attr1, attr2 FROM your_table;

Which role lets me create a Cortex Search service?

Any role granted SNOWFLAKE.CORTEX_USER can issue CREATE CORTEX SEARCH SERVICE.

What privileges do I need to query a search service?

Grant USAGE on the service plus its database and schema to your role.

Can I change the embedding model after creation?

No. To switch models you must drop and recreate the service with a different EMBEDDING_MODEL.

Which embedding models are available?

snowflake-arctic-embed-m-v1.5 (768‑D, English)
snowflake-arctic-embed-l-v2.0 (1024‑D, multilingual)
snowflake-arctic-embed-l-v2.0-8k (1024‑D, 8 k token)
voyage-multilingual-2 (1024‑D, multilingual, open source)

How does Cortex Search incur cost?

Through Warehouse compute for indexing, token embedding charges, GB‑month fees for indexed data, standard storage, and minimal Cloud Services overhead.

Is there a row‑count limit for indexing?

Yes—service creation fails if the AS SELECT returns more than ~100 million rows.

Can I clone or replicate a Cortex Search service?

Not currently. Cortex Search services do not support Snowflake cloning or replication.

What happens if my text exceeds the model’s token window?

Text beyond the context window ( >512 or >8192 tokens) gets truncated for embedding; keyword search still sees the full text.

Does Cortex Search respect row‑level security?

No. Queries run with the service owner’s rights, bypassing RLS on underlying tables.

How do I monitor Cortex Search usage and costs?

Snowflake provides views for this purpose. You can use the ACCOUNT_USAGE.CORTEX_SEARCH_SERVING_USAGE_HISTORY view, which shows how many credits you’ve used for serving (GB-month). The INFORMATION_SCHEMA.CORTEX_SEARCH_SERVICES view (or its ACCOUNT_USAGE equivalent) lists your active services and their indexed data size. You can also check the Snowflake Service Consumption (Billing) table for Cortex Search costs.

Continue Reading

“Chaos Genius has been a game-changer for our DataOps at NetApp. Thanks to the precise recommendations, intuitive interface and predictive capabilities, we were able to lower our Snowflake costs by 28%, yielding us a 20X ROI”

Chaos Genius has given us a much better understanding of what's driving up our data-cloud bill. It's user-friendly, pays for itself quickly, and monitors costs daily while instantly alerting us to any usage anomalies.

Anju Mohan

Director, IT

Simon Esprit

Chief Technology Officer

Join today to get upto
30% Snowflake savings

Join today to get upto 30% Snowflake savings

Unlock Snowflake Savings

Join waitlist

Start Saving. Instantly.

What is Snowflake Cortex Search?

Key Features and Capabilities of Snowflake Cortex Search

Feature 1: Hybrid Search Architecture - Vector Embeddings + Keyword Matching + Neural Reranking

Feature 2: Zero Infrastructure Management - Fully Managed Search Service

Feature 3: Real-Time Index Updates with Dynamic Refresh & TARGET_LAG

Feature 4: Native Snowflake Platform Integration - SQL, Python SDK & Snowsight Support

Feature 5: Automatic Data Ingestion & Index Synchronization

Snowflake Cortex Search Use Cases

How Snowflake Cortex Search Works: Architecture, Pipeline & Processing Stages

Snowflake Cortex Search Embedding Models

🔮 How to Create Snowflake Cortex Search Service: Complete Step-by-Step Tutorial

Prerequisites and Setup

Step 1—Log In to Snowflake - Snowsight or SnowSQL Command Line

Step 2—Create Database, Warehouse, Schema

Step 3—Create Snowflake Table & Populate with Sample Data

Step 4—Enable Change Tracking on Source Table for Incremental Updates (If Needed)

Step 5—Create Cortex Search Service with SQL CREATE Statement

Step 6—Verify Service Creation & Query Cortex Search Service

🔮 How to Create Snowflake Cortex Search Service Using Snowsight UI: Complete Visual Guide

Prerequisites for Snowsight Setup:

Step 1—Log In to Snowsight

Step 2—Switch the Role

Step 3—Navigate to Snowflake AI & ML Studio in Left Navigation Menu

Step 4—Access Cortex Search Section & Launch Setup Wizard

Step 5—Select Role & Warehouse for Indexing Compute

Step 6—Choose Target Database & Schema for Service Creation

Step 7—Enter Descriptive Service Name

Step 8—Select Source Table or View Containing Searchable Content

Step 9—Select Search Columns

Step 10—Select Attribute Column(s) for Filtering & Faceting

Step 11—Set Result Columns to Include in Search Index & Query Results

Step 12—Configure your Search Service

🔮 Complete Example: Build a Movie Search Application Using Snowflake Cortex Search & Streamlit

Prerequisites and Setup

Step 1—Log In to Snowflake with Appropriate Role

Step 2—Create Database, Warehouse, Tables

Step 3—Download Sample Movie Dataset

Step 4—Load Movie Dataset into Snowflake Table (Snowsight or SQL)

Step 5—Enable Change Tracking on Movies Table

Step 6—Create Cortex Search Service for Movie Descriptions

Step 7—Create Streamlit Application in Snowflake

Step 8—Run Streamlit Application & Test Semantic Search

Required Snowflake Privileges for Snowflake Cortex Search

Snowflake Cortex Search Cost Breakdown: Complete Pricing Guide & Optimization

Cost Component 1: Warehouse Compute for Index Building & Refreshing

Cost Component 2: Token Embedding Charges (EMBED_TEXT Usage)

Cost Component 3: Multi-Tenant Serving Layer (GB-Month Charges)

Cost Component 4: Snowflake Storage for Indexed Data & Materialized Tables

Cost Component 5: Cloud Services for Change Detection & Refresh Scheduling

Cost Optimization Strategies for Snowflake Cortex Search:

Snowflake Cortex Search Limitations & Constraints: What You Need to Know

Limitation 1: Base Table Size Limit - Maximum 100 Million Rows

Limitation 2: Query Restrictions in CREATE Statement (Dynamic Table Rules)

Limitation 3: Token Context Window - Only First 512 Tokens Embedded (Default Models)

Limitation 4: No Cloning or Replication Support

Limitation 5: Model & Regional Availability Varies

Limitation 6: Default 512-Token Context Window for Standard Models

Limitation 7: Language Support Varies by Embedding Model

Limitation 8: No Custom Embedding Model Support

What Is the Difference Between Snowflake Cortex Search and Analyst?

When Should You Use Snowflake Cortex Search or Snowflake Cortex Analyst?

Conclusion

FAQs

Tags

Continue Reading

Snowflake Cortex Analyst 101: Launch Your Chat Analytics App (2025)

HOW TO: Extract Data From Documents via Snowflake Document AI (2025)

HOW TO: Use Databricks Assistant for Fast Data Analysis (2025)