28 posts tagged with "release"

Release notes and updates

Spice v1.8.1 (Oct 13, 2025)

October 14, 2025 · 5 min read

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.8.1! 🚀

Spice v1.8.1 is a patch release that adds Acceleration Snapshots Indexes, and includes a number of bug fixes and performance improvements.

What's New in v1.8.1

Acceleration Snapshot Indexes

Management of Acceleration Snapshots has been improved by adopting an Iceberg-inspired metadata.json, which now encodes pointer IDs, schema serialization, and robust checksum and size, which is validate before loading the snapshot.
Acceleration Snapshot Metrics: The following metrics are now available for Acceleration Snapshots:
dataset_acceleration_snapshot_bootstrap_duration_ms: The time it took the runtime to download the snapshot - only emitted when it initially downloads the snapshot.
dataset_acceleration_snapshot_bootstrap_bytes: The number of bytes downloaded to bootstrap the acceleration from the snapshot.
dataset_acceleration_snapshot_bootstrap_checksum: The checksum of the snapshot used to bootstrap the acceleration.
dataset_acceleration_snapshot_failure_count: Number of failures encountered when writing a new snapshot at the end of the refresh cycle. A snapshot failure does not prevent the refresh from completing.
dataset_acceleration_snapshot_write_timestamp: Unix timestamp in seconds when the last snapshot was completed.
dataset_acceleration_snapshot_write_duration_ms: The time it took to write the snapshot to object storage.
dataset_acceleration_snapshot_write_bytes: The number of bytes written on the last snapshot write.
dataset_acceleration_snapshot_write_checksum: The SHA256 checksum of the last snapshot write.

To learn more, see the Acceleration Snapshots Documentation and the Metrics Documentation.

Improved Regular Expression for DuckDB acceleration

Regular expression support has been expanded when using DuckDB acceleration for functions like regexp-like and regexp_match.

For more details, refer to the SQL Reference for the list of available regular expression functions.

Additional Improvements & Bugfixes

Reliability: Resolved an issue with partitioning on empty partition sets.
Validation: Added better validation for incorrectly configured Spicepods.
Reliability: Fixed partition_by accelerations when a projection is applied on empty partition sets.
Performance: Ensured ListingTable partitions are pruned when filters are not used.
Performance: Don't download acceleration snapshots if the acceleration is already present.
Performance: Refactored some blocking I/O and synchronization in the async codebase by moving operations to tokio::task::spawn_blocking, replacing blocking locks with async-friendly variants.
Bugfix: Nullable fields are now supported for S3 Vectors index columns.

Contributors

Breaking Changes

No breaking changes.

Cookbook Updates

New Accelerated Snapshots Recipe - The recipe shows how to bootstrap DuckDB accelerations from object storage to skip cold starts.

The Spice Cookbook includes 81 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.8.1, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.8.1 image:

docker pull spiceai/spiceai:1.8.1

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

AWS Marketplace:

🎉 Spice is now available in the AWS Marketplace!

What's Changed

Changelog

Remove println in datafusion by @phillipleblanc in #7461
fix: Ensure ListingTable partitions are pruned when filters are not used by @peasee in #7471
Create runtime-secrets crate by @phillipleblanc in #7474
Create runtime-parameters crate by @phillipleblanc in #7475
Don't download the snapshot if the acceleration is present by @phillipleblanc in #7477
Add support for S3 dataset params by @phillipleblanc in #7476
Add better snapshot validation for incorrectly configured spicepods by @phillipleblanc in #7487
Move blocking/sync I/O to spawn blocking by @lukekim in #7462
Validate spicepod file exists before running tests by @lukekim in #7492
Make snapshot reading/writing more robust with Iceberg-like metadata.json by @phillipleblanc in #7486
Create runtime-request-context crate by @Jeadie in #7459
Two minor fixes for AI udf tests by @krinart in #7503
Add model response timeout for ai udf tests by @krinart in #7504
Add sccache for build test operator by @lukekim in #7515
Fix partition_by accelerations when a projection is applied on empty partition sets by @phillipleblanc in #7526
Nullable fields for index columns by @Jeadie in #7523

Spice v1.8.0 (Oct 6, 2025)

October 7, 2025 · 20 min read

Phillip LeBlanc

Co-Founder and CTO of Spice AI

Announcing the release of Spice v1.8.0! 🧊

Spice v1.8.0 delivers major advances in data writes, scalable vector search, and now in preview—managed acceleration snapshots for fast cold starts. This release introduces write support for Iceberg tables using standard SQL INSERT INTO, partitioned S3 Vector indexes for petabyte-scale vector search, and preview of the AI SQL function for direct LLM integration in SQL. Additional improvements include improved reliability, and the v3.0.3 release of the Spice.js Node.js SDK.

What's New in v1.8.0

Iceberg Table Write Support (Preview)

Append Data to Iceberg Tables with SQL INSERT INTO: Spice now supports writing to Iceberg tables and catalogs using standard SQL INSERT INTO statements. This enables data ingestion, transformation, and pipeline use cases—no Spark or external writer required.

Append-only: Initial version targets appends; no overwrite or delete.
Schema validation: Inserted data must match the target table schema.
Secure by default: Writes are only enabled for datasets or catalogs explicitly marked with access: read_write.

Example Spicepod configuration:

catalogs:
  - from: iceberg:https://glue.ap-northeast-3.amazonaws.com/iceberg/v1/catalogs/111111/namespaces
    name: ice
    access: read_write

datasets:
  - from: iceberg:https://iceberg-catalog-host.com/v1/namespaces/my_namespace/tables/my_table
    name: iceberg_table
    access: read_write

Example SQL usage:

-- Insert from another table
INSERT INTO iceberg_table
SELECT * FROM existing_table;

-- Insert with values
INSERT INTO iceberg_table (id, name, amount)
VALUES (1, 'John', 100.0), (2, 'Jane', 200.0);

-- Insert into catalog table
INSERT INTO ice.sales.transactions
VALUES (1001, '2025-01-15', 299.99, 'completed');

Note: Only Iceberg datasets and catalogs with access: read_write support writes. Internal Spice tables and other connectors remain read-only.

Learn more in the Iceberg Data Connector documentation.

Acceleration Snapshots for Fast Cold Starts (Preview)

Bootstrap Managed Accelerations from Object Storage: Spice now supports managed acceleration snapshots in preview, enabling datasets accelerated with file-based engines (DuckDB or SQLite) to bootstrap from a snapshot stored in object storage (such as S3) if the local acceleration file does not exist on startup. This dramatically reduces cold start times and enables ephemeral storage for accelerations with persistent recovery.

Key features:

Rapid readiness: Datasets can become ready in seconds by downloading a pre-built snapshot, skipping lengthy initial acceleration.
Hive-style partitioning: Snapshots are organized by month, day, and dataset for easy retention and management.
Flexible bootstrapping: Configurable fallback and retry behavior if a snapshot is missing or corrupted.

Example Spicepod configuration:

snapshots:
  enabled: true
  location: s3://some_bucket/some_folder/ # Folder for storing snapshots
  bootstrap_on_failure_behavior: warn # Options: warn, retry, fallback
  params:
    s3_auth: iam_role # All S3 dataset params accepted here

datasets:
  - from: s3://some_bucket/some_table/
    name: some_table
    params:
      file_format: parquet
      s3_auth: iam_role
    acceleration:
      enabled: true
      snapshots: enabled # Options: enabled, disabled, bootstrap_only, create_only
      engine: duckdb
      mode: file
      params:
        duckdb_file: /nvme/some_table.db

How it works:

On startup, if the acceleration file does not exist, Spice checks the snapshot location for the latest snapshot and downloads it.
Snapshots are stored as: s3://some_bucket/some_folder/month=2025-09/day=2025-09-30/dataset=some_table/some_table_<timestamp>.db
If no snapshot is found, a new acceleration file is created as usual.
Snapshots are written after each refresh (unless configured otherwise).

Supported snapshot modes:

enabled: Download and write snapshots.
bootstrap_only: Only download on startup, do not write new snapshots.
create_only: Only write snapshots, do not download on startup.
disabled: No snapshotting.

Note: This feature is only supported for file-based accelerations (DuckDB or SQLite) with dedicated files.

Why use acceleration snapshots?

Faster cold starts: Skip waiting for full acceleration on startup.
Ephemeral storage: Use fast local disks (e.g., NVMe) for acceleration, with persistent recovery from object storage.
Disaster recovery: Recover from federated source outages by bootstrapping from the latest snapshot.

Partitioned S3 Vector Indexes

Efficient, Scalable Vector Search with Partitioning: Spice now supports partitioning Amazon S3 Vector indexes and scatter-gather queries using a partition_by expression in the dataset vector engine configuration. Partitioned indexes enable faster ingestion, lower query latency, and scale to billions of vectors.

Example Spicepod configuration:

datasets:
  - name: reviews
    vectors:
      enabled: true
      engine: s3_vectors
      params:
        s3_vectors_bucket: my-bucket
        s3_vectors_index: base-embeddings
      partition_by:
        - 'bucket(50, PULocationID)'
    columns:
      - name: body
        embeddings:
          from: bedrock_titan
      - name: title
        embeddings:
          from: bedrock_titan

See the Amazon S3 Vectors documentation for details.

AI SQL function for LLM Integration (Preview)

LLMs Directly In SQL: A new asynchronous ai SQL function enables direct calls to LLMs from SQL queries for text generation, translation, classification, and more. This feature is released in preview and supports both default and model-specific invocation.

Example Spicepod model configuration:

models:
  - name: gpt-4o
    from: openai:gpt-4o
    params:
      openai_api_key: ${secrets:openai_key}

Example SQL usage:

-- basic usage with default model
SELECT ai('hi, this prompt is directly from SQL.');

-- basic usage with specified model
SELECT ai('hi, this prompt is directly from SQL.', 'gpt-4o');

-- Using row data as input to the prompt
SELECT ai(concat_ws(' ', 'Categorize the zone', Zone, 'in a single word. Only return the word.')) AS category
FROM taxi_zones
LIMIT 10;

Learn more in the SQL Reference AI documentation.

Remote Endpoint Support for Spice CLI

Run CLI Commands Remotely: The Spice CLI now supports connecting to remote Spice instances, enabling you to run spice sql, spice search, and spice chat commands from your local machine against a remote spiced daemon or to Spice Cloud. Previously, these commands required running on the same machine as the runtime. Now, new flags allow remote execution:

--cloud: Connect to a Spice Cloud instance (requires --api-key).
--endpoint <endpoint>: Connect to a remote Spice instance via HTTP or Arrow Flight SQL (gRPC). Supports http://, https://, grpc://, or grpc+tls:// schemes.

Examples:

# Run SQL queries against a remote Spice instance
spice sql --endpoint http://remote-host:8090

# Use Spice Cloud for chat or search
spice chat --cloud --api-key <your-api-key>
spice search --cloud --api-key <your-api-key>

Supported CLI Commands:

spice sql --cloud / spice sql --endpoint <endpoint>
spice search --cloud / spice search --endpoint <endpoint>
spice chat --cloud / spice chat --endpoint <endpoint>

Additional Flags:

--headers: Pass custom HTTP headers to the remote endpoint.
--tls-root-certificate-file: Specify a root certificate for TLS verification.
--user-agent: Set a custom user agent for requests.

For more details, see the Spice CLI Command Reference.

Spice.js v3.0.3 SDK

Spice.js v3.0.3 Released: The official Spice.ai Node.js/JavaScript SDK has been updated to v3.0.3, bringing cross-platform support, new APIs, and improved reliability for both Node.js and browser environments.

Modern Query Methods: Use sql(), sqlJson(), and nsql() for flexible querying, streaming, and natural language to SQL.
Browser Support: SDK now works in browsers and web applications, automatically selecting the optimal transport (gRPC or HTTP).
Health Checks & Dataset Refresh: Easily monitor Spice runtime health and trigger dataset refreshes on demand.
Automatic HTTP Fallback: If gRPC/Flight is unavailable, the SDK falls back to HTTP automatically.
Migration Guidance: v3 requires Node.js 20+, uses camelCase parameters, and introduces a new package structure.

Example usage:

import { SpiceClient } from '@spiceai/spice'

const client = new SpiceClient(apiKey)
const table = await client.sql('SELECT * FROM my_table LIMIT 10')
console.table(table.toArray())

See Spice.js SDK documentation for full details, migration tips, and advanced usage.

Additional Improvements

Reliability: Improved logging, error handling, and network readiness checks across connectors (Iceberg, Databricks, etc.).
Vector search durability and scale: Refined logging, stricter default limits, safeguards against index-only scans and duplicate results, and always-accessible metadata for robust queryability at scale.
Cache behavior: Tightened cache logic for modification queries.
Full-Text Search: FTS metadata columns now usable in projections; max search results increased to 1000.
RRF Hybrid Search: Reciprocal Rank Fusion (RRF) UDTF enhancements for advanced hybrid search scenarios.

Contributors

Breaking Changes

This release introduces two breaking changes associated with the search observability and tooling.

Firstly, the document_similarity tool has been renamed to search. This has the equivalent change to tracing of these tool calls:

## Old: v1.7.1
>> spice trace tool_use::document_similarity
>> curl -XPOST http://localhost:8090/v1/tools/document_similarity \
  -d '{
    "datasets": ["my_tbl"],
    "text": "Welcome to another Spice release"
  }'

## New: v1.8.0
>> spice trace tool_use::search
>> curl -XPOST http://localhost:8090/v1/tools/search \
  -d '{
    "datasets": ["my_tbl"],
    "text": "Welcome to another Spice release"
  }'

Secondly, the vector_search task in runtime.task_history has been renamed to search.

Cookbook Updates

Added new AI SQL function recipe for invoking LLMs within SQL queries.
Updated Iceberg Catalog Connector recipe for Iceberg Writes.
Updated Spice.js JavaScript (Node.js) SDK for v3.0.3 with examples and v2 to v3 migration guide.

The Spice Cookbook now includes 80 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.8.0, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.8.0 image:

docker pull spiceai/spiceai:1.8.0

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

AWS Marketplace:

🎉 Spice is now available in the AWS Marketplace!

What's Changed

Dependencies

iceberg-rust: Upgraded to v0.7.0-rc.1
mimalloc: Upgraded from 0.1.47 to 0.1.48
azure_core: Upgraded from 0.27.0 to 0.28.0
Jimver/cuda-toolkit: Upgraded from 0.2.27 to 0.2.28

Changelog

Add #[cfg(feature = "postgres")] to acceleration refresh tests by @Jeadie in #7241
fix: Update benchmark snapshots by @github-actions[bot] in #7267
fix: Update benchmark snapshots by @github-actions[bot] in #7268
fix: Update benchmark snapshots by @github-actions[bot] in #7269
Update the tpch benchmark snapshots for: federated/databricks[sql_warehouse].yaml by @github-actions[bot] in #7270
EmbeddingInput cache keys to include model name by @mach-kernel in #7275
ensure FTS metadata columns can be used in projection by @Jeadie in #7282
Use 8-core runners for Windows CUDA builds by @sgrebnov in #7284
Make search test more robust by @krinart in #7283
Post-release housekeeping by @sgrebnov in #7272
fix: Use median cached response duration for test search cache by @peasee in #7286
Bump dirs from 5.0.1 to 6.0.0 by @dependabot[bot] in #7244
Bump indexmap from 2.11.0 to 2.11.4 by @dependabot[bot] in #7248
Fix JOIN level filters not having columns in schema by @Jeadie in #7287
use SessionContext::new_empty in RRF by @kczimm in #7291
Use rust:1.89-slim-bookworm for build, more places to bump rust version by @sgrebnov in #7293
Update openapi.json by @github-actions[bot] in #7290
Enable chunking in SearchIndex by @Jeadie in #7143
Add index name and remove duplicate records string to S3 Vectors log by @lukekim in #7260
Use file-based fts index by @Jeadie in #7024
Remove 'PostApplyCandidateGeneration' by @Jeadie in #7288
RRF: Rank and recency boosting by @mach-kernel in #7294
Update ROADMAP.md by removing v1.7 milestone by @sgrebnov in #7297
RRF: Preserve base ranking when results differ -> FULL OUTER JOIN does not produce time column by @mach-kernel in #7300
chore: remove unused Dataset methods by @kczimm in #7295
fix removing embedding column by @Jeadie in #7302
fix: Add feature flag for using object store in spicepod by @peasee in #7303
Upgrade to iceberg-rust v0.7.0-rc1 by @sgrebnov in #7296
Enable DML Update SQL operations for datasets configured as access: read_write by @sgrebnov in #7304
Create and parse partitioned S3 vector index names by @kczimm in #7198
RRF: Fix decay for disjoint result sets by @mach-kernel in #7305
RRF: Project top scores, do not yield duplicate results by @mach-kernel in #7306
RRF: Case sensitive column/ident handling by @mach-kernel in #7309 in #7309
For vector_search, use a default limit of 1000 if no limit specified by @lukekim in #7311
Don’t cache modification queries (DDL, DML, COPY) by @sgrebnov in #7316
Fix Anthropic model regex and add validation tests by @ewgenius in #7319
Enhancement: Implement before/after/lag metrics for acceleration refresh by @krinart in #7310
Refactor chat model health check to lower tokens usage for reasoning models by @ewgenius in #7317
Add support for writing into Iceberg tables by @sgrebnov in #7315
Fix lint warnings by @lukekim in #7327
Use logical plan in SearchQueryProvider by @Jeadie in #7314
FTS max search results 100 -> 1000 by @Jeadie in #7331
Improve Databricks SQL Warehouse Error Handling by @sgrebnov in #7332
Use spicepod embedding model name for model_name() by @Jeadie in #7333
Handle async queries for Databricks SQL Warehouse API by @phillipleblanc in #7335
Enable DML (INSERT INTO) operations for catalogs configured as access:read_write by @sgrebnov in #7330
Bump regex from 1.11.2 to 1.11.3 by @dependabot[bot] in #7336
Update qa_analytics.csv with 1.7.0 release data by @sgrebnov in #7337
RRF: Fix ident resolution for struct fields, autohashed join key for varying types by @mach-kernel in #7339
v1.7.1 release notes by @kczimm in #7348
Bump Jimver/cuda-toolkit from 0.2.27 to 0.2.28 by @dependabot[bot] in #7343
Add support for writing into Glue (Iceberg) tables and catalogs by @sgrebnov in #7355
Bump mimalloc from 0.1.47 to 0.1.48 by @dependabot[bot] in #7342
Add ai async UDF by @lukekim in #7328
Use self-hosted and spiceai-macos runners for workflows where possible by @lukekim in #7371
Several updates for improved search testing by @Jeadie in #7358
Update supported versions in SECURITY.md by @Jeadie in #7377
1.7.1 release analytics by @mach-kernel in #7380
Add acceleration_file_path helper and refactor spice_sys to use Snafu errors by @phillipleblanc in #7376 in #7376
fix: Update benchmark snapshots by @github-actions[bot] in #7353
Robust search test by @Jeadie in #7381
[bug] Fix ai UDF bug of mismatched column length by @lukekim in #7383
Add OpenOption to spice_sys acceleration tables by @phillipleblanc in #7379
Add new snapshots Spicepod configuration by @phillipleblanc in #7384
Update naming of tool_use::document_similarity and vector_search spans by @Jeadie in #7273
fix: Update benchmark snapshots by @github-actions[bot] in #7354
Make ai UDF a models only feature by @lukekim in #7387
Add new runtime_acceleration crate; create SnapshotManager; implement SnapshotManager::download_latest_snapshot by @phillipleblanc in #7386
Refactor 'VectorScanTableProvider' to use just 'VectorIndex::list_table_provider' by @Jeadie in #7318
Fix embed logs by @Jeadie in #7382
Enable spicepod dependencies in testoperator by @Jeadie in #7334
ai UDF security and performance optimizations by @lukekim in #7392
Wire up the snapshot download on dataset startup by @phillipleblanc in #7389
Implement initial snapshot creation logic in SnapshotManager by @phillipleblanc in #7391
Make tool_use::table_schema output model-friendly by @krinart in #7393
Fix minor lint warnings by @lukekim in #7395
Enable metadata columns in document-based object store datasets by @Jeadie in #7397
Core dependencies of financebench by @Jeadie in #7400
Add S3vector variant to financebench by @Jeadie in #7399
Set PostgreSQL unsupported_spice_action=string by default by @lukekim in #7398
Use non-blocking connection check for verify_ns_lookup_and_tcp_connect by @phillipleblanc in #7401
Bump moka from 0.12.10 to 0.12.11 by @dependabot[bot] in #7340
Bump tokio-postgres from 0.7.13 to 0.7.14 by @dependabot[bot] in #7344
Bump azure_core from 0.27.0 to 0.28.0 by @dependabot[bot] in #7338
Forbid INSERT OVERWRITE DML operations by @sgrebnov in #7402
Make database connection pool sizes consistent by @lukekim in #7403
Disable vector index only scans by @Jeadie in #7405
Make CLI --endpoint and --cloud args & table output consistent by @lukekim in #7396
Write new snapshots at the end of an accelerated refresh by @phillipleblanc in #7410
Read and write partitioned S3 indexes by @kczimm in #7313
Fix partial data writes in Iceberg data connector by @sgrebnov in #7411
Remove nix by @phillipleblanc in #7414
Use DataFusion JoinSetTracer for async context propagation by @lukekim in #7416
Implement cache invalidation for DML (INSERT INTO) operations by @sgrebnov in #7394
Make cleanup disk GH action; use in integration tests by @Jeadie in #7418
Move S3Vector to 'search' crate by @Jeadie in #7373
Use LogicalPlan builder API for LogicalPlans by @Jeadie in #7408
Use hive-style partitioned paths for DB snapshots by @phillipleblanc in #7422
Limit results from SearchIndex::query_table_provider by @Jeadie in #7421
Delay initial readiness if snapshots are enabled with an append-mode refresh by @phillipleblanc in #7425
Disable snapshots by default by @phillipleblanc in #7426
Rewrite ChunkedNonIndexVectorGeneration to use LogicalPlanBuilder (instead of string formatting) by @Jeadie in #7413
Fix for search field as metadata for chunked search indexes by @Jeadie in #7429
Add feature is currently in preview warning for read_write access mode by @sgrebnov in #7440
Add feature is currently in preview warning for snapshots by @sgrebnov in #7442
Fix tracing so that ai_completions are parented under sql_query by @lukekim in #7415
Disable acceleration refresh metrics by @krinart in #7450
Enable snapshot acceleration by default by @phillipleblanc in #7451
fix: partition name validation by @kczimm in #7452

Spice v1.7.1 (Sep 29, 2025)

September 30, 2025 · 6 min read

Kevin Zimmerman

Principal Software Engineer at Spice AI

Announcing the release of Spice v1.7.1! 🔍

Spice v1.7.1 is a patch release focused on search improvements, bug fixes, and performance enhancements. This release introduces the Reciprocal Rank Fusion (RRF) user-defined table function (UDTF) for hybrid search, improves vector and text search reliability, and resolves several issues across the runtime, connectors, and query engine.

What's New in v1.7.1

Reciprocal Rank Fusion (RRF) UDTF: Spice now supports Reciprocal Rank Fusion (RRF) as a user-defined table function, enabling advanced hybrid search scenarios that combine results from multiple search methods (e.g., vector and text search) for improved relevance ranking.

Features:

Multi-search fusion: Combine results from vector_search, text_search, and other search UDTFs in a single query.
Advanced tuning: Per-query ranking weights, recency boosting, and configurable decay functions.
Performance: Optional user-specified join key for optimal performance.
Automatic joining: Falls back to on-the-fly JOIN key computation when no explicit key is provided.

Example usage:

SELECT id, title, content, fused_score
FROM rrf(
  vector_search(documents, 'machine learning algorithms', rank_weight => 1.5),
  text_search(documents, 'neural networks deep learning', rank_weight => 1.2),
  join_key => 'id',    -- optional join key for optimal performance
  k => 60.0            -- optional smoothing factor
)
WHERE fused_score > 0.01
ORDER BY fused_score DESC;

Learn more in the RRF documentation.

Acceleration Refresh Metrics: Spice now exposes additional Prometheus metrics that provide detailed observability into dataset acceleration refreshes. These metrics help monitor data freshness and ingestion lag for accelerated datasets with a time column.

Reported metrics:

Metric Name	Description
`dataset_acceleration_max_timestamp_before_refresh_ms`	Maximum value of the dataset's time column before refresh (milliseconds).
`dataset_acceleration_max_timestamp_after_refresh_ms`	Maximum value of the dataset's time column after refresh (milliseconds).
`dataset_acceleration_refresh_lag_ms`	Difference between max timestamp after and before refresh (milliseconds).
`dataset_acceleration_ingestion_lag_ms`	Lag between current wall-clock time and max timestamp after refresh (milliseconds).

These metrics are emitted during each acceleration refresh and can be scraped by Prometheus for monitoring and alerting. For more details, see the Observability documentation.

Bug Fixes & Improvements

This release resolves several issues and improves reliability across search, connectors, and query planning:

Full-Text Search (FTS): Ensure FTS metadata columns can be used in projection, fix JOIN-level filters not having columns in schema, and adds support for persistent file-based FTS indexes. Default limit of 1000 results if no limit specified.
Vector Search: Default limit of 1000 results if no limit specified, and fix removing embedding column.
Databricks SQL Warehouse: Improved error handling and support for async queries.
Other: Fixes for Anthropic model regex validation, tweaked AI-model health checks, and improved error messages.

Contributors

Breaking Changes

No breaking changes.

Cookbook Updates

Added Hybrid-Search using RRF - Combine results from multiple search methods (vector and text search) using Reciprocal Rank Fusion for improved relevance ranking.

The Spice Cookbook includes 78 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.7.1, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.7.1 image:

docker pull spiceai/spiceai:1.7.1

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

AWS Marketplace:

🎉 Spice is now available in the AWS Marketplace!

What's Changed

Changelog

ensure FTS metadata columns can be used in projection (#7282) by @Jeadie in #7282
Fix JOIN level filters not having columns in schema (#7287) by @Jeadie in #7287
Use file-based fts index (#7024) by @Jeadie in #7024
Remove 'PostApplyCandidateGeneration' (#7288) by @Jeadie in #7288
RRF: Rank and recency boosting (#7294) by @mach-kernel in #7294
RRF: Preserve base ranking when results differ -> FULL OUTER JOIN does not produce time column (#7300) by @mach-kernel in #7300
fix removing embedding column (#7302) by @Jeadie in #7302
RRF: Fix decay for disjoint result sets (#7305) by @mach-kernel in #7305
RRF: Project top scores, do not yield duplicate results (#7306) by @mach-kernel in #7306
RRF: Case sensitive column/ident handling (#7309) by @mach-kernel in #7309
For vector_search, use a default limit of 1000 if no limit specified (#7311) by @lukekim in #7311
Fix Anthropic model regex and add validation tests (#7319) by @ewgenius in #7319
Enhancement: Implement before/after/lag metrics for acceleration refresh (#7310) by @krinart in #7310
Refactor chat model health check to lower tokens usage for reasoning models (#7317) by @ewgenius in #7317
Enable chunking in SearchIndex (#7143) by @Jeadie in #7143
Use logical plan in SearchQueryProvider. (#7314) by @Jeadie in #7314
FTS max search results 100 -> 1000 (#7331) by @Jeadie in #7331
Improve Databricks SQL Warehouse Error Handling (#7332) by @sgrebnov in #7332
use spicepod embedding model name for 'model_name' (#7333) by @Jeadie in #7333
Handle async queries for Databricks SQL Warehouse API (#7335) by @phillipleblanc in #7335
RRF: Fix ident resolution for struct fields, autohashed join key for varying types (#7339) by @mach-kernel in #7339

Spice v1.7.0 (Sep 23, 2025)

September 23, 2025 · 21 min read

Sergei Grebnov

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.7.0! ⚡

Spice v1.7.0 upgrades to DataFusion v49 for improved performance and query optimization, introduces real-time full-text search indexing for CDC streams, EmbeddingGemma support for high-quality embeddings, new search table functions powering the /v1/search API, embedding request caching for faster and cost-efficient search and indexing, and OpenAI Responses API tool calls with streaming. This release also includes numerous bug fixes across CDC streams, vector search, the Kafka Data Connector, and error reporting.

What's New in v1.7.0

DataFusion v49 Highlights

DataFusion Clickbench Performance Graph Source: DataFusion 49.0.0 Release Blog.

Performance Improvements 🚀

Equivalence System Upgrade: Faster planning for queries with many columns, enabling more sophisticated sort-based optimizations.
Dynamic Filters & TopK Pushdown: Queries with ORDER BY and LIMIT now use dynamic filters and physical filter pushdown, skipping unnecessary data reads for much faster top-k queries.
Compressed Spill Files: Intermediate files written during sort/group spill to disk are now compressed, reducing disk usage and improving performance.
WITHIN GROUP for Ordered-Set Aggregates: Support for ordered-set aggregate functions (e.g., percentile_disc) with WITHIN GROUP.
REGEXP_INSTR Function: Find regex match positions in strings.

Spice Runtime Highlights

EmbeddingGemma Support: Spice now supports EmbeddingGemma, Google's state-of-the-art embedding model for text and documents. EmbeddingGemma provides high-quality, efficient embeddings for semantic search, retrieval, and recommendation tasks. You can use EmbeddingGemma via HuggingFace in your Spicepod configuration:

Example spicepod.yml snippet:

embeddings:
  - from: huggingface:huggingface.co/google/embeddinggemma-300m
    name: embeddinggemma
    params:
      hf_token: ${secrets:HUGGINGFACE_TOKEN}

Learn more about EmbeddingGemma in the official documentation.

POST /v1/search API Use Search Table Functions: The /v1/search API now uses the new text_search and vector_search Table Functions for improved performance.

Embedding Request Caching: The runtime now supports caching embedding requests, reducing latency and cost for repeated content and search requests.

Example spicepod.yml snippet:

runtime:
  caching:
    embeddings:
      enabled: true
      max_size: 128mb
      item_ttl: 5s

See the Caching documentation for details.

Real-Time Indexing for Full Text Search: Full Text search indexing is now supported for connectors that enable real-time changes, such as Debezium CDC streams. Adding a full-text index on a column with refresh_mode: changes works as it does for full/append-mode refreshes, enabling instant search on new data.

Example spicepod.yml snippet:

datasets:
  - from: debezium:cdc.public.question
    name: questions
    acceleration:
      enabled: true
      engine: duckdb
      primary_key: id
      refresh_mode: changes # Use 'changes'
    params: *kafka_params
    columns:
      - name: title
        full_text_search:
          enabled: true # Enable full-text-search indexing
          row_id:
            - id

OpenAI Responses API Tool Calls with Streaming: The OpenAI Responses API now supports tool calls with streaming, enabling advanced model interactions such as web_search and code_interpreter with real-time response streaming. This allows you to invoke OpenAI-hosted tools and receive results as they are generated.

Learn more in the OpenAI Model Provider documentation.

Runtime Output Level Configuration: You can now set the output_level parameter in the Spicepod runtime configuration to control logging verbosity in addition to the existing CLI and environment variable support. Supported values are info, verbose, and very_verbose. The value is applied in the following priority: CLI, environment variables, then YAML configuration.

Example spicepod.yml snippet:

runtime:
  output_level: info # or verbose, very_verbose

For more details on configuring output level, see the Troubleshooting documentation.

Bug Fixes

Several bugs and issues have been resolved in this release, including:

CDC Streams: Fixed issues where refresh_mode: changes could prevent the Spice runtime from becoming Ready, and improved support for full-text indexing on CDC streams.
Vector Search: Fixed bugs where vector search HTTP pipeline could not find more than one IndexedTableProvider, and resolved errors with field mismatches in vector_search UDTF.
Kafka Integration: Improved Kafka schema inference with configurable sample size, improved consumer group persistence for SQLite and Postgres accelerations, and added cooperative mode support.
Perplexity Web Search: Fixed bug where Perplexity web search sometimes used incorrect query schema (limit).
Databricks: Fixed issue with unparsing embedded columns.
Error Reporting: ThrottlingException is now reported correctly instead of as InternalError.
Iceberg Data Connector: Added support for LIMIT pushdown.
Amazon S3 Vectors: Fixed ingestion issues with zero-vectors and improved handling when vector index is full.
Tracing: Fixed vector search tracing to correctly report SQL status.

Contributors

New Contributors

@ChrisTomAlxHitachi made their first contribution in github.com/spiceai/spiceai/pull/6932 🎉

Breaking Changes

No breaking changes.

Cookbook Updates

New Spice with Dotnet SDK Recipe - The recipe shows how to query Spice using the Dotnet SDK.

The Spice Cookbook includes 78 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.7.0, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.7.0 image:

docker pull spiceai/spiceai:1.7.0

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

AWS Marketplace:

🎉 Spice is now available in the AWS Marketplace!

What's Changed

Dependencies

Rust: Upgraded from 1.88.0 to 1.89.0
DataFusion: Upgraded from 48.0.1 to 49.0.0
text-embeddings-inference: Upgraded from 1.7.3 to 1.8.2
twox-hash: Upgraded from 1.6.3 to 2.1.0.

Changelog

Fix parameterised query planning in DataFusion by @Jeadie in #6942
fix: Update benchmark snapshots by @app/github-actions in #6944
refactor: Decouple full text search candidate from UDTF by @peasee in #6940
fix: Re-enable search integration tests by @peasee in #6930
Update acknowledgements and spicepod.schema.json by @sgrebnov in #6948
Add enabling the responses API by @lukekim in #6949
Post-release housekeeping by @sgrebnov in #6951
Add missing param in release notes by @Advayp in #6959
Create comprehensive S3vectors test by @Jeadie in #6903
Update ROADMAP after v1.6 release by @sgrebnov in #6955
Update openapi.json by @app/github-actions in #6961
Add build step for new spiced images in end game template by @Jeadie in #6960
refactor: Use text search UDTF in v1/search by @peasee in #6962
Bump Jimver/cuda-toolkit from 0.2.26 to 0.2.27 by @app/dependabot in #6922
Bump notify from 8.0.0 to 8.2.0 by @app/dependabot in #6924
Use model2vec for search integration tests for speed by @Jeadie in #6971
feat: Add initial DuckDB regexp pushdown support by @peasee in #6966
Bump rustyline from 16.0.0 to 17.0.1 by @app/dependabot in #6976
Upgrade delta_kernel to 0.14 by @phillipleblanc in #6977
Consistent snapshots for mongodb by @krinart in #6974
Bump indexmap from 2.10.0 to 2.11.0 by @app/dependabot in #6921
Fix mongo tests: ignore container_registry() when building image name by @krinart in #6983
Implement support for s3 tables for glue DataConnector by @krinart in #6981
Bump serde_json from 1.0.142 to 1.0.143 by @app/dependabot in #6925
Update build_and_release macOS pipeline to skip updating cmake if installed by @phillipleblanc in #6998
Mark Kafka Data Connector Alpha quality by @sgrebnov in #6991
Add v1.6.1 release notes by @lukekim in #7000
Spice CLI trace: make error friendlier when task_history is disabled by @sgrebnov in #6996
Warn when runtime or management is added in spicepod dependency by @Jeadie in #6953
Enable .datasets[].vectors.params.s3_vectors_distance_metric for S3 Vectors by @Jeadie in #6982
Add s3_vectors index support for CDC and Append streams by @sgrebnov in #6986
Find all vector indexes in v1/search by @Jeadie in #7004
Fix RRF; reorder by score by @Jeadie in #7007
Fix for nested VectorScanTableProvider by @krinart in #7017
Add --sql flag to output SQL query for spice trace by @Jeadie in #7002
Make web search params engine-specific by @Advayp in #7022
Add more MTEB benchmark spicepods by @peasee in #7026
Improve error messaging in tools by @Jeadie in #6895
Add retry for exporting task history records by @sgrebnov in #7049
Increase DoPut write timeout for the next batch from 30 to 120 seconds by @sgrebnov in #7054
Avoid redundant search embedding by @peasee in #7053
Truncate text_embed task_history trace by @sgrebnov in #7050
Use the UTC offset for the start_time and end_time fields in the task history by @ewgenius in #7056
Update supported versions in SECURITY.md by @Jeadie in #7060
Add integration test for Kafka S3 Vectors by @sgrebnov in #6988
Enable parameters to enforce the value is one of several options by @Jeadie in #6984
feat(iceberg): lakekeeper catalog - add warehouse param to spicepod by @ChrisTomAlxHitachi in #6932
feat: Add HTTP query concurrency support to testoperator by @peasee in #7025
Ensure no data does not throw error in v1/search by @Jeadie in #7033
Bump github.com/spf13/cobra from 1.9.1 to 1.10.1 by @app/dependabot in #7013
Add QA analytics for 1.6.x releases by @sgrebnov in #7082
Use env variable for HF cache in model2vec by @Jeadie in #7076
chore: upgrade to Rust 1.88 by @kczimm in #7077
Kafka/Debezium: make common errors user-friendlier by @sgrebnov in #7084
Create Apache Datafusion upgrade issue template by @kczimm in #6800
No join predicate pushdown on empty results by @Jeadie in #7075
Bump tract-onnx from 0.21.10 to 0.22.0 by @app/dependabot in #7071
Bump mongodb from 3.2.4 to 3.3.0 by @app/dependabot in #7073
Bump indicatif from 0.17.11 to 0.18.0 by @app/dependabot in #7070
Bump actions/github-script from 7 to 8 by @app/dependabot in #7069
Bump actions/setup-go from 5 to 6 by @app/dependabot in #7068
Bump actions/download-artifact from 4 to 5 by @app/dependabot in #7066
Bedrock: Tool use without inputs must empty Document by @Jeadie in #7036
Bump github.com/stretchr/testify from 1.10.0 to 1.11.1 by @app/dependabot in #7015
Bump actions/setup-python from 5 to 6 by @app/dependabot in #7067
Upgrade dependabot dependencies by @phillipleblanc in #7061
Bump tempfile from 3.20.0 to 3.21.0 by @app/dependabot in #7018
Only call 'list_datasets' once, after initial system/user messages by @Jeadie in #7039
Bump github.com/spf13/pflag from 1.0.7 to 1.0.10 by @app/dependabot in #7062
Bump actions/checkout from 4 to 5 by @app/dependabot in #7065
Bump golang.org/x/mod from 0.27.0 to 0.28.0 by @app/dependabot in #7064
Bump github.com/AzureAD/microsoft-authentication-library-for-go from 1.4.1 to 1.5.0 by @app/dependabot in #7063
Add friendly message for Kafka operation timeout error, improve code by @sgrebnov in #7088
embed UDF by @mach-kernel in #6967
fix: Update benchmark snapshots by @app/github-actions in #7097
Fix SF100 benchmark tests dispatch by @sgrebnov in #7098
chore(logging): add log when iceberg rest catalog fails with ssl cert error by @ChrisTomAlxHitachi in #6909
Add xxhash support for search/sql results by @krinart in #6978
Use proper federation in max_timestamp_df during acceleration refresh by @krinart in #7055
Fix spiced_docker workflows for new actions/download-artifact@v5 behavior by @phillipleblanc in #7108
Fix spiced_docker workflow by @phillipleblanc in #7111
Add filter for zero vectors before writing to S3 Vectors by @phillipleblanc in #7110
Ensure we find vector index when it also has text search by @Jeadie in #7120
Enable unified traceparent override support for HTTP API by @sgrebnov in #7122
Fix ORDER BY: (BytesProcessedExec to avoid pruning ordered execs during physical optimization) by @mach-kernel in #7105
Fix spiced_docker_nightly workflow by @sgrebnov in #7125
Add output_level to runtime config by @krinart in #7119
Add tests for xxhash hashers by @krinart in #7124
Add input option to update snapshots in Integration tests by @Jeadie in #7127
Fix formatting to improve merges by @lukekim in #7128
Add tests to nulling logic by @Jeadie in #7113
Bump chrono from 0.4.41 to 0.4.42 by @app/dependabot in #7131
Bump ctrlc from 3.4.7 to 3.5.0 by @app/dependabot in #7132
Search: RRF UDTF by @mach-kernel in #7090
Update openapi.json by @app/github-actions in #7141
Bump packages to DF49; resolve incompatibilities by @Jeadie in #7101
fix: Don't error for chunked columns when vectors are disabled by @peasee in #7150
Allow bzip2-1.0.6 license in deny.toml by @Jeadie in #7148
Tune retry settings for Kafka/Debezium connectors by @sgrebnov in #7142
Update TEI by @Jeadie in #7152
Use twox-hash version 2.1.2 by @krinart in #7165
Revert "Use proper federation in max_timestamp_df during acceleration refresh (#7055)" by @phillipleblanc in #7156
Bump octocrab from 0.44.1 to 0.45.0 by @app/dependabot in #7158
Bump github.com/spf13/viper from 1.19.0 to 1.21.0 by @app/dependabot in #7130
Bump keyring from 3.6.2 to 3.6.3 by @app/dependabot in #7157
fix: Remove keywords from AI document search by @peasee in #7052
Bump tract-core from 0.21.10 to 0.22.0 by @app/dependabot in #7134
Update TEI by @Jeadie in #7171
Update openapi.json by @app/github-actions in #7172
fix: Ensure vector search UDTF respects the supplied projection by @peasee in #7155
Bump clap from 4.5.45 to 4.5.47 by @app/dependabot in #7135
Bump golang.org/x/sys from 0.35.0 to 0.36.0 by @app/dependabot in #7129
Include 'catalog_id' in Glue catalog parameters by @Jeadie in #7151
fix: Use head ref from merge group event in pulls-with-spice concurrency group by @peasee in #7175
Fix lint for xxhash feature by @phillipleblanc in #7176
Add Kafka-specific metrics for consumer lag and consumed records by @sgrebnov in #7146
Kafka: persist consumer between restarts with SQLite and PG acceleration by @sgrebnov in #7177
Kafka: support specifying a target consumer group ID by @sgrebnov in #7178
Fix timestamp parsing for spice trace by @krinart in #7173
Support full-text indexing on CDC/append streams by @phillipleblanc in #7180
Bump iceberg-rust version to include limit push down by @krinart in #7191
Make full text stream connector more robust by @phillipleblanc in #7193
fix: Update benchmark snapshots by @app/github-actions in #7179
Initial changes for SearchIndex by @Jeadie in #7103
Robustly handle indexing FTS for CDC streams by @phillipleblanc in #7197
Proper handling/mapping for ThrottlingException during embedding calls by @krinart in #7170
Add spicepod.yml by @lukekim in #7202
Delta Lake: Support read pruning on timestamp columns using maxValues stats by @sgrebnov in #7203
feat: Add initial embeddings cache by @peasee in #7194
Make S3vector a FixedSizeListArray by @Jeadie in #7201
Fix projection mismatch issues with RRF calling vector search / text search by @mach-kernel in #7200
feat: Add embeddings cache to all embeddings by @peasee in #7204
Revert "Make S3vector a FixedSizeListArray (#7201)" by @kczimm in #7210
Update duckdb version to make ICU statically linked by default by @krinart in #7215
Change DataType list nullability from true to false by @Jeadie in #7216
Use Instant + saturating_sub to handle time drift by @krinart in #7212
Flatten 'IndexedTableProvider' when adding full-text support by @Jeadie in #7219
Include comments in pulls by @lukekim in #7224
Add github_max_concurrent_connections = 5 by @lukekim in #7225
RRF: Fix scoring by @mach-kernel in #7226
Update RRF search integration snapshots after scoring change by @mach-kernel in #7227
Make S3vector a FixedSizeListArray by @Jeadie in #7230
Proper federation during acceleration refresh + datafusion version bump + integration tests by @krinart in #7228
Use DuckDBDialect for DuckDB non-federated queries by @krinart in #7232
Move chunking out of llms and into new crate chunking by @Jeadie in #7229
Remove duplicate pg_port configuration in test by @lukekim in #7233
Upgrade to Rust 1.89 by @phillipleblanc in #7235
Catalog connection error: fix connector name from 'iceberg' to 'spice.ai' by @sgrebnov in #7240
Create PutVectorsSink by @kczimm in #7199
Benchmark tests: fix API key reference in spicecloud catalog by @sgrebnov in #7239
Add Dotnet SDK sample to end game template by @sgrebnov in #7238
Update spicepod.schema.json by @app/github-actions in #7254
Postgres: Improve Decimals read performance and add Name type support by @sgrebnov in #7255
Add tests for hybrid search on a vector engine by @Jeadie in #7220

Spice v1.6.1 (Sep 1, 2025)

September 2, 2025 · 3 min read

Jack Eadie

Token Plumber at Spice AI

Announcing the release of Spice v1.6.1! ⚡

Spice 1.6.1 is a patch release that provides improved Kafka type inference and JSON flattening support, alongside several bug fixes.

What's New in v1.6.1

Improved Kafka Type Inference: Improve Kafka type inference by configuring the number of Kafka messages sampled during schema inference. Increasing the sample size can improve the robustness and reliability of inferred schemas, especially in cases where data contains optional fields or varying structures.

Example spicepod.yml:

dataset:
  - from: kafka:orders_events
    name: orders
    params:
      schema_infer_max_records: 100 # Default 1.

For details, see the Kafka Data Connector Documentation.

Improved Kafka JSON Support: Enable nested JSON Kafka messages to be represented in flattened JSON format for the dataset schema.

Example spicepod.yml:

dataset:
  - from: kafka:orders_events
    name: orders
    params:
      flatten_json: true # default false

For example, the object:

{
  "order_id": "a1f2c3d4-1111-2222-3333-444455556666",
  "customer": {
    "id": 101,
    "name": "Alice",
    "premium": true,
    "contact": {
      "email": "[email protected]",
      "phone": "555-1234"
    }
  },
  "discount": 5.0,
  "shipped": false
}

With flatten_json: true the result is:

+------------------------+-----------+-------------+
| column_name            | data_type | is_nullable |
+------------------------+-----------+-------------+
| order_id               | Utf8      | YES         |
| customer.id            | Int64     | YES         |
| customer.name          | Utf8      | YES         |
| customer.premium       | Boolean   | YES         |
| customer.contact.email | Utf8      | YES         |
| customer.contact.phone | Utf8      | YES         |
| discount               | Float64   | YES         |
| shipped                | Boolean   | YES         |
+------------------------+-----------+-------------+

With flatten_json: false or ommitted the result is:

+-------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------+
| column_name | data_type                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | is_nullable |
+-------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------+
| order_id    | Utf8                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | YES         |
| customer    | Struct([Field { name: "id", data_type: Int64, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }, Field { name: "name", data_type: Utf8, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }, Field { name: "premium", data_type: Boolean, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }, Field { name: "contact", data_type: Struct([Field { name: "email", data_type: Utf8, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }, Field { name: "phone", data_type: Utf8, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }]), nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }]) | YES         |
| discount    | Float64                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | YES         |
| shipped     | Boolean                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | YES         |
+-------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------+

For details, see the Kafka Data Connector Documentation.

Contributors

Breaking Changes

No breaking changes.

Cookbook Updates

No new cookbook recipes added in this release.

The Spice Cookbook includes 77 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.6.1, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.6.1 image:

docker pull spiceai/spiceai:1.6.1

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

AWS Marketplace:

🎉 Spice is now available in the AWS Marketplace!

What's Changed

Changelog

Fix metadata field issue by @Advayp in #6957
Update datafusion and datafusion-table-providers crates (#6985) by @Jeadie in #6985
Add flatten_json param support for Kafka connector (#6976) by @sgrebnov in #6976
Add schema_inference_sample_count param support for Kafka connector (#6969) by @sgrebnov in #6969
Add integration test for Kafka connector (#6965) by @sgrebnov in #6965
Skip dataset health check for IcebergTableProvider datasets by @phillipleblanc in #6995

What's New in v1.8.1​

Acceleration Snapshot Indexes​

Improved Regular Expression for DuckDB acceleration​

Additional Improvements & Bugfixes​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​

What's New in v1.8.0​

Iceberg Table Write Support (Preview)​

Acceleration Snapshots for Fast Cold Starts (Preview)​

Partitioned S3 Vector Indexes​

AI SQL function for LLM Integration (Preview)​

Remote Endpoint Support for Spice CLI​

Spice.js v3.0.3 SDK​

Additional Improvements​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Dependencies​

Changelog​

What's New in v1.7.1​

Bug Fixes & Improvements​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​

What's New in v1.7.0​

DataFusion v49 Highlights​

Spice Runtime Highlights​

Bug Fixes​

Contributors​

New Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Dependencies​

Changelog​

What's New in v1.6.1​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​

What's New in v1.8.1

Acceleration Snapshot Indexes

Improved Regular Expression for DuckDB acceleration

Additional Improvements & Bugfixes

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Changelog

What's New in v1.8.0

Iceberg Table Write Support (Preview)

Acceleration Snapshots for Fast Cold Starts (Preview)

Partitioned S3 Vector Indexes

AI SQL function for LLM Integration (Preview)

Remote Endpoint Support for Spice CLI

Spice.js v3.0.3 SDK

Additional Improvements

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Dependencies

Changelog

What's New in v1.7.1

Bug Fixes & Improvements

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Changelog

What's New in v1.7.0

DataFusion v49 Highlights

Spice Runtime Highlights

Bug Fixes

Contributors

New Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Dependencies

Changelog

What's New in v1.6.1

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Changelog