VAST DataBase

Although I work for VAST Data, these notes are my own personal notes and are not authoritative. They may be wrong.

VAST DataBase is a capability built into the VAST Element Store to represent structured (tabular) data in a way that leverages the strengths of VAST’s pools of storage-class memory (SCM) and flash. It combines row‑based transactions and column‑based analytics by writing new rows into SCM and, once enough data accumulates, transposing those rows into small (~32 KB) columnar chunks as they are written down to flash.

Queries can read from the write buffers and from the columnar chunks, so the system delivers ACID transaction semantics while still providing column‑store performance.
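To make the write path above concrete, here is a toy sketch in Python. It is not VAST's implementation: the class name `ToyTable`, the `flush_rows` threshold (counted in rows rather than bytes), and the in-memory lists standing in for SCM and flash are all assumptions made for illustration. It only shows the general idea of buffering rows, transposing them into columnar chunks, and letting a scan read from both places.

```python
from typing import Any, Dict, List


class ToyTable:
    """Toy model: a row-oriented write buffer plus column-oriented chunks."""

    def __init__(self, columns: List[str], flush_rows: int = 4) -> None:
        self.columns = columns
        self.flush_rows = flush_rows  # hypothetical threshold, counted in rows
        self.write_buffer: List[Dict[str, Any]] = []  # stands in for SCM
        self.chunks: List[Dict[str, List[Any]]] = []  # stands in for flash

    def insert(self, row: Dict[str, Any]) -> None:
        # New rows always land in the row-oriented buffer first.
        self.write_buffer.append(row)
        if len(self.write_buffer) >= self.flush_rows:
            self._flush()

    def _flush(self) -> None:
        # Transpose the buffered rows into one list of values per column.
        chunk = {c: [r[c] for r in self.write_buffer] for c in self.columns}
        self.chunks.append(chunk)
        self.write_buffer = []

    def scan_column(self, name: str) -> List[Any]:
        # A scan reads the columnar chunks and the not-yet-flushed rows,
        # so recently inserted rows are visible to queries.
        values: List[Any] = []
        for chunk in self.chunks:
            values.extend(chunk[name])
        values.extend(r[name] for r in self.write_buffer)
        return values


table = ToyTable(["id", "temp"], flush_rows=4)
for i in range(6):
    table.insert({"id": i, "temp": 20.0 + i})

ids = table.scan_column("id")
```

After six inserts with a threshold of four, the first four rows sit in one columnar chunk and the last two are still in the write buffer, yet `scan_column` returns all six values.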

The behavior of DataBase is pretty well documented in the VAST DataBase Administrator’s Guide.

Interfaces

Querying the VAST DataBase is most commonly done using either Trino or Spark SQL. VAST ships drivers for both that implement predicate pushdown.
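For readers unfamiliar with predicate pushdown, the following Python sketch shows the general idea: the filter is evaluated where the data lives, so only matching rows are shipped to the query engine. Everything here is hypothetical (`STORED_ROWS`, `storage_scan`, `wanted`); it does not use or describe the actual Trino or Spark drivers.

```python
from typing import Callable, Dict, List, Optional

Row = Dict[str, int]

# Hypothetical table contents held on the storage side.
STORED_ROWS: List[Row] = [{"id": i, "size": i * 10} for i in range(1000)]


def storage_scan(predicate: Optional[Callable[[Row], bool]] = None) -> List[Row]:
    """Hypothetical storage-side scan; returns the rows sent to the engine."""
    if predicate is None:
        return list(STORED_ROWS)
    return [r for r in STORED_ROWS if predicate(r)]


def wanted(row: Row) -> bool:
    # The query's WHERE clause, e.g. "size > 9900".
    return row["size"] > 9900


# Without pushdown: every row is shipped, then filtered by the engine.
shipped_all = storage_scan()
result_without_pushdown = [r for r in shipped_all if wanted(r)]

# With pushdown: the predicate travels to storage, so only matches are shipped.
shipped_filtered = storage_scan(wanted)
result_with_pushdown = shipped_filtered
```

Both approaches return the same rows; the difference is how many rows cross the boundary between storage and the engine (`len(shipped_all)` versus `len(shipped_filtered)`).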

The low-level interface into the VAST DataBase is provided by the VAST Arrow Database Connectivity (ADBC) driver.¹ This exposes a DuckDB-like SQL interface.

Underneath this is a REST-based API that end users aren’t meant to use. I think it encodes messages using protobuf.

VAST Query Engine

VAST implements a query engine within DataBase that is becoming progressively more capable as they release new versions. As of version 5.4, the VAST Query Engine supports:²

Feature	Function
Vector search	The basis for the VAST vector database
Column permissions	Control which users can see which columns
Filter pushdown	Scalable filtering of simple predicates³
Nested datatype processing	Query elements within structures within columns
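As a rough illustration of what "query elements within structures within columns" means, the Python sketch below filters rows on a field nested two levels deep inside a struct-like column. The helper `get_path`, the dotted-path syntax, and the sample rows are assumptions for illustration only, not the VAST Query Engine's syntax or behavior.

```python
from typing import Any, Dict, List


def get_path(value: Dict[str, Any], path: str) -> Any:
    """Follow a dotted path such as 'device.location.city' into nested dicts."""
    current: Any = value
    for key in path.split("."):
        current = current[key]
    return current


# Hypothetical rows whose 'device' column holds a nested structure.
rows: List[Dict[str, Any]] = [
    {"id": 1, "device": {"model": "a1", "location": {"city": "Oslo"}}},
    {"id": 2, "device": {"model": "b2", "location": {"city": "Lima"}}},
]

# Select the ids of rows whose nested city field matches.
matching_ids = [r["id"] for r in rows if get_path(r, "device.location.city") == "Lima"]
```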

Glenn's Digital Garden
