Tabular Docs

Tabular is a cloud-native warehouse and automation platform. It’s one central store for all your analytic data that can be used anywhere.

On this page:

Core Concepts

Storage

Tabular provides a fully managed storage solution built on top of Apache Iceberg tables, the standard for analytic data, and cloud object storage such as Amazon S3.

Tabular storage essentially behaves just like a traditional data warehouse. Users create tables with well defined schemas, read and write data using the tool of their choice, and control data access through a rich role-based access control system.

All the while, Tabular automatically handles all storage related maintenance and optimization such as compacting small files and removing expired table snapshots.

The end result is improved query performance, lower cloud resource spend, and a reduced load on data engineering teams.

Compute

Tabular allows users to use a variety of compute environments directly against their cloud-native storage.

This is a Bring You Own Compute model. Unlike traditional data warehouse products, Tabular users are free to use whatever compute engine makes sense for their use cases, including open source tools like Apache Spark, Trino, and Apache Flink, as well as commercial products like AWS Athena and Snowflake.

Catalog

Tabular includes a fully-hosted implementation of the Apache Iceberg REST catalog API. This enables any compute framework that supports the Iceberg REST catalog API to interact with a Tabular warehouse via simple configuration. In particular this is true with AWS (Athena, Redshift, EMR), as AWS has also adopted the REST catalog. Tabular and AWS connect directly; the integration is accomplished via IAM roles (which Athena requires).

Acting as the catalog interface to the data is what enables Tabular to automate table maintenance and optimization activities as well as enforce access controls.

Getting Started

Tabular homepage displaying the Poochfitness demo environment

Welcome to Tabular

Next steps: