ClickHouse is trending.

The open-source DBMS currently has over 36K GitHub stars and counting, and it seems like everybody is adopting it for real-time analytics use cases.

Among popular open-source databases, ClickHouse has experienced incredible growth.

There's good reason for the hype. ClickHouse is arguably the fastest OLAP database in the world. Its column-oriented storage format and SQL engine make it tremendously effective as a true DBMS for handling large-scale analytics on streaming and event-driven data architectures.

But.

(There's always a but.)

ClickHouse is known for its complexity. With great power comes great responsibility, and some people simply don't want to be responsible for setting up and maintaining a ClickHouse cluster.

Self-hosting ClickHouse means learning all the ins and outs of maintaining the powerful database.

That doesn't mean you shouldn't use ClickHouse. You absolutely should, especially if you are trying to build scalable real-time, user-facing analytics over any kind of time series data.

But unless you intend to hunker down and learn the deep internals of this powerful but mercurial database, you probably want some help, likely in the form of a managed ClickHouse service.

And even if you can learn the ClickHouse deep magic, you still might want a managed service simply because you don't want to maintain infra. That's a perfectly valid reason to choose a managed ClickHouse service over self-hosting.

If managed ClickHouse is what you're after, skip ahead to see some options and how they compare. If you're not convinced that a managed service is for you, keep reading.

Why choose managed ClickHouse over self-hosted ClickHouse?

For the same reason you choose "buy" over "build" in any other case: time and money. Building things takes time and costs money. You buy off the shelf to get the economies afforded by someone who has been there before you.

But let's dig a bit deeper into what you actually need to build and maintain for an effective self-hosted ClickHouse:

High Availability. High Availability (HA) is critical for production databases. If a cluster fails, you need a backup, and you need to gracefully manage failover. For a highly available ClickHouse deployment, you need at minimum two ClickHouse instances, a ZooKeeper implementation, and a load balancer.
Upgrades. ClickHouse is upgraded frequently (stable packages are released roughly every month, and long-term support (LTS) packages roughly twice a year). Upgrading the database unlocks new features and is required for security reasons, but it can also introduce regressions.

When you upgrade a ClickHouse cluster, you have to account for all running queries, read and write paths, materialized views being populated, and so on. This is far from trivial.
Write/Read Services. ClickHouse is often used in fast-paced, high-scale data applications involving very high ingest throughput and high query concurrency demanding sub-second latency response times.

When using ClickHouse, you have to consider ancillary services to the "left and right" of the database. How will you handle streaming ingestion when ClickHouse prefers to batch writes? How will you expose the query engine to your applications?

These things take time to build and money to host.
Observability. Databases don't exist in a vacuum. They need to be monitored, and ClickHouse is no exception.

And the list doesn't end there. ClickHouse is complex, the learning curve is steep, and it may take too long to fully harness its power, especially for smaller engineering teams.
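
To make the streaming-versus-batch question above a bit more concrete, here is a minimal sketch of the kind of write-side buffer you might end up building. It collects individual events and hands them off in batches once a row count or age threshold is reached. Everything here is hypothetical and for illustration only: the BatchBuffer name, the thresholds, and the flush_fn callback (which in a real service would wrap whatever insert call your ClickHouse client provides) are assumptions, not part of any ClickHouse API.

```python
import time
from typing import Any, Callable, Dict, List, Optional

Row = Dict[str, Any]


class BatchBuffer:
    """Hypothetical helper: collects single events and flushes them in batches."""

    def __init__(
        self,
        flush_fn: Callable[[List[Row]], None],
        max_rows: int = 10_000,
        max_age_s: float = 1.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._flush_fn = flush_fn      # e.g. a function that does one bulk insert
        self._max_rows = max_rows      # flush once this many rows are buffered
        self._max_age_s = max_age_s    # ...or once the oldest row is this old
        self._clock = clock
        self._rows: List[Row] = []
        self._first_row_at: Optional[float] = None

    def add(self, row: Row) -> None:
        """Buffer one row; flush if the size or age threshold is reached."""
        if not self._rows:
            self._first_row_at = self._clock()
        self._rows.append(row)
        # Note: age is only checked when a new row arrives. A production
        # version would also flush from a background timer.
        if len(self._rows) >= self._max_rows or self._age() >= self._max_age_s:
            self.flush()

    def _age(self) -> float:
        if self._first_row_at is None:
            return 0.0
        return self._clock() - self._first_row_at

    def flush(self) -> None:
        """Send whatever is buffered as one batch; no-op when empty."""
        if not self._rows:
            return
        batch, self._rows = self._rows, []
        self._first_row_at = None
        self._flush_fn(batch)


if __name__ == "__main__":
    sent: List[List[Row]] = []
    buffer = BatchBuffer(sent.append, max_rows=3, max_age_s=60.0)
    for event_id in range(7):
        buffer.add({"event_id": event_id})
    buffer.flush()  # drain the remainder on shutdown
    print([len(batch) for batch in sent])
```

Even this toy version leaves out retries, backpressure, durability if the process crashes before a flush, and timer-driven flushing, which is part of why the ancillary services take real time to build.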

Current options for Managed ClickHouse

Fortunately for those who don't want the headaches of scaling their own ClickHouse cluster, there are a growing number of managed ClickHouse options. You can find an exhaustive list further below, but three of the most popular managed ClickHouse products currently on the market are (in no particular order):

ClickHouse Cloud
Altinity
Tinybird

Below you'll find a quick comparison of each of the three in terms of use cases, developer experience, cost, and critical features.

ClickHouse Cloud

The "official" managed ClickHouse, created by ClickHouse, Inc., the maintainers of the OSS ClickHouse project.

Highlights

Available on AWS, GCP, and Azure
Automated scaling on prescribed compute range
Automated replication and backups
Automatic service idling when inactive
Low data storage costs backed by cloud-native architecture and object storage
Interactive SQL Console on the Web UI
Visualize any query result as a chart
Terraform provider for automating infrastructure management
Direct access to the ClickHouse database
ClickPipes for managed ingestion from a handful of streaming sources
Several clients for common languages to build your backend over ClickHouse
SOC 2 Type II compliant
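
As a rough illustration of what "direct access" can look like without a dedicated client library, the sketch below builds (but does not send) a request against ClickHouse's HTTP interface using only the Python standard library. The host name is a placeholder, and port 8443 is an assumption you should confirm against your own service's connection details; build_query_request is a hypothetical helper, not part of any official client.

```python
import urllib.parse
import urllib.request


def build_query_request(
    host: str,
    user: str,
    password: str,
    sql: str,
    output_format: str = "JSONEachRow",
) -> urllib.request.Request:
    """Build a POST request for the ClickHouse HTTP interface.

    The SQL goes in the request body, credentials go in the
    X-ClickHouse-User / X-ClickHouse-Key headers, and default_format
    selects the response format.
    """
    params = urllib.parse.urlencode({"default_format": output_format})
    # Assumption: the service exposes HTTPS on port 8443.
    url = f"https://{host}:8443/?{params}"
    return urllib.request.Request(
        url,
        data=sql.encode("utf-8"),
        headers={"X-ClickHouse-User": user, "X-ClickHouse-Key": password},
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder host and credentials, for illustration only.
    request = build_query_request(
        "your-service.example.invalid", "default", "your-password", "SELECT 1"
    )
    print(request.get_method(), request.full_url)
    # To actually run the query you would pass the request to
    # urllib.request.urlopen(request) and read the response body.
```

In practice you would more likely reach for one of the language clients mentioned above, which handle connection pooling, type mapping, and batched inserts for you.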

Cost

Pricing for ClickHouse Cloud is based on your chosen plan and compute size.

Development plans are pinned at 16 GiB / 2 vCPU and range from $1 to $193 per month depending on storage ($35/TB compressed) and compute ($0.22/active hour).

Production plans range from 24 GiB / 6 vCPU to 3,600 GiB / 960 vCPU of compute. Pricing is based on storage (~$50/TB compressed) and compute ($0.75/compute unit-hr), with costs ranging from ~$500 to $100,000 per month depending on cloud, compute units, and storage.
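
To see how the storage and compute rates quoted above combine into a monthly bill, here is a small back-of-the-envelope sketch. The function name, the 720-hour month, and the example workloads are all assumptions for illustration. The article does not define what a "compute unit" is, so it is treated here as a plain multiplier; use the official calculator for real numbers.

```python
def estimate_monthly_cost(
    storage_tb: float,
    storage_price_per_tb: float,
    active_hours: float,
    compute_price_per_hour: float,
    compute_units: float = 1.0,
) -> float:
    """Rough monthly estimate: storage cost plus compute cost, in dollars."""
    storage_cost = storage_tb * storage_price_per_tb
    compute_cost = active_hours * compute_units * compute_price_per_hour
    return round(storage_cost + compute_cost, 2)


if __name__ == "__main__":
    hours_in_month = 720  # assumption: 30 days, service never idles

    # Development plan: $35/TB compressed, $0.22 per active hour.
    # Assumed workload: 1 TB stored, active all month.
    dev = estimate_monthly_cost(1.0, 35.0, hours_in_month, 0.22)

    # Production plan: ~$50/TB compressed, $0.75 per compute unit-hour.
    # Assumed workload: 2 TB stored, 4 compute units, active all month.
    prod = estimate_monthly_cost(2.0, 50.0, hours_in_month, 0.75, compute_units=4)

    print(f"development: ${dev:,.2f}  production: ${prod:,.2f}")
```

Under these assumptions the development estimate comes out at roughly $193, in line with the top of the range quoted above, and the production estimate at roughly $2,260. Letting the service idle reduces active_hours and therefore the compute portion of the bill.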

A screenshot of ClickHouse Cloud's pricing calculator — Pricing for ClickHouse Cloud is based on storage and compute units consumed. Costs for a production-grade cluster can range from $200 - $100K+ per month.

Dedicated infrastructure is also available at customized prices.

You can check pricing for ClickHouse Cloud and calculate your price based on storage and compute using the pricing calculator on its website.

Developer Experience

ClickHouse Cloud is a classic "database-as-a-service", providing a clean interface to a hosted database solution. The developer experience demands a level of ClickHouse expertise, as most of the non-infrastructure database settings are not abstracted. Actions like creating, renaming, or dropping tables, populating materialized views, etc. require knowledge of ClickHouse's flavor of SQL.

A screenshot of ClickHouse Cloud's SQL Console — ClickHouse Cloud provides a classic read/write SQL interface to the database. You can perform most database operations directly from this console

ClickHouse Cloud also boasts a solid number of integrations, both managed and supported through community development. The recent introduction of ClickPipes allows for streaming data ingestion from popular sources like Apache Kafka, Confluent, Postgres + Debezium CDC, and others without setting up additional infrastructure.

A screenshot of ClickHouse Cloud's streaming ingestion integrations, called ClickPipes. — ClickHouse Cloud recently introduced ClickPipes, which are managed connectors for various streaming data sources.

For developers keen on building an analytics API service on top of ClickHouse, ClickHouse Cloud recently launched query endpoints in beta. With this feature, you can create static HTTP endpoints from saved queries. However, these endpoints don't benefit from the same performance characteristics as the underlying database.

A screenshot of a ClickHouse Cloud SELECT query published as a beta endpoint — You can publish saved SELECT queries as API Endpoints in beta, but the performance of these queries was not very good in our tests.

Takeaways

ClickHouse Cloud is a flexible and performant way to host a ClickHouse database on managed infrastructure. It provides great tools for interfacing directly with the database, along with some additional infrastructure features (ClickPipes, endpoints, etc.) that extend beyond it.

Altinity

Altinity.Cloud provides managed ClickHouse deployments on both hosted cloud and VPC. In addition, Altinity offers ClickHouse consulting services and maintains several open source ClickHouse projects such as their Kubernetes Operator for ClickHouse.