Choosing the Right Time-Series Architecture: ArcticDB vs. ClickHouse and TimescaleDB

Blog

Why serverless DataFrame databases like ArcticDB are the superior choice for initializing real-time streaming in [**autoStreams**](https://autostreams.de/) compared to traditional OLAP engines.

Zurück zum Blog
6. Januar 2026 | Schoenenberger, Bayer | architecturedatabasestime-seriescloud-native

The Challenge: Lean Real-Time Initialization

At Palaimon, our autoStreams platform triggers immediate actions from real-time data. However, “real-time” often requires historical context—comparing live metrics to baselines from days or weeks ago.

To maintain operational agility, we follow a strict philosophy: The live system should be as lean as possible. We avoid long-running processes with massive state. Instead, we offload historical data to a dedicated backend that must provide:

The Architectural Shift: Serverless DataFrames

Traditional OLAP engines like TimescaleDB (Postgres-based) and ClickHouse (columnar) are the industry standards for high-performance time-series. They excel at complex SQL and massive aggregations but require “always-on” infrastructure.

ArcticDB offers a serverless alternative. Developed by Man Group, it is a high-performance, versioned DataFrame database that stores data directly in S3 or local storage. The “engine” is a Python library running within your application, reading directly from storage and eliminating the database server entirely.

Technical Comparison

While TimescaleDB and ClickHouse offer full SQL support and materialized views, they come with “hidden” costs: 24/7 managed instances, sharding complexity, and hardware optimized for queries we rarely run.

Architecture & Features

Aspect / OperationArcticDBTimescaleDBClickHouse
TypeLibrary (client-side)PostgreSQL extensionColumn-store DB
Server Required❌ No✅ Yes✅ Yes
StorageS3 / Local filesPostgreSQL storageLocal/remote disks
ScalingHorizontal (via storage)Vertical + shardingHorizontal
JoinsClient-side only✅ Full SQL✅ Global
SQL SupportPython API only✅ Native✅ Native
Range Queries✅ Efficient✅ Optimized✅ Very fast
Predicate Pushdown✅ Yes✅ Yes✅ Yes
AggregationsBasicFull SQLVery fast
Materialized Views❌ No✅ Yes✅ Yes

The Trade-off: We sacrifice server-side joins for simplicity. Since we process data in-memory with Pandas, ArcticDB’s efficient range queries and built-in Time-Travel (versioning) provide exactly what we need without the “compute tax” of idle servers.

The Economics of Scale: 10 TB Benchmark

The difference between “server-backed” and “serverless” becomes business-defining at scale. Here is the estimated monthly cost for storing and accessing 10 TB of time-series data on AWS.

FeatureTimescaleDB (Managed)ClickHouse (Managed)ArcticDB (S3 + Lambda/EC2)
Compute~$450 (High-mem)~$600 (Prod cluster)$0 (In app runtime)
Storage~$1,000 (EBS/IOPS)~$1,000 (SSD)~$230 (S3 Standard)
Total/Mo~$1,450~$1,600~$230

Performance on Your Hardware

For on-prem or sovereign clouds, ArcticDB achieves retrieval speeds that networked databases cannot match. Zero-copy reads from local NVMe SSDs bypass network protocol overhead entirely.

ComponentTraditional ClusterArcticDB (On NVMe)
Drives3x 4TB SSDs (RAID)1x 10TB NVMe SSD
Admin~2-4 Days/Month~1 Hour/Month
RetrievalNetwork limited (~1-10 Gbps)Bus limited (~32-64 Gbps)

Conclusion

For initializing real-time systems with historical windows, we don’t need a full database server—we need fast, targeted retrieval. ArcticDB’s library-based architecture delivers cheaper storage (S3), faster retrieval (direct NVMe), and zero maintenance.

Don’t pay for an idle server to watch your data grow. Go serverless.