Aerodynamic Data Models: Flying Fast at Scale with DuckDB
- Track: Lightning talks
- Room: H.2215 (Ferrer)
- Day: Sunday
- Start: 15:05
- End: 15:20
At Rill, we rely on DuckDB to power fast, interactive dashboards for exploring time-series metrics. To keep them interactive, Rill’s dashboards generate up to 100 parallel queries in response to each user interaction.
In this lightning talk, we'll share the optimization and data modeling techniques that have been pivotal in achieving sub-second response times with DuckDB.
Our primary tactics are using parallel connections so that queries are processed simultaneously, and storing data in chronological order so that min-max indexes become more effective. We also use enum types to handle queries on string columns more efficiently, and we tune DuckDB's configuration. Together, these approaches have let us use DuckDB on larger datasets (100+ GB) while keeping query responses sub-second.
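To illustrate why chronological ordering helps min-max indexes, here is a minimal sketch in plain Python. It does not use DuckDB and is not DuckDB's implementation; it only simulates the general idea of keeping a minimum and maximum per block of rows and skipping blocks that cannot match a time-range filter. The names `RowGroup`, `build_row_groups` and `scan_range`, the group size of 1,000 and the integer timestamps are all assumptions made for the example.

```python
import random
from dataclasses import dataclass


@dataclass
class RowGroup:
    lo: int          # smallest timestamp stored in this group
    hi: int          # largest timestamp stored in this group
    rows: list       # the timestamps themselves


def build_row_groups(timestamps, group_size):
    """Split rows into fixed-size groups and record min/max per group."""
    groups = []
    for i in range(0, len(timestamps), group_size):
        chunk = timestamps[i:i + group_size]
        groups.append(RowGroup(min(chunk), max(chunk), chunk))
    return groups


def scan_range(groups, start, end):
    """Count rows with start <= ts < end; also report how many groups were read."""
    matches = 0
    groups_read = 0
    for g in groups:
        if g.hi < start or g.lo >= end:
            continue  # min/max proves no row in this group can match
        groups_read += 1
        matches += sum(1 for ts in g.rows if start <= ts < end)
    return matches, groups_read


random.seed(0)
timestamps = list(range(10_000))   # hypothetical event times, already in order
shuffled = timestamps[:]
random.shuffle(shuffled)           # same rows, stored in arbitrary order

sorted_groups = build_row_groups(timestamps, 1_000)
shuffled_groups = build_row_groups(shuffled, 1_000)

sorted_hits, sorted_read = scan_range(sorted_groups, 2_000, 3_000)
shuffled_hits, shuffled_read = scan_range(shuffled_groups, 2_000, 3_000)

print("sorted:  ", sorted_hits, "rows from", sorted_read, "group(s)")
print("shuffled:", shuffled_hits, "rows from", shuffled_read, "group(s)")
```

Both layouts return the same rows, but the time-ordered layout answers the query by reading a single group, because every other group's min/max range lies entirely outside the filter. When rows are stored in arbitrary order, each group's range tends to span most of the timeline, so far fewer groups can be skipped.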
Join us to see how these optimizations can improve your own data processing and query performance in DuckDB.
Speakers
Nishant Bangarwa (Rill Data)
Mike Driscoll