Tuesday, October 22, 2024 | San Francisco Bay Area
Agenda
As Apache Iceberg becomes the de facto open table format for analytical datasets, there is a growing need to have specific Apache Iceberg tables containing high cardinality, highly dimensional event data available for rapid and open-ended data exploration. This talk discusses how Druid addresses this need by extending its ingestion layer to read the Iceberg table format. This integration helps Druid power interactive dashboards and support slice-n-dice analytics within Data Lakehouses.
By the end of this session, participants will understand:
* How Iceberg tables can be ingested into Druid
* Real-world use cases
In this session, we’ll provide an overview of how Polaris serves as the easy button for Druid – including an architectural overview, product differentiators, and top technical use cases such as asynchronous query, time series analysis and more.
1. **Cluster Management and Deployment:** An overview of Netflix’s strategies for managing and deploying Druid clusters, emphasizing automation and scalability.
2. **Centralized Logging and Metrics:** Techniques for aggregating and analyzing logs and metrics to facilitate real-time monitoring and post-mortem analysis.
3. **Cluster Architecture Patterns:** Best practices and patterns employed by Netflix to architect Druid clusters for optimal performance and reliability.
4. **Parallel Testing Framework:** Detailed methodologies for executing parallel runs and conducting A/B testing to evaluate different Druid configurations, including the tools and frameworks used.
This session will provide practical knowledge and actionable insights, empowering attendees to apply similar strategies within their own organizations to optimize Druid deployments. Join us to learn how Netflix leverages advanced testing and analytical techniques to push the boundaries of what is possible with Apache Druid.