Druid Summit is a conference devoted to helping data professionals develop and operate real-time analytics solutions based on the open source Apache Druid database. It includes training and education about Druid and its ecosystem, as well as talks by practitioners and industry experts covering development methods, architectural patterns, operational best practices and real-world case studies of Druid in production. It also provides a forum for data engineers, architects and leaders to share their experiences and network with their peers.
In this one day hands-on course you will learn how to architect a Druid platform that is optimized for your specific use case and work with you to ensure that you will be successful in production. Specifically, we will cover the following concepts and hands on exercises:
● What is Apache Druid?
● How does Druid work?
● What can you use Druid for?
● Druid architecture
● Druid File Format
● Data Modeling with Druid
● Druid and Apache Kafka
● Data Ingestion into Druid
● Querying Data
As a complement to the Apache Druid Fundamentals Training, the free Druid University series of videos is available.
.
For this class, we'll be using a virtual machine (VM) with Hadoop installed on it. Each student will need to install VirtualBox 5.2.20 or above. Their computers will need a 64-bit processor and 8+ GB of free RAM. Alternatively, this course can be done on Imply Cloud, which would require computers to have internet access to AWS and a browser.
In this half day session, you will learn how to turn your Apache Druid database performance up to an 11! We will cover techniques used to make ingestion (both streaming and batch) and queries perform best dependent on your (virtual or physical) hardware configuration. You will learn what settings are the most effective and how to wield their power to the greatest benefit. Mixed workloads of both ingestion and queries? No problem! Need to do a massive backfill? We have you covered! Interactive dashboards with 1000s of users? You got this!
● Native batch ingestion
● Real time ingestion
● Query tuning
● Planning for concurrency
● Mixed workloads
So, you like Apache Druid eh? Got a pretty good POC or Pilot up and running, but wondering how to move it over into production? Then this is the class for you! In this half day course, we will cover the best practices to bring a Druid cluster into production without casting a Level 10 spell. We will cover:
● Compaction
● Deleting data
● GDPR with Druid
● Updating data
● Retention Rules / Data Tiering
● General Troubleshooting / Logs
Attendees should be familiar with Druid architecture and concepts and perhaps have explored Druid in a single node environment.
As preparation for the Advanced Apache Druid Training, the free Druid University videos are available at Druid University.
For this class, we'll be using a virtual machine (VM) with Hadoop installed on it. Each student will need to install VirtualBox 5.2.20 or above. Their computers will need a 64-bit processor and 8+ GB of free RAM. Alternatively, this course can be done on Imply Cloud, which would require computers to have internet access to AWS and a browser.
Style and substance combine at San Francisco Airport Marriott Waterfront. Gorgeously redesigned and ideally situated in Burlingame, the Marriott offers a complimentary shuttle to and from the SFO terminals. Downtown San Francisco is easily accessible via BART and Caltrain. Settle in to newly transformed accommodations with light, airy design and tech-friendly amenities. Many of our hotel rooms offer views of the San Francisco Bay waterfront, and all feature soundproof windows and pillowtop bedding.
Marriott Bonvoy members and guests reserving our M Club accommodations enjoy access to our M Club, with its superb staff and benefits including complimentary daily breakfast. Elsewhere at the hotel, you can take a swim in the indoor pool, work out in the fitness center or watch planes take off and land while you dine at Hangar Steak Restaurant, our on-site restaurant. Those planning events in Burlingame will be delighted by our waterfront venues and ease of access San Francisco International Airport.
A discounted room rate of $299 is available for conference attendees. Link to book rooms will be included in your confirmation of registration.
If you plan to drive, the venue is easily accessible highway 101. The hotel is a 23 minute walk or 5 minute car share ride from the Millbrae BART / Caltrain station.
Apache Druid is an independent project of The Apache Software Foundation. More information can be found at https://druid.apache.org.