
Project Page
Last Commit
Jun. 22, 2018
Oct. 23, 2012



Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments.

Druid excels as a data warehousing solution for fast aggregate queries on petabyte-sized data sets. It supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.
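As a rough illustration of how a filter, an exact calculation, and an approximate algorithm can be combined in one aggregate query, here is a minimal sketch of a native JSON timeseries query built in Python. The data source name (`wikipedia`), the column names (`country`, `count`, `user`), and the Broker URL are assumptions made up for this example; substitute the values from your own cluster.

```python
import json
from urllib import request


def build_timeseries_query(datasource, interval, country):
    """Build a Druid native timeseries query as a plain dict.

    All column names below are hypothetical placeholders.
    """
    return {
        "queryType": "timeseries",
        "dataSource": datasource,
        "granularity": "hour",
        "intervals": [interval],
        # Selector filter: keep only rows whose dimension equals the value.
        "filter": {"type": "selector", "dimension": "country", "value": country},
        "aggregations": [
            # Exact calculation: sum of a numeric metric column.
            {"type": "longSum", "name": "edits", "fieldName": "count"},
            # Approximate algorithm: HyperLogLog-based distinct count.
            {"type": "cardinality", "name": "unique_users", "fields": ["user"]},
        ],
    }


def run_query(broker_url, query):
    """POST the query to a Broker and return the decoded JSON response."""
    req = request.Request(
        broker_url,
        data=json.dumps(query).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    query = build_timeseries_query("wikipedia", "2018-06-01/2018-06-02", "US")
    print(json.dumps(query, indent=2))
    # Uncomment to send the query to a running cluster. The URL is an
    # assumption based on a default local Broker; adjust it as needed.
    # print(run_query("http://localhost:8082/druid/v2/", query))
```

Building the query as a plain dict keeps the example inspectable without a running cluster; only `run_query` touches the network.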

Druid can load both streaming and batch data and integrates with Samza, Kafka, Storm, Spark, and Hadoop.


License

Apache License, Version 2.0

More Information

More information about Druid can be found on the project website.


You can find the documentation for the latest Druid release on the project website.

If you would like to contribute documentation, please do so under /docs/content in this repository and submit a pull request.

Getting Started

You can get started with Druid with our quickstart.

Reporting Issues

If you find any bugs, please file a GitHub issue.


The Druid community is in the process of migrating to Apache by way of the Apache Incubator. Eventually, as we proceed along this path, our site will move from to

Community support is available on the druid-user mailing list ([email protected]), which is hosted at Google Groups.

Development discussions occur on [email protected], which you can subscribe to by emailing [email protected].

We also have a couple people hanging out on IRC in #druid-dev on


Contributing

Please follow the guidelines listed here.

Latest Releases
 Jun. 8 2018
 Jun. 5 2018
 May. 15 2018
 May. 3 2018
 Mar. 8 2018