Counting 3,834 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

Project Page
Last Commit
May. 18, 2019
Apr. 10, 2013


CrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of machine data in real-time.

Features of CrateDB:

  • Standard SQL plus dynamic schemas, queryable objects, geospatial features, time series data, first-class BLOB support, and realtime full-text search.
  • Horizontally scalable, highly available, and fault tolerant clusters that run very well in virtualized and containerised environments.
  • Extremely fast distributed query execution.
  • Auto-partitioning, auto-sharding, and auto-replication.
  • Self-healing and auto-rebalancing.

CrateDB offers the scalability and flexibility typically associated with a NoSQL database and is designed to run on inexpensive commodity servers and can be deployed and run across any sort of network. From personal computers to multi-region hybrid clouds.

The smallest CrateDB clusters can easily ingest tens of thousands of records per second. And this data can be queried, ad-hoc, in parallel across the whole cluster in real time.


CrateDB provides an admin UI:

Screenshots of the CrateDB admin UI

Try CrateDB

The fastest way to try CrateDB out is by running:

$ bash -c "$(curl -L"

Or spin up the official Docker image:

$ docker run -p 4200:4200 crate

Visit the getting started page to see all the available download and install options.

Once you're up and running, head on over to the introductory docs.


This project is primarily maintained by, but we welcome community contributions!

See the developer docs and the contribution docs for more information.


Looking for more help?

Latest Releases
 Apr. 17 2019
 Apr. 16 2019
 Apr. 9 2019
 Apr. 9 2019
 Mar. 28 2019