Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable
Apache Cassandra is a highly-scalable partitioned row store. Rows are organized into tables with a required primary key
Unified Resource Scheduler to co-schedule mixed types of workloads such as batch, stateless and stateful jobs in a single cluster for better resource utilization.
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services
Apache Hadoop is a framework for running applications on large cluster built of commodity hardware
An open source, distributed bitmap index that dramatically accelerates queries across multiple, massive data sets.
Metron integrates a variety of open source big data technologies in order to offer a centralized tool for security monitoring and analysis
A Hadoop native SQL query engine that combines the key technological advantages of MPP database with the scalability and convenience of Hadoop
A data lake management software platform and framework for enabling scalable enterprise-class data lakes on Apache Hadoop and Spark
Torch on steroids
A GPU-powered real-time analytics storage and query engine.
High performance distributed data processing engine
A large-scale entity and relation database supporting very large graphs containing rich, aggregated properties on the nodes and edges. Several storage options are available, including Accumulo, Hbase and Parquet.
The MapD Core database
Columnar store for analytics with Postgres, developed by Citus Data
A time-series database streaming oriented optimized for the serving layer.
Scalable PostgreSQL for multi-tenant and real-time workloads
Distributed training framework for TensorFlow, Keras, PyTorch, and MXNet.
A lightweight, modular, and scalable deep learning framework.
Similar to how Hadoop provides a set of general primitives for doing batch processing, Storm provides a set of general primitives for doing realtime computation
Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters
A distributed, parallel C++ query engine that lets you analyze, transform and combine data stored in Apache Hadoop clusters
The Apache Ignite In-Memory Data Fabric is a high-performance, integrated and distributed in-memory platform for computing and transacting on large-scale data sets in real-time, orders of magnitude faster than possible with traditional disk-based or flash technologies.
The Baidu File System.
Apache Apex is a unified platform for big data stream and batch processing
Fast, scalable, easy-to-use Python based Deep Learning Framework by Nervana™
A framework for implementing federated learning
An engine for low-latency computation over large data sets. It stores and indexes your data such that queries, selection and processing over the data can be performed at serving time.
Lightweight library to build and train neural networks in Theano