A distributed SQL database that makes it simple to store and analyze massive amounts of machine data in real-time.
A Hyper-Relational Database for Knowledge-Oriented System
Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.
Apache Hadoop is a framework for running applications on large cluster built of commodity hardware
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Airflow is a platform to programmatically author, schedule and monitor workflows
Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable
The Apache Hive (TM) data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL
A high performance replicated log service. (The development is moved to Apache Incubator)
Fast, scalable, easy-to-use Python based Deep Learning Framework by Nervana™
Spark is a fast and general cluster computing system for Big Data
Open Source, Distributed, RESTful Search Engine
Caffe: a fast open framework for deep learning.
Vitess is a database clustering system for horizontal scaling of MySQL.
An engine for low-latency computation over large data sets. It stores and indexes your data such that queries, selection and processing over the data can be performed at serving time.
Microsoft Cognitive Toolkit (CNTK)
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.
Column oriented distributed data store ideal for powering interactive applications
Alluxio, formerly Tachyon, A Virtual Distributed Storage at Memory Speed
PArallel Distributed Deep LEarning
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms
A customisable 3D platform for agent-based AI research
Arrow is a set of technologies that enable big-data systems to process and move data fast
A realtime distributed OLAP datastore
Distributed SQL query engine for big data
A lightweight, modular, and scalable deep learning framework.
A Hadoop native SQL query engine that combines the key technological advantages of MPP database with the scalability and convenience of Hadoop
Similar to how Hadoop provides a set of general primitives for doing batch processing, Storm provides a set of general primitives for doing realtime computation