A lightweight, modular, and scalable deep learning framework.
Distributed training framework for TensorFlow.
The MapD Core database
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms
Ultrafast and elastic data processing
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
A customisable 3D platform for agent-based AI research
High performance distributed data processing engine
An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++
Vitess is a database clustering system for horizontal scaling of MySQL.
Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.
ClickHouse is a free analytic DBMS for big data.
PlaidML is a framework for making deep learning work everywhere.
Apache Hadoop is a framework for running applications on large clusters built of commodity hardware
Open Source, Distributed, RESTful Search Engine
PArallel Distributed Deep LEarning
Spark is a fast and general cluster computing system for Big Data
Column oriented distributed data store ideal for powering interactive applications
A data lake management software platform and framework for enabling scalable enterprise-class data lakes on Apache Hadoop and Spark
Stroom is a highly scalable data storage, processing and analysis platform.
A Hyper-Relational Database for Knowledge-Oriented Systems
Apache Mesos is a cluster manager that provides efficient resource isolation and sharing across distributed applications, or frameworks
Alluxio, formerly Tachyon: Virtual Distributed Storage at Memory Speed
Let’s Big Data. Hue is an open source Web interface for analyzing data with Hadoop and Spark.
Apache Cassandra is a highly scalable partitioned row store. Rows are organized into tables with a required primary key.
Distributed SQL query engine for big data
Apache Flink is an open source stream processing framework with powerful stream- and batch-processing capabilities
Airflow is a platform to programmatically author, schedule and monitor workflows
Caffe: a fast open framework for deep learning.