Apache Calcite is a dynamic data management framework.
Lightweight library to build and train neural networks in Theano
Spark is a fast and general cluster computing system for Big Data
A lightweight, modular, and scalable deep learning framework.
Distributed training framework for TensorFlow.
Arrow is a set of technologies that enable big-data systems to process and move data fast
The Apache Ignite In-Memory Data Fabric is a high-performance, integrated and distributed in-memory platform for computing and transacting on large-scale data sets in real-time, orders of magnitude faster than possible with traditional disk-based or flash technologies.
Ultrafast and elastic data processing
Similar to how Hadoop provides a set of general primitives for doing batch processing, Storm provides a set of general primitives for doing realtime computation
Caffe: a fast open framework for deep learning.
Curator is a set of Java libraries that make using Apache ZooKeeper much easier
Apache Hadoop is a framework for running applications on large cluster built of commodity hardware
Kafka™ is used for building real-time data pipelines and streaming apps
SnappyData: OLTP + OLAP Database built on Apache Spark
Microsoft Cognitive Toolkit (CNTK)
Airflow is a platform to programmatically author, schedule and monitor workflows
Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.
A customisable 3D platform for agent-based AI research
A python package that provides Tensor computation (like numpy) with strong GPU acceleration and Deep Neural Networks built on a tape-based autograd system
Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models
ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services
A scalable, fault tolerant and low latency storage service optimized for append-only workloads.
PArallel Distributed Deep LEarning
Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms
Apache Mesos is a cluster manager that provides efficient resource isolation and sharing across distributed applications, or frameworks
Metron integrates a variety of open source big data technologies in order to offer a centralized tool for security monitoring and analysis
The MapD Core database
The Apache Hive (TM) data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL
Apache Accumulo is a sorted, distributed key/value store that provides robust, scalable data storage and retrieval