Beringei is a high-performance, in-memory storage engine for time series data.
A customisable 3D platform for agent-based AI research
Apache Drill is a distributed MPP query layer that supports SQL and alternative query languages against NoSQL and Hadoop data storage systems
Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters
A fast, distributed, high-performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms
A high-performance neural network inference framework optimized for the mobile platform
A flexible framework of neural networks for deep learning
Distributed training framework for TensorFlow.
Deep Learning for humans
A lightweight, modular, and scalable deep learning framework.
Open Source, Distributed, RESTful Search Engine
High-performance distributed data processing engine
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
Vitess is a database clustering system for horizontal scaling of MySQL.
A platform for cluster management and resource scheduling for AI that incorporates mature designs with a proven track record in Microsoft's large-scale production environment
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more. It also comes with Hadoop support built in.
Distributed, masterless, high-performance, fault-tolerant data processing
Open source platform for the complete machine learning lifecycle
PArallel Distributed Deep LEarning
A data lake management software platform and framework for enabling scalable enterprise-class data lakes on Apache Hadoop and Spark
A scalable, fault-tolerant, and low-latency storage service optimized for append-only workloads.
ClickHouse is a free analytic DBMS for big data.
A Python package that provides tensor computation (like NumPy) with strong GPU acceleration and deep neural networks built on a tape-based autograd system
Spark is a fast and general cluster computing system for Big Data
Containerized Data Analytics
Distributed SQL query engine for big data
Airflow is a platform to programmatically author, schedule and monitor workflows
Apache Calcite is a dynamic data management framework.
The Apache Hive (TM) data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL