Counting 1,868 Big Data & Machine Learning Frameworks, Toolsets, and Examples...

luminoth

687

Deep Learning toolkit for Computer Vision

Halide

1857

a language for image processing and computational photography

AthenaX

479

SQL-based streaming analytics platform at scale

facets

3115

Visualizations for machine learning datasets

fairseq-py

735

Facebook AI Research Sequence-to-Sequence Toolkit written in PyTorch

ml-agents

1035

Unity Machine Learning Agents

SerpentAI

2784

A Game Agent Framework helping you create AIs / Bots to play any game you own

onnx

1409

Open Neural Network Exchange (ONNX) is the first step toward an open ecosystem that empowers AI developers to choose the right tools as their project evolves

nnvm

1086

Intermediate Computational Graph Representation for Deep Learning Systems

rocketmq

2536

A distributed messaging and streaming platform with low latency, high performance and reliability, trillion-level capacity and flexible scalability.

nteract

1910

Desktop notebook app + packages

Chart.js

32909

Simple HTML5 Charts using the <canvas> tag

incubator-predictionio

10490

PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray

gym

7609

A toolkit for developing and comparing reinforcement learning algorithms.

Clusterize.js

5371

Tiny vanilla JS plugin to display large data sets easily

luma.gl

740

A JavaScript WebGL Framework for Data Visualization

voyager

357

Visualization browser for open-ended data exploration

scrapy

23235

Scrapy, a fast high-level web crawling & scraping framework for Python

tsfresh

2268

Automatic extraction of relevant features from time series

TriFusion

24

Streamlining phylogenomic data gathering, processing and visualization

DeepVideoAnalytics

1331

Analyze videos & images, perform detections, index frames & detected objects, search by examples

vega

5322

Vega is a visualization grammar, a declarative format for creating and saving interactive visualization designs

matplotlib

6033

Plotting with Python

incubator-airflow

6257

Airflow is a platform to programmatically author, schedule and monitor workflows.

redash

7621

Make Your Company Data Driven. Connect to any data source, easily visualize and share your data.

faker

5796

A library for generating fake data such as names, addresses, and phone numbers.

pinpoint

4618

Pinpoint is an open source APM (Application Performance Management) tool for large-scale distributed systems written in Java.

cruise-control

559

Fully automates dynamic workload rebalancing and self-healing of a Kafka cluster

kaldi

2602

A toolkit for speech recognition written in C++, intended for use by speech recognition researchers.

metabase

7199

The simplest, fastest way to get business intelligence and analytics to everyone in your company