Counting 3,384 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

apollo

11835

An open autonomous driving platform

tfjs-converter

447

Convert TensorFlow SavedModel and Keras models to TensorFlow.js

reality

775

Comprehensive data proxy to knowledge about real world

Horizon

1527

A platform for Applied Reinforcement Learning (Applied RL)

PyTorch-NLP

1120

Supporting Rapid Prototyping with a Toolkit including Datasets and Neural Network Layers

Matterport

296

A dataset for RGB-D machine learning tasks captured throughout 90 properties with a Matterport Pro Camera

Clusterize.js

6031

Tiny vanilla JS plugin to display large data sets easily

jiant

241

jiant sentence representation learning toolkit is an extensible platform meant to make it easy to run experiments that involve multitask and transfer learning across sentence-level NLP tasks.

SentEval

894

A python tool for evaluating the quality of sentence embeddings.

Chinese-Word-Vectors

3498

100+ Chinese Word Vectors

texar

863

Toolkit for Text Generation and Beyond

opencog

1624

A framework for integrated Artificial Intelligence & Artificial General Intelligence (AGI)

jsonnet

1854

Jsonnet - The data templating language

vega

6478

Vega is a visualization grammar, a declarative format for creating and saving interactive visualization designs

BizCharts

3290

Powerful data visualization library based on G2 and React.

Semantic-Segmentation-Suite

988

Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!

apexcharts.js

4501

A JavaScript Chart Library

pixiedust

684

Python Helper library for Spark IPython Notebooks

spark-jobserver

2209

REST job server for Apache Spark

go-chart

1949

Go chart is a basic charting library in native golang.

Cook

274

Fair job scheduler on Mesos for batch workloads and Spark

serving

2910

A flexible, high-performance serving system for machine learning models

fairseq

2630

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

gpu_monitor

85

Monitor your GPUs whether they are on a single computer or in a cluster

tensorspace

3327

Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js

HiBench

743

HiBench is a big data benchmark suite.

dvc

1584

Data Version Control - Git for data scientists

UpSetR

310

An R implementation of the UpSet set visualization technique published by Lex, Gehlenborg, et al..

hypertools

1313

A Python toolbox designed to facilitate dimensionality reduction-based visual explorations of high-dimensional data to gain geometric insights

pygal

1981

PYthon svg GrAph plotting Library