Counting 2,283 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

stats

829

A statistics package with common functions that are missing from the Golang standard library.

libsvm

2475

Libsvm is a simple, easy-to-use, and efficient software for SVM classification and regression

hyperopt

1764

Distributed Asynchronous Hyperparameter Optimization in Python

PyTorch

11523

A python package that provides Tensor computation (like numpy) with strong GPU acceleration and Deep Neural Networks built on a tape-based autograd system

elasticsearch-hadoop

1169

:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop

keras-js

3486

Run Keras models (tensorflow backend) in the browser, with GPU support

scikit-learn

24951

scikit-learn: machine learning in Python

nltk

5859

A leading platform for building Python programs to work with human language data

pandas

12579

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

keras

24372

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.

dlib

3864

A toolkit for making real world machine learning and data analysis applications in C++

peloton

1069

The Self-Driving Database Management System

cleverhans

1558

A library for benchmarking vulnerability to adversarial examples

spaCy

7940

Industrial-strength Natural Language Processing (NLP) with Python and Cython

probtorch

339

Probabilistic Torch is library for deep generative models that extends PyTorch

tensorlayer

3076

TensorLayer: Deep Learning and Reinforcement Learning Library for TensorFlow.

smile

3593

Statistical Machine Intelligence & Learning Engine

tflearn

7493

Deep learning library featuring a higher-level API for TensorFlow.

mlpack

1883

A scalable machine learning library, written in C++, that aims to provide fast, extensible implementations of cutting-edge machine learning algorithms

wordvectors

715

Pre-trained word vectors of 30+ languages

RISE

1315

RISE: "Live" Reveal.js Jupyter/IPython Slideshow Extension

dynet

2053

DyNet is a neural network library developed by Carnegie Mellon University

darknet

5567

Convolutional Neural Networks

fastText

12260

Library for fast text representation and classification.

neural-enhance

7642

Super Resolution for images using deep learning.

MITIE

1646

library and tools for information extraction

OpenNMT

1555

Open-Source Neural Machine Translation in Torch

h5py

739

The h5py package is a Pythonic interface to the HDF5 binary data format

tiny-dnn

3752

header only, dependency-free deep learning framework in C++11

edward

3204

A library for probabilistic modeling, inference, and criticism. Deep generative models, variational inference. Runs on TensorFlow.