Counting 3,384 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

autokeras

3881

The ultimate goal of AutoML is to allow domain experts with limited data science or machine learning background easily accessible to deep learning models.

TransmogrifAI

1191

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Spark with minimal hand tuning

dmlc-core

602

A common bricks library for building scalable and portable distributed machine learning.

albumentations

1480

fast image augmentation library and easy to use wrapper around other libraries

pocket-tensor

36

Run Keras models from a C++ application on embedded devices

fastText

16748

Library for fast text representation and classification.

hyperopt

2834

Distributed Asynchronous Hyperparameter Optimization in Python

h2o-3

3655

Open Source Fast Scalable Machine Learning API For Smarter Applications (Deep Learning, Gradient Boosting, Random Forest, Generalized Linear Modeling (Logistic Regression, Elastic Net), K-Means, PCA...)

modAL

316

An active learning framework for Python3, designed with modularity, flexibility and extensibility in mind.

tensorpack

3332

A Neural Net Training Interface on TensorFlow

ELL

1870

The Embedded Learning Library allows you to build and deploy machine-learned pipelines onto embedded platforms, like Raspberry Pis, Arduinos, micro:bits, and other microcontrollers.

distiller

1141

A Python package for neural network compression research to reduce the memory footprint of a neural network, increase its inference speed and save energy

zipline

8082

Zipline, a Pythonic Algorithmic Trading Library

tensor2tensor

6122

A library for generalized sequence to sequence models

LocustDB

897

Massively parallel, high performance analytics database that will rapidly devour all of your data.

flair

920

A very simple framework for state-of-the-art NLP

PocketFlow

1421

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

incubator-mxnet

15813

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

fluent

1092

A fully managed, data-first computation framework under development of U.C. Berkeley RISE Lab.

incubator-predictionio

11593

PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.

ignite

1124

A high-level library to help with training neural networks in PyTorch.

sparkflow

189

Easy to use library to bring Tensorflow on Apache Spark

nlp-architect

1609

A Python library by Intel for exploring the state-of-the-art deep learning topologies and techniques for natural language processing and natural language understanding

deeplearning4j

10028

Deep Learning for Java, Scala & Clojure on Hadoop & Spark With GPUs - From Skymind

tensorflow_scala

549

TensorFlow API for the Scala Programming Language

edward

4017

A library for probabilistic modeling, inference, and criticism. Deep generative models, variational inference. Runs on TensorFlow.

blaze

2544

NumPy and Pandas interface to Big Data

swift

3201

Swift for TensorFlow documentation repository.

ml5-library

2121

Friendly machine learning for the web!

auto-sklearn

2869

auto-sklearn is an automated machine learning toolkit and a drop-in replacement for a scikit-learn estimator