Counting 3,742 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

snips-nlu

2541

Snips Python library to extract meaning from text

skorch

1989

A scikit-learn compatible neural network library that wraps pytorch

swift

3876

Swift for TensorFlow documentation repository.

thrift

6158

The Apache Thrift software framework, for scalable cross-language services development, combines a software stack with a code generation engine to build services that work efficiently and seamlessly between multiple languages

fastText

18080

Library for fast text representation and classification.

keras

40390

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.

xlearn

2275

High Performance, Easy-to-use, and Scalable Machine Learning Package

incubator-mxnet

16727

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

SeetaFaceEngine

3572

An open source C++ face recognition engine, which can run on CPU with no third-party dependence

sarama

4123

Sarama is a Go library for Apache Kafka 0.8, 0.9, and 0.10.

hyperopt

3288

Distributed Asynchronous Hyperparameter Optimization in Python

pymc3

4125

Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano

FiloDB

1101

Distributed. Columnar. Versioned. Streaming. SQL.

incubator-predictionio

11751

PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.

deeplearning4j

10656

Deep Learning for Java, Scala & Clojure on Hadoop & Spark With GPUs - From Skymind

tnt

891

A framework for torch which provides a set of abstractions aiming at encouraging code re-use as well as encouraging modular programming

tensorlayer

4842

A Deep Learning and Reinforcement Learning Library for Researchers and Engineers

xgboost

15649

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

Open3D

1431

A Modern Library for 3D Data Processing

imgaug

5637

Image augmentation for machine learning experiments.

datasketch

832

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++

pretrained-models.pytorch

4007

Pretrained ConvNets for pytorch: ResNeXt101, ResNet152, InceptionV4, InceptionResnetV2, etc.

armnn

326

Arm NN ML software

Augmentor

3053

An image augmentation library in Python for machine learning that allows for finer grained control over augmentation, and implements as many augmentation procedures as possible

cvxpy

1740

A Python-embedded modeling language for convex optimization problems.

librdkafka

2957

The Apache Kafka C/C++ library

pytorch-summary

1113

Model summary in PyTorch similar to 'model.summary()' in Keras

PyTorch-BigGraph

1222

Software used for generating embeddings from large-scale graph-structured data.

brain.js

9489

Neural networks in JavaScript

hub

1451

A library for transfer learning by reusing parts of TensorFlow models.