Counting 3,663 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

elassandra

1136

Elassandra = cassandra + elasticsearch

zipline

8491

Zipline, a Pythonic Algorithmic Trading Library

hyperopt

3173

Distributed Asynchronous Hyperparameter Optimization in Python

dopamine

7633

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

dask

4382

Versatile parallel programming with task scheduling

ipyparallel

1274

Interactive Parallel Computing in Python

vid2vid

6156

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

mlr

1196

mlr: Machine Learning in R

stanford-cs-229-machine-learning

7200

VIP cheatsheets for Stanford's CS 229 Machine Learning

100-Days-Of-ML-Code

21001

100 Days of ML Coding

mujoco-py

866

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

r4ds

1816

R for data science

tensorflow

123617

An Open Source Machine Learning Framework for Everyone

models

50182

Models built with TensorFlow

keras

39376

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.

xgboost

15224

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

incubator-systemml

738

SystemML is a flexible, scalable machine learning system

systemml

738

Apache SystemML provides an optimal workplace for machine learning using big data

iodide

846

A frictionlessly portable interface for literate scientific computing in the browser

cascading-flink

49

Cascading on Apache Flink™

doccano

672

Open source text annotation tool for machine learning practitioner.

wavenet

899

Keras WaveNet implementation

py-faster-rcnn

5517

Faster R-CNN Python implementation

autokeras

4870

The ultimate goal of AutoML is to allow domain experts with limited data science or machine learning background easily accessible to deep learning models.

thrift

6015

The Apache Thrift software framework, for scalable cross-language services development, combines a software stack with a code generation engine to build services that work efficiently and seamlessly between multiple languages

polyrnn-pp-pytorch

470

PyTorch training/tool code for Polygon-RNN++ (CVPR 2018)

pva-faster-rcnn

626

Demo code for PVANet

dragonfly

315

An open source python library for scalable Bayesian optimisation.

fastText

17819

Library for fast text representation and classification.

LARK

714

LAnguage Representations Kit