Counting 2,129 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

gcn

808

Implementation of Graph Convolutional Networks in TensorFlow

CapsNet-pytorch

263

PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

CapsNet-Keras

1051

A Keras implementation of CapsNet in NIPS2017 paper "Dynamic Routing Between Capsules". Now test error < 0.4%.

Coloring-greyscale-images-in-Keras

251

Coloring B&W portraits with neural networks.

learning-spark

2092

Example code from Learning Spark book

StockInference-Spark

348

Stock inference engine using Spring XD, Apache Geode / GemFire and Spark ML Lib.

mx-maskrcnn

1209

A MXNet implementation of Mask R-CNN

presto

6852

Distributed SQL query engine for big data

cats-dogs-cortex-redux

53

Kaggle Cats & Dogs Redux with Cortex and Resnet50

color-accessibility-neural-network-deeplearnjs

131

🍃 Using a Neural Network to choose a accessible font color based on a background color.

progressive_growing_of_gans

1689

Progressive Growing of GANs for Improved Quality, Stability, and Variation

leaf

5128

Open Machine Intelligence Framework for Hackers. (GPU/CPU)

CNTK

13353

Microsoft Cognitive Toolkit (CNTK)

gora

65

Apache Gora open source framework provides an in-memory data model and persistence for big data. Gora supports persisting to column stores, key value stores, document stores and RDBMSs, and analyzing the data with extensive Apache Hadoop MapReduce support

pipeline

2190

PipelineIO: End-to-End ML and AI Platform for Real-time Spark and Tensorflow Data Pipelines

pinot

1673

A realtime distributed OLAP datastore

siddhi

345

Siddhi CEP is a lightweight, easy-to-use Open Source Complex Event Processing Engine (CEP) under Apache Software License v20

pomegranate

1206

Fast, flexible and easy to use probabilistic modelling in Python.

sarama

2351

Sarama is a Go library for Apache Kafka 0.8, 0.9, and 0.10.

torch2coreml

190

Torch7 -> CoreML

pandas-cookbook

2607

Recipes for using Python's pandas library

hue

2602

Let’s Big Data. Hue is an open source Web interface for analyzing data with Hadoop and Spark.

ml-agents

1386

Unity Machine Learning Agents

scio

810

A Scala API for Apache Beam and Google Cloud Dataflow

sumy

1194

Module for automatic summarization of text documents and HTML pages.

deeplearnjs

5104

A hardware-accelerated deep learning library for the web.

nmt

2075

This tutorial gives readers a full understanding of seq2seq models and shows how to build a competitive seq2seq model from scratch.

face_recognition

7847

The world's simplest facial recognition api for Python and the command line

ML-From-Scratch

7492

Bare bones Python implementations of some of the foundational Machine Learning models and algorithms.

tensorflow-generative-model-collections

1888

Collection of generative models in Tensorflow