Counting 2,899 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

decaNLP

473

The Natural Language Decathlon: A Multitask Challenge for NLP

models

37224

Models built with TensorFlow

datacollector

501

StreamSets DataCollector - Continuous big data ingest infrastructure

keras

30837

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.

categorical_encoding

509

A library of sklearn compatible categorical variable encoders

DALI

305

A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications

scikit-learn

28928

scikit-learn: machine learning in Python

r4ds

1340

R for data science

dbx

59

A fast, easy-to-use database library for R. Supports Postgres, MySQL, SQLite, and more.

beaker-notebook

1737

A code notebook that allows you to analyze, visualize, and document data using multiple programming languages

DensePose

2142

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

infinispan

662

Infinispan is an open source data grid platform and highly scalable NoSQL cloud data store.

zero-shot-gcn

192

A re-implementation of the zero-shot classification in ImageNet in the paper Zero-shot Recognition via Semantic Embeddings and Knowledge Graphs.

SLM-Lab

113

Modular Deep Reinforcement Learning framework in PyTorch.

kibana

9543

:bar_chart: Kibana analytics and search dashboard for Elasticsearch

finetune-transformer-lm

361

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

tpot

4195

A Python tool that automatically creates and optimizes machine learning pipelines using genetic programming.

boxx

148

Tool-box for efficient build and debug in Python. Especially for Scientific Computing and Computer Vision.

snowplow

4120

Enterprise-strength web, mobile and event analytics, powered by Hadoop, Kafka, Kinesis, Redshift and Elasticsearch

elastalert

4573

Easy & Flexible Alerting With ElasticSearch

nifi

991

Apache NiFi is an easy to use, powerful, and reliable system to process and distribute data

zipline

7187

Zipline, a Pythonic Algorithmic Trading Library

deeplearning4j

9157

Deep Learning for Java, Scala & Clojure on Hadoop & Spark With GPUs - From Skymind

knox

59

The Knox Gateway is able to provide valuable functionality to aid in the control, integration, monitoring and automation of critical administrative and analytical needs of the enterprise

TFaaS

24

TensorFlow as a Service, a general purpose framework to serve TF models.

tensorflow-wavenet

3722

A TensorFlow implementation of DeepMind's WaveNet paper

redash

9397

Make Your Company Data Driven. Connect to any data source, easily visualize and share your data.

TensorFlow-Examples

23091

TensorFlow Tutorial and Examples for beginners

kafka-manager

5437

A tool for managing Apache Kafka.

fastText

14562

Library for fast text representation and classification.