Counting 3,222 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

kylo

531

A data lake management software platform and framework for enabling scalable enterprise-class data lakes on Apache Hadoop and Spark

pygdf

716

GPU Open Analytics Initiative

baselines

5755

High-quality implementations of reinforcement learning algorithms

ray

4540

An experimental distributed execution engine

AirSim

5894

Open source simulator based on Unreal Engine for autonomous vehicles from Microsoft AI & Research

examples

6386

A repository showcasing examples of using PyTorch

PyTorch

20305

A Python package that provides Tensor computation (like NumPy) with strong GPU acceleration and deep neural networks built on a tape-based autograd system

seq2seq

4226

A general-purpose encoder-decoder framework for TensorFlow

samza

460

Apache Samza is a distributed stream processing framework

face_recognition

17853

The world's simplest facial recognition API for Python and the command line

storm

5354

Similar to how Hadoop provides a set of general primitives for doing batch processing, Storm provides a set of general primitives for doing realtime computation

lucene-solr

2010

Apache Solr is a search engine server that uses Apache Lucene

tensorboard

2661

TensorFlow's Visualization Toolkit

2048-deep-reinforcement-learning

108

A convolutional neural network trained to play 2048 using deep reinforcement learning

faiss

4819

A library for efficient similarity search and clustering of dense vectors.

altair

2858

Declarative statistical visualization library for Python

dopamine

6067

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

pandas

16524

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

PythonDataScienceHandbook

12118

Jupyter Notebooks for the Python Data Science Handbook

face_classification

3302

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a Keras CNN model and OpenCV.

nibabel

214

Python package to access a cacophony of neuro-imaging file formats

deap

2356

Distributed Evolutionary Algorithms in Python

grafana

24461

Gorgeous metric viz, dashboards & editors for Graphite, InfluxDB & Prometheus

DeepSpeech

8206

A TensorFlow implementation of Baidu's DeepSpeech architecture

faker

6829

Faker is a Python package that generates fake data for you.

kableExtra

248

Construct complex tables with knitr::kable() and the pipe operator

ParlAI

3744

A framework for training and evaluating AI models on a variety of openly available dialog datasets.

albumentations

1076

Fast image augmentation library and easy-to-use wrapper around other libraries

fluent

1025

A fully managed, data-first computation framework under development at the U.C. Berkeley RISE Lab.

nteract

3224

Desktop notebook app + packages