Counting 3,541 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

ludwig

2853

Ludwig is a toolbox built on top of TensorFlow that allows to train and test deep learning models without the need to write code.

faker

7329

Faker is a Python package that generates fake data for you.

incubator-airflow

11083

Airflow is a platform to programmatically author, schedule and monitor workflows.

VOTT

1000

An electron app for building end to end Object Detection Models from Sample Videos.

hanabi-learning-environment

269

A research platform for Hanabi experiments

datashader

1797

Turns even the largest data into images, accurately.

scikit-plot

1386

An intuitive library to add plotting functionality to scikit-learn objects.

airflow-scheduler-failover-controller

81

A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability

visdom

5568

A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.

metabase

13530

The simplest, fastest way to get business intelligence and analytics to everyone in your company

AirSim

6983

Open source simulator based on Unreal Engine for autonomous vehicles from Microsoft AI & Research

simpledet

1045

A Simple and Versatile Framework for Object Detection and Instance Recognition

hrbrthemes

490

Opinionated, typographic-centric ggplot2 themes and theme components

matplotlib

8724

Plotting with Python

tsfresh

3534

Automatic extraction of relevant features from time series:

bokeh

8992

Interactive Web Plotting for Python

scrapy

31501

Scrapy, a fast high-level web crawling & scraping framework for Python

grafana

26815

Gorgeous metric viz, dashboards & editors for Graphite, InfluxDB & Prometheus

mirage

1754

GUI for Elasticsearch Queries

nsfw_data_scrapper

8170

Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier

textql

7995

Execute SQL against structured text like CSV or TSV

superset

23152

Superset is a data exploration platform designed to be visual, intuitive, and interactive

bqplot

2215

Plotting library for IPython/Jupyter Notebooks

incubator-predictionio

11663

PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray

Chart.js

41836

Simple HTML5 Charts using the <canvas> tag

hollow

784

Hollow is a java library and comprehensive toolset for harnessing small to moderately sized in-memory datasets which are disseminated from a single producer to many consumers for read-only access.

xcessiv

1136

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

d3

82483

Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:

DeepVideoAnalytics

2472

Analyze videos & images, perform detections, index frames & detected objects, search by examples

bert-as-service

2317

This repo uses BERT as the sentence encoder and hosts it as a service via ZeroMQ, allowing you to map sentences into fixed-length representations in just two lines of code.