Counting 3,742 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

HIP

1692

HIP : Convert CUDA to Portable C++ Code

Chart.js

43021

Simple HTML5 Charts using the <canvas> tag

scrapy

32454

Scrapy, a fast high-level web crawling & scraping framework for Python

kaldi

5780

A toolkit for speech recognition written in C++, intended for use by speech recognition researchers.

PythonRobotics

5218

A Python code collection of robotics algorithms, especially for autonomous navigation.

smart_open

1236

Utils for streaming large files (S3, HDFS, gzip, bz2...)

kepler.gl

4343

A data-agnostic, high-performance web-based application for visual exploration of large-scale geolocation data sets

pinpoint

8512

Pinpoint is an open source APM (Application Performance Management) tool for large-scale distributed systems written in Java.

opencog

1705

A framework for integrated Artificial Intelligence & Artificial General Intelligence (AGI)

deck.gl

5955

WebGL based visualization layers

dvc

2501

Data Version Control - Git for data scientists

Stocktalk

638

Data collection toolkit for social media analytics

fairseq

3570

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

deepdetect

1858

Deep Learning API and Server in C++11 with Python bindings and support for Caffe, Tensorflow, XGBoost and TSNE

colorcet

304

A set of useful perceptually uniform colormaps for plotting scientific data

gym

16440

A toolkit for developing and comparing reinforcement learning algorithms.

superset

24044

Superset is a data exploration platform designed to be visual, intuitive, and interactive

papermill

1719

Parameterize, execute, and analyze notebooks

bokeh

9313

Interactive Web Plotting for Python

Keshif

458

Keshif: Data Made Explorable

serving

3327

A flexible, high-performance serving system for machine learning models

matplotlib

9098

Plotting with Python

Chinese-Word-Vectors

4527

100+ Chinese Word Vectors

vowpal_wabbit

6272

A machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

google-images-download

4692

Python Script to download hundreds of images from 'Google Images'

billboard.js

3533

Re-usable, easy interface JavaScript chart library based on D3 v4+

livelossplot

617

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

jupyter-themes

5378

Custom Jupyter Notebook Themes

DeepPavlov

2970

An open source library for building end-to-end dialog systems and training chatbots.

VoTT

1170

An electron app for building end to end Object Detection Models from Images and Videos.