Counting 3,222 Big Data & Machine Learning Frameworks, Toolsets, and Examples...
Suggestion? Feedback? Tweet @stkim1

text

1202

Data loaders and abstractions for text and NLP

hyperlearn

432

50%+ Faster, 50%+ less RAM usage, GPU support re-written Sklearn, Statsmodels combo with new novel algorithms.

sumy

1653

Module for automatic summarization of text documents and HTML pages.

albumentations

1031

fast image augmentation library and easy to use wrapper around other libraries

pandas

16490

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

polyaxon

1566

A platform that helps you build, manage and monitor deep learning models

ray

4510

An experimental distributed execution engine

dlib

5909

A toolkit for making real world machine learning and data analysis applications in C++

ignite

859

A high-level library to help with training neural networks in PyTorch.

gleam

1623

Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly

schematics

2053

Python Data Structures for Humans™.

openpose

9450

A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library

mycroft-core

2733

Mycroft Core, the Mycroft Artificial Intelligence platform.

flint

484

A Time Series Library for Apache Spark

tensorpack

3030

A Neural Net Training Interface on TensorFlow

Skater

583

Python Library for Model Agnostic Interpretation

mmlspark

1029

Microsoft Machine Learning for Apache Spark

sonnet

6923

TensorFlow-based neural network library

pyflux

1254

Open source time series library for Python

neurojs

4142

A javascript deep learning and reinforcement learning library.

BigDL

2664

Distributed Deep learning Library for Apache Spark

scalardb

39

A library that provides an storage abstraction and client-coordinated distributed transaction on top of Cassandra

autokeras

3414

The ultimate goal of AutoML is to allow domain experts with limited data science or machine learning background easily accessible to deep learning models.

ScikitLearn.jl

236

Julia implementation of the scikit-learn API. ScikitLearn.jl supports both models from the Julia ecosystem and those of the scikit-learn library.

ImageAI

1674

A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities

spaCy

11094

Industrial-strength Natural Language Processing (NLP) with Python and Cython

keras

34586

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.

pytorch-summary

634

Model summary in PyTorch similar to 'model.summary()' in Keras

modin

451

Unify the way you interact with your data

cortex

1067

Machine learning in Clojure