Counting 3,464 Big Data & Machine Learning Frameworks, Toolsets, and Examples...

kafka

10910

Kafka™ is used for building real-time data pipelines and streaming apps

spark

20353

Spark is a fast and general cluster computing system for Big Data

models

47575

Models built with TensorFlow

machine-learning-for-software-engineers

20963

A complete daily plan for studying to become a machine learning engineer.

TensorFlow-Examples

28656

TensorFlow Tutorial and Examples for beginners

twint

983

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

nsfw_data_scrapper

5949

Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier

caffe

26896

Caffe: a fast open framework for deep learning.

confluent-kafka-python

1061

Confluent's Apache Kafka Python client

hbase

2520

Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google's Bigtable

DQN-tensorflow

1869

TensorFlow implementation of Human-Level Control through Deep Reinforcement Learning

kafka-manager

6615

A tool for managing Apache Kafka.

dejaVu

5248

The Missing Web UI for Elasticsearch

arrow

3172

Arrow is a set of technologies that enable big-data systems to process and move data fast

embulk

1192

Embulk is a parallel bulk data loader that helps transfer data between various storage systems, databases, NoSQL stores and cloud services

Deep-Learning-Papers-Reading-Roadmap

21478

Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!

flink

5963

Apache Flink is an open source stream processing framework with powerful stream- and batch-processing capabilities

oryx

1566

Oryx 2: Lambda architecture on Apache Spark and Apache Kafka for real-time, large-scale machine learning

zk-smoketest

291

zk-smoketest.py provides a simple smoketest client for a ZooKeeper ensemble

tensorflow-wavenet

4207

A TensorFlow implementation of DeepMind's WaveNet paper

DeepMind-Atari-Deep-Q-Learner

1718

The original code from the DeepMind article + my tweaks

shiny

3223

Easy interactive web applications with R

elastalert

5352

Easy & Flexible Alerting With Elasticsearch

kaldi

5137

A toolkit for speech recognition written in C++, intended for use by speech recognition researchers.

infinispan

699

Infinispan is an open source data grid platform and highly scalable NoSQL cloud data store.

mit-deep-learning

3769

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

LSTM Human Activity Recognition

1959

Human activity recognition using TensorFlow and an LSTM RNN on a smartphone sensor dataset, classifying the type of movement into six categories.

dfply

457

dplyr-style piping operations for pandas dataframes

elasticsearch-sql

4482

Use SQL to query Elasticsearch

keras

37647

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.