Kafka™ is used for building real-time data pipelines and streaming apps
Spark is a fast and general cluster computing system for Big Data
Models built with TensorFlow
A complete daily plan for studying to become a machine learning engineer.
TensorFlow Tutorial and Examples for beginners
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier
Caffe: a fast open framework for deep learning.
Confluent's Apache Kafka Python client
Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google's Bigtable
TensorFlow implementation of Human-Level Control through Deep Reinforcement Learning
A tool for managing Apache Kafka.
The Missing Web UI for Elasticsearch
Arrow is a set of technologies that enable big-data systems to process and move data fast
Embulk is a parallel bulk data loader that helps transfer data between various storages, databases, NoSQL and cloud services
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!
Apache Flink is an open source stream processing framework with powerful stream- and batch-processing capabilities
Oryx 2: Lambda architecture on Apache Spark and Apache Kafka for real-time, large-scale machine learning
zk-smoketest.py provides a simple smoketest client for a ZooKeeper ensemble
A TensorFlow implementation of DeepMind's WaveNet paper
The original code from the DeepMind article + my tweaks
Easy interactive web applications with R
Easy & Flexible Alerting With Elasticsearch
A toolkit for speech recognition written in C++, intended for use by speech recognition researchers.
Infinispan is an open source data grid platform and highly scalable NoSQL cloud data store.
Tutorials, assignments, and competitions for MIT Deep Learning related courses.
Human activity recognition using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six categories.
dplyr-style piping operations for pandas dataframes
Use SQL to query Elasticsearch
Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.