Last Commit: Dec. 13, 2018
Created: Oct. 9, 2018

Alpha - technology preview

T4 is alpha software. It is not yet recommended for production use.

Overview

Rethinking S3: Announcing T4, a team data hub.

A team data hub for S3

  • T4 adds search, content preview, versioning, and a Python API to any S3 bucket
  • Every file in T4 is versioned and searchable
  • T4 is for data scientists, data engineers, and data-driven teams

Use cases

  • Collaborate - get everyone on the same page by pointing them all to the same immutable data version
  • Experiment faster - blob storage is schemaless and scalable, so iterations are quick
  • Recover, roll back, and reproduce with immutable packages
  • Understand what's in S3 - plaintext and faceted search over S3

Key features

  • Browse and search any S3 bucket
  • Preview images, Jupyter notebooks, Vega visualizations - without downloading
  • Read/write Python objects to and from S3
  • Immutable versions for objects, immutable packages for collections of objects
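
To make the last two features concrete, here is a minimal sketch of the idea behind immutable object versions and immutable packages. It is not T4's API: the class VersionedStore and its methods put, get, snapshot, and load are hypothetical names, and an in-memory dictionary stands in for an S3 bucket. The sketch assumes a version can be identified by a hash of the serialized object, which is an illustrative choice and not something the text above specifies.

```python
import hashlib
import pickle


class VersionedStore:
    """Hypothetical in-memory stand-in for a versioned bucket (not T4's API)."""

    def __init__(self):
        self._blobs = {}    # version id (content hash) -> serialized bytes
        self._history = {}  # key -> list of version ids, oldest first

    def put(self, key, obj):
        """Serialize obj, store it under a new version, and return the version id."""
        data = pickle.dumps(obj)
        version = hashlib.sha256(data).hexdigest()
        self._blobs.setdefault(version, data)  # existing versions are never overwritten
        self._history.setdefault(key, []).append(version)
        return version

    def get(self, key, version=None):
        """Return the object at the given version, or the latest if none is given."""
        versions = self._history[key]
        if version is None:
            version = versions[-1]
        elif version not in versions:
            raise KeyError(f"{key!r} has no version {version!r}")
        return pickle.loads(self._blobs[version])

    def snapshot(self, keys):
        """Build a 'package': an immutable tuple of (key, version id) pairs."""
        return tuple((key, self._history[key][-1]) for key in sorted(keys))

    def load(self, package):
        """Materialize every object in a package at its pinned version."""
        return {key: self.get(key, version) for key, version in package}


if __name__ == "__main__":
    store = VersionedStore()
    store.put("params", {"learning_rate": 0.1})
    package = store.snapshot(["params"])          # pin the current version
    store.put("params", {"learning_rate": 0.01})  # later writes add a new version

    print(store.get("params"))  # latest version
    print(store.load(package))  # the pinned, earlier version
```

Because a package pins version ids and not keys, later writes to the same key do not change what the package loads. That property is what makes rollback and reproduction possible in this sketch.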

Components

  • /catalog (JavaScript) - Search, browse, and preview your data in S3
  • /api/python - Read, write, and annotate Python objects in S3

Documentation

Roadmap

Latest Releases

  • v2.9.9 - Jul. 31, 2018
  • 2.9.8 - Jul. 30, 2018
  • 2.9.7 - Jul. 11, 2018
  • 2.9.6 - Jun. 12, 2018
  • 2.9.5 - May 23, 2018