
Last commit: Feb. 23, 2019
Created: Dec. 21, 2012


What is Alluxio

Alluxio (formerly known as Tachyon) is a virtual distributed storage system. It bridges the gap between computation frameworks and storage systems, enabling computation applications to connect to numerous storage systems through a common interface. Read more in the Alluxio Overview.

The Alluxio project originated from a research project called Tachyon at AMPLab, UC Berkeley, which was the data layer of the Berkeley Data Analytics Stack (BDAS). For more details, please refer to Haoyuan Li's PhD dissertation Alluxio: A Virtual Distributed File System.

Who Uses Alluxio

Alluxio is used in production to manage petabytes of data at many leading companies, with the largest deployment exceeding 1,300 nodes. Find more use cases at Powered by Alluxio.

Download Alluxio

Binary download

Prebuilt binaries are available for download at https://www.alluxio.org/download.

Docker

Download and start an Alluxio master and a worker. More details can be found in the documentation.

# launch a master
$ docker run -d --net=host \
    -v /mnt/data:/opt/alluxio/underFSStorage \
    alluxio/alluxio master
# launch a worker
$ docker run -d --net=host --shm-size=1G \
    -e ALLUXIO_WORKER_MEMORY_SIZE=1G \
    -v /mnt/data:/opt/alluxio/underFSStorage \
    -e ALLUXIO_MASTER_HOSTNAME=localhost \
    alluxio/alluxio worker

macOS Homebrew

$ brew install alluxio

Quick Start

Please follow the Guide to Get Started to run a simple example with Alluxio.

Report a Bug

To report bugs, suggest improvements, or create new feature requests, please open a GitHub Issue. Our previous Alluxio JIRA system was deprecated in December 2018.

Join the Community

Please use the following to reach members of the community:

Depend on Alluxio

For Alluxio versions 1.4 or earlier, use the alluxio-core-client artifact.

For Alluxio versions 1.5 or later, Alluxio provides several different client artifacts. The Alluxio file system interface provided by the alluxio-core-client-fs artifact is recommended for the best performance and access to Alluxio-specific functionality. If you want to use other interfaces, include the appropriate client artifact. For example, alluxio-core-client-hdfs provides a client implementing HDFS's file system API.

Apache Maven

<dependency>
  <groupId>org.alluxio</groupId>
  <artifactId>alluxio-core-client-fs</artifactId>
  <version>1.8.1</version>
</dependency>
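
The client artifact paragraph earlier in this section names alluxio-core-client-hdfs as the artifact to include for the HDFS-compatible interface. A minimal sketch of that dependency follows. It assumes the artifact is published under the same org.alluxio group and at the same 1.8.1 version as the file system client shown above; check the coordinates for the release you actually target.

```xml
<!-- Assumption: same groupId and version as alluxio-core-client-fs above -->
<dependency>
  <groupId>org.alluxio</groupId>
  <artifactId>alluxio-core-client-hdfs</artifactId>
  <version>1.8.1</version>
</dependency>
```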

SBT

libraryDependencies += "org.alluxio" % "alluxio-core-client-fs" % "1.8.1"

Contributing

Contributions via GitHub pull requests are gladly accepted from their original author. Along with any pull requests, please state that the contribution is your original work and that you license the work to the project under the project's open source license. Whether or not you state this explicitly, by submitting any copyrighted material via pull request, email, or other means you agree to license the material under the project's open source license and warrant that you have the legal authority to do so. For a more detailed step-by-step guide, please read how to contribute to Alluxio. New contributors should start with the two new contributor tasks.

Useful Links

Latest Releases

v2.0.0-preview-RC1 (Feb. 8, 2019)
v2.0.0-preview-RC1 (Jan. 30, 2019)
v1.8.1 (Sep. 27, 2018)
v1.8.0 (Jul. 9, 2018)
v1.8.0 (Jul. 2, 2018)