Community Office Hours

The Alluxio POSIX API enables data engineers to access any distributed file system or cloud storage as if accessing a local file system with an added performance improvement. This reduces the effort and complexity for data engineers to run their machine learning or legacy workloads on new data storage without data migration or data duplication.


In this Office Hour you'll learn about:

  • How to install and setup the Alluxio POSIX API to enable data access to disparate storage systems, including AWS S3 
  • Tensorflow model training using Alluxio POSIX API to read data from S3
  • Open Session for discussion on any topics such as solving the separation of compute and storage problem, unifying multiple storage systems, and more


Interested in learning more? 


Thanks for your interest! This event has concluded. 

Running Machine Learning Workloads with Tensorflow + Alluxio + AWS S3

Speaker: Gene Pang

Gene Pang is a founding engineer and PMC Maintainer of Alluxio. Previously, he worked at Google. Gene earned his PhD from the AMPLab at UC Berkeley, working on distributed database systems, and holds an MS from Stanford University and a BS from Cornell University.

Founding Engineer at Alluxio

Bin Fan is the founding engineer of Alluxio, Inc. and the PMC member of Alluxio open source project. Prior to Alluxio, he worked for Google where he won the Technical Infrastructure Award. Bin received his Ph.D. in Computer Science from Carnegie Mellon University working on distributed systems.

Evangelist and Founding Member at Alluxio

Speaker: Bin Fan

...open-source virtual distributed file system that provides a unified data access layer for hybrid and multi cloud deployments. 


Alluxio resides between storage systems such as Amazon S3 or Apache HDFS and computations frameworks and applications such as Apache Spark or Presto.


With Alluxio, your data is centralized and applications have a single common interface and namespace for data access.

Alluxio is an...