Alluxio Improves Interface Support to Accelerate and Simplify Onboarding of Even More Data Driven Applications
Now available, Alluxio 2.5 focuses on POSIX and S3 interface access to improve performance and compatibility with popular interfaces for analytics and Machine Learning data pipelines
Alluxio, the developer of open source cloud data orchestration software, today announced the immediate availability of version 2.5 of its Data Orchestration Platform featuring access via POSIX and S3 interfaces enabling data platform teams to accelerate data pipelines for both business intelligence and model training using frameworks such as Tensorflow and PyTorch.
Marketing Technology News: Global Study Reveals Need For Greater Understanding Of Data Security Responsibility As Cloud…
“For modern AI / ML data pipelines, the preferred application programming interface (API) for storage access is not HDFS,” said Haoyuan Li, Founder and CEO, Alluxio. “With this release, Alluxio significantly improves support for model training pipelines with an accelerated POSIX API for unified storage access, performance and ease of management.”
“With Alluxio 2.5, we have made major strides in improving machine learning and AI support on Kubernetes. Enhancements to the FUSE interface for Tensorflow access have dramatically improved the model training experience,” said Yang Che, Sr. Staff Engineer, Alibaba Cloud.
“The Alluxio Data Orchestration System slashed query run times by half when running analytics jobs like Spark in Tencent Cloud, using our EMR platform to allow for greater I/O performance, and provides the ability to provision elastic compute with significantly reduced network resources,” said Long Chen, Vice Director of R&D, Center of Big Data Product, Tencent Cloud.
Marketing Technology News: WooRank Announces Launch Of Ground-breaking Page-Level SEO Analysis Tool
Alluxio 2.5 also improves compatibility with the S3 API. Together S3, HDFS and POSIX make up a majority of the APIs preferred by data driven applications and data management tools. Administrators now have the flexibility to manage the Alluxio file system namespace through a standard object storage console. This flexibility makes it even simpler to integrate Alluxio into existing large scale data pipelines.
New and improved storage connectors on Amazon Web Services, Azure Cloud and Google Cloud Platform improve the onboarding experience with seamless authentication and improved performance. Data lakes on all major cloud platforms can now easily integrate Alluxio to orchestrate data management. A new Quickstart guide using Data Orchestration Hub for single, hybrid or multi-cloud data orchestration is also included, along with support for the Hub on Kubernetes.
Kubernetes is a popular deployment choice for Alluxio with both data analytics and machine learning pipelines across on-premise and cloud environments. With the dynamic nature of containerized environments, log collection is a challenge when containers are frequently killed or restarted. Now Alluxio logs can be aggregated on a centralized collection server in Kubernetes.
Marketing Technology News: Contentful Unveils New Features and Partner Apps at Second Annual Blueprints Conference