Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


High Performance Parallel storage for Virtualized Cloud-Based Infrastructure




What is Cloud Edition for Lustre?


Cloud Edition for Lustre (CE) is a software-defined storage cluster that runs on scalable cloud infrastructures. Cloud Edition provides a high performance parallel filesystem using virtualized resources. The full package includes CentOS, Lustre, Ganglia, and Lustre Monitoring Tool (LMT).


Cloud Edition is intended to be used as the working filesystem for a HPC or other IO intensive workloads. It is not intended to be used as long term storage or as an alternative to cloud storage options such as S3. We recommend that S3 or "cold-storage" be used for long term data storage, and Cloud Edition be used whenever a high-performance shared filesystem is required. 



Image Added





Cloud Edition on Amazon Web Services


Amazon Web Services is a collection of remote computing services that make up Amazon's cloud computing infrastructure. Cloud Edition uses Amazon Machine Images (AMI) and Elastic Compute Storage (EC2) to provide a parallel and highly scalable storage cluster on AWS.


Cloud Edition on Microsoft Azure


Azure is Microsoft’s cloud computing platform, a growing collection of integrated services—analytics, computing, database, mobile, networking, storage, and web—for moving faster, achieving more, and saving money.




Cloud Edition on Google Cloud Platform


Google Cloud Platform is a collection of computing resources consist of virtual machines (VMs), storage, databases, networking, Infrastructure as Service (SaaS), Platform as a Service (PaaS), and Software as a Service (SaaS).

When combined with HPC and Technical computing applications running on AWS, Cloud Edition can improve storage performance and increase scalability by eliminating storage bottlenecks.


[Click Here for More Information]


Cloud Edition combines with HPC and Technical computing applications running on Azure to improve storage performance and increase scalability by eliminating storage bottlenecks.


[Click Here for More Information]


Cloud Edition on Google Cloud Platform combines with HPC and Technical computing applications to improve user experience, storage performance and increase scalability by eliminating storage bottlenecks.


[Click Here for More Information]



Support Details

Cloud Edition for Lustre software is supported by the Lustre experts at Whamcloud. Product support includes the latest software updates, patches, and fixes to ensure a stable, flexible, and robust storage environment that leverages the benefits of cloud-based infrastructure.


To create a new support ticket, please setup an account in our JIRA ticketing system and file your ticket in the AWSP project: https://jira.whamcloud.com/secure/Dashboard.jspa and our support team will resolve your issue promptly. Please provide your Amazon ID and a description of the issue when filing a ticket.


For other questions please contact us at info@whamcloud.com

Scope:

  • The purpose of this page is to provide a process for submitting requests to Product Management to determine next steps, request inclusion in the product, or escalate for further approval. Requests will be submitted in a standard format (illustrated below) to the Product Manager, at which time they will be added to the agenda of the Product Management Meeting. Please make sure to include any stakeholders that will be impacted, as well as those necessary for reaching a clear decision.

Template:

 

Title

Improved Readability of OST Rebalance Chart
TypeFeature Request (Feature, Bug, Proof of Concept)
DescriptionThe OST Balance chart in IML becomes difficult to read when the number of OST's exceeds 16. Deployments exceeding 16 OST's are officially supported (reference doc) and (reference customers) have requested this functionality. The chart will need to be redesigned to facilitate readability for deployments up to and exceeding 128 OST's.
PurposeWhat benefit does this feature provide?
Next StepsWhat action is needed?
PriorityMedium (High, Medium, Low)
CreatorMicah Bhakti
Required AttendeesBryon Neitzel, Mark Rogers
StatusOpen, On Hold, Closed

 

This template will be submitted to the Product Manger via email (micah@intel.com).

 

Product Management Meeting Agenda:

TBA

Product Management Current Initiatives:

Table of Contents:

NameDescriptionPriority
SAM (Service Assurance Management) Integration for IMLMonitoring features for IML that tie into Intel PlatformLow
Lustre Client for Xeon PhiAllows Xeon Phi to connect to lustre storage cluster

Med

Differentiated Storage ServicesAllows SSD Caching based on defined client profiles, caching done at MDS/OSSHigh
IML Installer ImprovementsAdd checks to validate install environment, improve robustness, and output functional error messages when issues are encountered 
   
   

 

 

Title

SAM (Service Assurance Management) integration for IML
TypeProof of Concept
DescriptionService Assurance Management is designed to provide QoS monitoring, metering, and performance analysis of nodes on a number of metrics (CPU, memory, IO) using a common platform node agent when used with IA platforms.
PurposeProvides increased feature set when used with Intel hardware, as well as giving more data about bottlenecks and server usage.
Next StepsAllocate a resource to work with the SAM team to setup a SAM scheduler and Node Collectors on an IEEL system and test basic functionality. Intended outcome would be to gather information about the usefulness of the data and potention benefits to IEEL product line.
PriorityLow
CreatorMicah Bhakti
Required AttendeesMicah Bhakti, Mrittika Ganguli (SAM team), Dan Ferber (or SE team resource)
StatusOpen

Title

Xeon Phi Supported Lustre Client
TypeFeature
DescriptionXeon Phi uses an embeded Linux OS on the card. We would like to add support for the Phi to access the Lustre FS based on Phi addoption in HPC as well as the Enterprise Technical Computing potential. To do this we will either need to support the Lustre client on the Phi Linux OS, or on the host and allow client Phi's to access the FS through the host.
PurposeProvides Lustre FS access to Xeon Phi co-processors.
Next Steps

Lustre 2.4 working on Phi with Ethernet (MPSS 2.1), IB support waiting on new OFED stack (WW39). TT-1346 will enable automatic Lustre build with RPM's, under review now.

Performance is still not where we want through virtual ethernet.

PriorityMed
CreatorMicah Bhakti
Required AttendeesMicah Bhakti, Dmitry Eremin, Oleg Droken, Peter Jones
StatusOpen

Title

Differentiated Storage Services
TypeFeature
DescriptionDSS runs as a service on the storage server (MDS or OSS) allocating data to tiered storage based on metadata. Profiles are created and client-specific, which allow data to be sorted to SSD or HDD storage based on the performance-optimized profiles.
PurposeImproves lustre performance for critical workloads, as well as potentially improving small-file IO.
Next Steps

DSS needs to be backported to current RHEL/CentOS kernel, followed by benchmarking and evaluation of test profiles for specific workloads. Following this we can define the impact and required engineering to integrate into the product.

PriorityHigh
CreatorMicah Bhakti
Required AttendeesMicah Bhakti, Michael Mesnier, Christian Black, Mikhail Pershin
StatusOpen

...

Title

...

 

...