Successfully establishing data lakes in the cloud opens up a world of possibilities for data analysis. Users need an easy-to-use, robust, high-performance platform with the precise feature set required to create and support data lake environments, no matter which hyperscaler they use.

 

Azure NetApp® Files and Cloud Volumes Service are fully managed cloud storage services available on Microsoft Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP). These cloud storage services use NetApp technology, together with additional cloud services,  to provide an on-demand storage offering for allocating and managing highly available and scalable file shares in the cloud.

 

Azure NetApp Files and Cloud Volumes Service are high-performance platforms for deploying NFS and SMB file services, offering unprecedented levels of I/O throughput. Users can control the performance by configuring the service level, which can be set to Standard, Premium, or Extreme. The Extreme service level provides up to 128MB/s of throughput per TB. In addition, you can change service levels on the fly, which is very difficult, if not impossible, with other storage services.

 

Cloud Volumes Service performance levels:

  • Up to 16MB of throughput per TB
  • Up to 64MB of throughput per TB
  • Up to 128MB of throughput per TB

With Azure NetApp Files and Cloud Volumes Service, NetApp brings to bear decades of experience in building enterprise NAS solutions. This means that these services easily scale to meet the most demanding conditions, providing concurrent access to hundreds of client hosts and applications. Scalability to this degree is a difficult requirement for large-scale environments, and it’s impossible to achieve with custom-built NAS solutions.

 

One of the biggest wins for a data lake deployment is to leverage data synchronization capabilities from NetApp. Data from multiple external data sources with a range of different data types can be synchronized to a data lake and kept synchronized through scheduled, incremental updates, whether the source data resides on the premises or is even hosted in multiple cloud platforms. For example, if on-premises data is to be consolidated in AWS, the NetApp Cloud Volumes Service is able to connect and transfer the data securely and efficiently. Data can also be synchronized from the cloud to other environments. This is useful when the results of processing an analytics workload must be sent to another location.

 

To support data preprocessing, the service offers sophisticated features for creating NetApp Snapshot™ copies and cloning storage volumes. Using NetApp technology, clones of a dataset can be created very quickly. These clones can be easily dropped and re-created to refresh them with up-to-date data. This is possible for a source volume of any size, and gives data engineers the power to work with data across as many concurrent environments as they require.

Experience Azure NetApp Files and Cloud Volumes Service

Azure NetApp Files and Cloud Volumes Service provide rich features that help you to better manage your data with consistently high performance. Register now to access cloud-native file services on the cloud platform of your choice: Azure, AWS, or Google Cloud. Get started and learn how easy it is to manage, protect, and restore your file data.

Jeff Whitaker

Jeff Whitaker is the Sr. Manager, Cloud Solutions Marketing at NetApp. When he is not carving out new horizons in product marketing for Azure NetApp Files for Microsoft Azure, he can be found racing motocross up and down the west coast. You can reach Jeff directly at jeff.whitaker@netapp.com