IDC released a report saying that the continuous upgrading of big data technology and services will drive the global storage market to achieve a compound annual growth rate of 53% between 2011 and 2016. For the foreseeable future, the data generated, processed, and stored by most organizations will continue to grow rapidly. Storage spending in the field of big data and analytics will surge from $379.9 million in 2011 to $6 billion in 2016.
The world is moving towards the era of "big data". The rapid growth rate of 50% of data every year puts the huge problem of "massive data processing" in front of the market. Correspondingly, in this era of "big data" change, whoever can solve the problem of massive data processing is likely to become a leader in the data era
DAS: This solution uses HBA cards to directly connect application servers and fiber optic arrays, and does not provide data sharing capabilities. If multiple applications need to share the same data, it often takes a lot of time to migrate the data, resulting in multiple copies of the same data in the environment, and the synchronization between multiple copies of data is difficult, wasting a lot of manpower and material resources. SAN: This solution uses a storage fiber optic network to connect application servers and fiber arrays, making it more flexible and scalable than DAS. However, SAN still does not have the ability to share data, and the high price of SAN systems and the compatibility of devices between different vendors are problematic. NAS: NAS systems that provide NFS or CIFS protocol access can provide a unified file system image to application servers to meet the needs of data sharing between multiple application servers. However, since traditional NAS often provides file-level storage space in the form of a single server, its limited IO bandwidth and scalability cannot meet the large number of concurrent read and write needs in the big data era. Moreover, NAS has poor support for file system locks, and multiple clients cannot read and write the same file at the same time, resulting in clients not being able to work well together. SAN NAS: This solution has the scalability of SAN and the data sharing ability of NAS to a certain extent, but there are still obvious shortcomings. All data I/O must be carried out through a single NAS server, making the I/O bandwidth of the entire system limited by the bandwidth of the NAS server, which is still difficult to meet the needs of the big data era. Cluster storage refers to the aggregation of storage space in multiple storage devices into a storage pool that can provide a unified access interface and management interface for application servers, through which applications can transparently access and utilize disks on all storage devices, and can give full play to the performance and disk utilization of storage devices. Data will be stored and read from multiple storage devices according to certain rules to achieve higher concurrent access performance. With the advent of the era of big data, the traditional IT storage architecture can no longer fully meet the needs of the big data era, and it remains to be seen whether this new storage method can lead the trend and become the real king of the big data era for the entire storage field and the era of big data. |