Modernized livestock farming meets the growing demand for animal products such as meat, eggs, and milk, and brings new opportunities of large-scale scientific breeding and genetic research. With the help of processes on genomic selection, intelligent breeding and feeding, disease diagnosis, intelligent monitoring and management, data mining and decision support, AI technology optimizes livestock genetic breeding, improves livestock production efficiency and health, and promotes the development of the livestock industry.
Artificial intelligence (AI) technology is optimizing livestock genetic breeding in numerous ways. Some examples include:
With the help of AI technology, a livestock research institute and client of Zhengrui Technology collected and labeled a large amount of biological data, such as genomes and production capacity phenotypes, then trained deep-learning models like intelligent identification of livestock diseases and prediction of breeding value. The combination of AI and genetic technology enables the development of the livestock industry to move forward in a brand-new way and at a brand-new speed.
With AI application of big data, machine learning, multimodal modeling, and other technologies in animal husbandry, the concept of data warehousing has undergone significant changes. Before AI was able to help analyze data, the electronification of manual records in the livestock industry faced bottlenecks, lessening their value. Instead of providing useful insights, data was largely archived, becoming “cold data” that was not used to its full potential. AI technology has allowed farmers and livestock managers to re-explore this archived data, as well as further expand the scope and frequency of data collection, resulting in a massive influx of original, raw, unstructured data into a "data lake" that they can then use for making decisions in management and breeding.
The Institute of Animal Husbandry generates terabytes (TB) of data daily. Traditional data storage and management models are unable to cope with the rapid processing of large-scale and complex data, including structured data, semi-structured data, unstructured data, and binary data (images, audio, and video). AI allows the data in a data lake to be converted from raw data to targeted data for tasks such as reporting, visualization, analytics, and machine learning.
The livestock research institute has constructed an arithmetic base to process and utilize the data collected, with four big data computing servers, four inference servers, and eight storage servers as the infrastructure.
With the import of historical data, as well as the accumulation of data from daily R&D and operation, the constraint on storage and arithmetic power gradually appears. The R&D and operation process involves a huge amount of data, but the data value density is decreasing daily, and there exists a large amount of duplicate data, resulting in sparse effective data. The reading load is too large, the reading efficiency of the original storage system decreases significantly, the arithmetic power cannot be fully utilized, and the R&D efficiency decreases. To improve the data quality, a large amount of data preprocessing work is also required, which introduces more read and write operations and occupies many resources.
For smaller data lakes, HDDs provide good read throughput, with write throughput limited by network bandwidth. However, as data sizes grow, cross-node access to distributed storage consumes a lot of network resources, causing efficiency issues even with high-performance 100G Ethernet or InfiniBand networks with increasing overhead for high-speed networks.
Even more significant than network costs, though, is idle arithmetic power. With the rapid development of AI arithmetic such as GPUs and TPUs, greater data throughput is required to meet computational demand, which puts higher demands on the performance of the storage system. Considering how expensive AI arithmetic builds are, idle arithmetic waiting for data ingest is a huge waste of resources.
Switching to SSDs as a storage medium is a logical step to cope with the high demand for storage performance. But this shift needs to combat several problems:
One practical answer to these storage problems is Solidigm 192-layer 3D NAND. It delivers an industry-leading density of 18.6GB/mm², which is substantially ahead of volume densities of competing products. Solidigm delivers unrivaled high storage density SSDs that address the industry's high-density, power efficiency, and high-throughput sore points.
Solidigm D5-P5336 is part of Solidigm's fourth generation of QLC SSDs for the data center, delivering an industry-leading combination of high capacity (up to 122.88TB*) and read-optimized performance with support for high-throughput read and data-intensive workloads. Its architecture is designed to efficiently accelerate and scale increasingly large data sets in widely deployed read-intensive workloads while increasing storage density, lowering total cost and enabling a more sustainable storage infrastructure than TLC SSDs and HDD-based solutions.
It is now technically and economically feasible to build all-flash servers based on single-tier storage with the entry of high-capacity, high-performance QLC SSDs from Solidigm into the market. Single-tier storage designs significantly reduce the technical difficulty of developing and deploying storage servers and provide more consistent, predictable performance.
Storage servers deploying high-density QLC SSDs achieve petabyte-scale single-node storage capacities that are beyond the reach of mechanical HDDs. The single-tier storage media design also saves the capacity, space, and energy consumption occupied by cache disks, further consolidating each node's advantage in storage density.
High density storage nodes also save rack space, energy consumption, and network port overhead. If the user's target storage capacity can be realized in a single node, it will greatly reduce the difficulty of deployment and operation and maintenance. Solidigm D5-P5336 122.88TB in the U.2 form factor can already achieve a capacity of up to 4PB in a standard 1U server, which can help to move enterprise data completely into all-flash storage, giving them the opportunity to exploit the value of dormant, or “cold,” data.
Zhengrui Technology used Solidigm high-capacity QLC NVMe SSDs to create a set of animal husbandry bio-genetic data storage solutions. The high-density, high-reliability, and scalable storage platform provided both efficiency and cost-effectiveness.
Figure 1. Zhengrui Tech server capable of holding 24 Solidigm SSDs for a maximum possible capacity of 700TB of data storage.
Solidigm, together with Zhengrui Technology, was recognized by customers for this tailor-made solution, which helps them solve multiple problems: Meeting the data volume and performance need with high-capacity QLC single disk
In this typical AI deployment, data storage nodes are prone to performance hotspots due to competition for resources, making computation units stall waiting for data. Solidigm QLC drives can meet the need of petabyte-scale data storage maintaining low-latency, stable and reliable performance.
When switching from 18TB HDDs +TLC to 30.72TB Solidigm D5-P5336 SSDs, rack space was reduced by 79%.2 This in effect reduces the footprint of storage cabinets and rack spaces and gives more flexibility to scale up compute nodes within a particular space budget.
This greatly reduced number of storage devices deployed, reducing the power consumption and cooling energy need of overall deployment, thereby impacting the construction and operating costs positively. The power consumption with hybrid solution (18TB HDDs + TLC ) was 57,600W, compared to 12,000W with All Flash Solution (Solidigm D5-P5336 SSDs) — a power reduction of 79%.
Even if a mixed-flash storage model is adopted, it still must face a series of accompanying problems such as large space occupation and complex management scheduling logic.
Zhengrui Tech + Solidigm solution requires only 2U of space to achieve 700TB class storage. To further increase storage density, options include adopting 61.44TB or 122.88TB single disk, or further introduction of EDSFF form factor support.
Ultra-high density all-flash storage provides stable and efficient IO support for computing centers, improves the overall computing throughput rate, and ensures the output of results. With Solidigm QLC SSDs, customers simplify the operation of the server room and reduce construction and management costs, reducing spent and increasing efficiency.
*Solidigm D5-P5336 122.88TB availability Q1’25.
All product plans, roadmaps, specifications, and product descriptions are subject to change without notice.
Nothing herein is intended to create any express or implied warranty, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, or any warranty arising from course of performance, course of dealing, or usage in trade.
The products described in this document may contain design defects or errors known as “errata,” which may cause the product to deviate from published specifications. Current characterized errata are available on request.
Contact your Solidigm representative or your distributor to obtain the latest specifications before placing your product order.
For copies of this document, documents that are referenced within, or other Solidigm literature, please contact your Solidigm representative.
All products, computer systems, dates, and figures specified are preliminary based on current expectations, and are subject to change without notice.
© Solidigm. “Solidigm” is a trademark of SK hynix NAND Product Solutions Corp (d/b/a Solidigm). Other names and brands may be claimed as the property of others.