“Phison aiDAPTIV+ is Cost-Effective With Large Models to Enable Generative AI
Analyst view: Using a fast SSD as a tier to expand usable RAM is not a new concept; most operating systems use swap files to provide more virtual memory than the installed RAM. Phison aiDAPTIV+ is a set of hardware and software that applies the same concept to GPU memory, making AI with large models cost-effective. The high-bandwidth memory in a GPU is a significant part of the GPU’s price, and as AI models become larger, the need for more memory grows with them. The fastest option is still to keep the entire model and data in GPU memory, which requires high-end GPUs for large models and often means clustering multiple GPUs to accommodate the whole model. Phison aiDAPTIV+ is not intended to address use cases that require the highest possible training throughput or the lowest inference latency, where the performance justifies the cost of the GPUs. Instead, it delivers cost-effective results in use cases where lower throughput or longer response latency is acceptable in exchange for a significant cost saving. For example, for a 66% cost reduction, a weekly fine-tuning task might be completed overnight rather than in two hours. The aiDAPTIV+ solution comprises the aiDAPTIVcache hardware and aiDAPTIVlink drivers, with the aiDAPTIVProSuite as an optional AI software development environment.” - futurumgroup.com
Source: futurumgroup.com
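The tiering idea described in the quote can be illustrated with a minimal, purely conceptual sketch. This is not the aiDAPTIVlink or aiDAPTIVcache interface (which is not described in the quote); the `OffloadedLayer` class, `cache_dir` path, and `forward` routine below are hypothetical. The sketch only shows the general pattern: keep one layer's weights resident at a time and page the rest in from an SSD-backed file, trading latency for a smaller resident memory footprint.

```python
# Conceptual sketch of SSD-backed weight tiering (hypothetical names;
# not the aiDAPTIV+ API). Plain RAM stands in for GPU memory here.
import os
import tempfile

import numpy as np


class OffloadedLayer:
    """Stores a layer's weights in an SSD-backed memory-mapped file and
    maps them back in only when the layer is actually used."""

    def __init__(self, shape, cache_dir, index):
        self.path = os.path.join(cache_dir, f"layer_{index}.bin")
        self.shape = shape
        # Write the weights once to the SSD tier, then drop the RAM copy.
        weights = np.random.rand(*shape).astype(np.float32)
        mm = np.memmap(self.path, dtype=np.float32, mode="w+", shape=shape)
        mm[:] = weights
        mm.flush()
        del mm  # only the on-disk copy remains

    def load(self):
        # Page the weights back in from the SSD tier on demand.
        return np.memmap(self.path, dtype=np.float32, mode="r", shape=self.shape)


def forward(x, layers):
    # Only the current layer's weights are resident; the rest stay on SSD.
    for layer in layers:
        w = layer.load()
        x = np.tanh(x @ w)
    return x


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as cache_dir:
        layers = [OffloadedLayer((256, 256), cache_dir, i) for i in range(4)]
        out = forward(np.random.rand(1, 256).astype(np.float32), layers)
        print(out.shape)  # (1, 256)
```

In a real deployment, residency decisions, prefetching, and host-to-GPU transfers would be handled by the vendor's middleware; the sketch is only meant to show why an SSD tier lets the working set exceed installed memory at the cost of extra latency, which is the throughput/latency trade-off the analyst describes.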