{"id":85187,"date":"2025-04-08T08:00:34","date_gmt":"2025-04-08T15:00:34","guid":{"rendered":"https:\/\/phisonblog.com\/?p=85187"},"modified":"2025-04-15T21:25:30","modified_gmt":"2025-04-16T04:25:30","slug":"rethinking-computational-storage-unlock-the-processing-power-of-ssds","status":"publish","type":"post","link":"https:\/\/phisonblog.com\/ko\/rethinking-computational-storage-unlock-the-processing-power-of-ssds\/","title":{"rendered":"\ucef4\ud4e8\ud305 \uc2a4\ud1a0\ub9ac\uc9c0 \uc7ac\uace0: SSD\uc758 \ucc98\ub9ac \ub2a5\ub825 \ud65c\uc6a9"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;0px||||false|false&#8221; custom_padding=&#8221;0px||||false|false&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_row _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; width=&#8221;100%&#8221; max_width=&#8221;100%&#8221; custom_margin=&#8221;||||false|false&#8221; custom_padding=&#8221;0px||||false|false&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.16&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_text _builder_version=&#8221;4.27.4&#8243; _module_preset=&#8221;default&#8221; header_2_line_height=&#8221;1.7em&#8221; header_3_line_height=&#8221;1.7em&#8221; custom_margin=&#8221;||-10px||false|false&#8221; custom_padding=&#8221;||0px||false|false&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<p><span data-contrast=\"auto\">Several years ago, the concept of <a href=\"https:\/\/phisonblog.com\/why-is-computational-storage-inevitable-2\/\">computational storage<\/a> was discussed among industry insiders and touted as a potential answer to the age-old question of how to maximize CPU processing power. The idea seemed compelling at first glance. Imagine if storage devices, like SSDs, could actually do some of the processing of the information they hold, so less data had to move between storage and the CPU. Theoretically, it might help you save on power, reduce the need for data transfer, and speed up computations.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">To date, however, as with many seemingly revolutionary ideas, there just hasn\u2019t been a way to make a business out of the concept\u2014primarily because each use case is highly unique and it\u2019s simply not scalable.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">When engineers and developers talk about computational storage, too often it\u2019s a pie-in-the-sky approach: \u201cWhat if we could run Linux on a drive, and we just gave it bigger processors?\u201d While the idea may seem innovative, it lacks focus and practical application. It\u2019s misguided thinking that is overly complicated and driven by technological idealism. Ultimately, it won\u2019t lead to the hoped-for benefits.\u00a0\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<div class=\"banner_wrapper\" style=\"height: 83px;\"><div class=\"banner  banner-45124 bottom vert custom-banners-theme-default_style\" style=\"\"><img decoding=\"async\" width=\"1080\" height=\"150\" src=\"https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/Why-Is-Computational-Storage-Inevitable-Banner.jpg\" class=\"attachment-full size-full\" alt=\"\" style=\"height: 83px;\" srcset=\"https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/Why-Is-Computational-Storage-Inevitable-Banner.jpg 1080w, https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/Why-Is-Computational-Storage-Inevitable-Banner-980x136.jpg 980w, https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/Why-Is-Computational-Storage-Inevitable-Banner-480x67.jpg 480w\" sizes=\"(min-width: 0px) and (max-width: 480px) 480px, (min-width: 481px) and (max-width: 980px) 980px, (min-width: 981px) 1080px, 100vw\" \/><a class=\"custom_banners_big_link\"  href=\"https:\/\/phisonblog.com\/why-is-computational-storage-inevitable-2\/\"><\/a><div class=\"banner_caption\" style=\"\"><div class=\"banner_caption_inner\"><div class=\"banner_caption_text\" style=\"\">Read: Why Is Computational Storage Inevitable?<\/div><\/div><\/div><\/div><\/div>\n<p><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<h3>A smarter approach: Tailored acceleration<\/h3>\n<p><span data-contrast=\"auto\">At Phison, we utilized our knowledge in NAND storage technology innovations to find a better way to offload the processing burden to an SSD, and that\u2019s with a tailored approach to acceleration, one that focuses on the tasks that storage devices are best suited for: applying fixed operations to ranges of logical block addressing (LBA). We integrate specialized accelerators into our SSDs to handle specific tasks that don\u2019t require excessive power or complexity.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">For example, we create hardware accelerators that can perform specific operations at very high speeds, such as qualifying large datasets, object-based erasure coding, checksum verification, and filtering out irrelevant information before it even reaches the CPU. This allows for faster and more efficient data processing, particularly in high-demand environments like data centers or supercomputing clusters. By processing data at the SSD level, you can reduce the amount of data that needs to be moved across the PCIe bus or through the network, which alleviates congestion, eases bandwidth limitations, and speeds up overall performance.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">By focusing on highly specific tasks that are considered \u201cmonkey work\u201d, these accelerators can provide significant benefits without adding substantial cost or power consumption. The accelerated SSDs can handle large volumes of data much faster, all while consuming less power than traditional processors. Importantly, this approach can be scaled across multiple drives, creating a more efficient, parallelized system that outperforms traditional CPU-bound processing.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">The host CPU is cable of doing all the tasks listed above faster than an individual SSD, but there are practical limitations to the total CPU <a href=\"https:\/\/phisonblog.com\/dram-or-not-the-difference-between-dram-and-dram-less-ssds-and-why-it-matters\/\">DRAM<\/a> bandwidth that can be assigned to non-OS tasks.\u00a0 Additionally, moving that data from SSD to DRAM consumes roughly half the DDR bandwidth available to the CPU.\u00a0 When factoring in that an all-flash storage chassis can have 30, 60 or even 90 SSDs, this presents a lot of offload capability to the appliance.\u00a0 A chassis with 90 Gen6 SSDs can process data at 2.5 TB\/s without impacting any CPU resources. In this scenario, the SSDs perform pre-filtering and pre-compute tasks while the CPU manages more important operations.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<div class=\"banner_wrapper\" style=\"height: 83px;\"><div class=\"banner  banner-37934 bottom vert custom-banners-theme-default_style\" style=\"\"><img decoding=\"async\" width=\"1080\" height=\"150\" src=\"https:\/\/phisonblog.com\/wp-content\/uploads\/2021\/11\/SSDs-for-Sustained-Write-Intensive-Workloads.jpg\" class=\"attachment-full size-full\" alt=\"\" style=\"height: 83px;\" srcset=\"https:\/\/phisonblog.com\/wp-content\/uploads\/2021\/11\/SSDs-for-Sustained-Write-Intensive-Workloads.jpg 1080w, https:\/\/phisonblog.com\/wp-content\/uploads\/2021\/11\/SSDs-for-Sustained-Write-Intensive-Workloads-980x136.jpg 980w, https:\/\/phisonblog.com\/wp-content\/uploads\/2021\/11\/SSDs-for-Sustained-Write-Intensive-Workloads-480x67.jpg 480w\" sizes=\"(min-width: 0px) and (max-width: 480px) 480px, (min-width: 481px) and (max-width: 980px) 980px, (min-width: 981px) 1080px, 100vw\" \/><a class=\"custom_banners_big_link\"  href=\"https:\/\/phisonblog.com\/ssds-for-sustained-write-intensive-workloads-2\/\"><\/a><div class=\"banner_caption\" style=\"\"><div class=\"banner_caption_inner\"><div class=\"banner_caption_text\" style=\"\">Read:  SSDs for Sustained Write-Intensive Workloads<\/div><\/div><\/div><\/div><\/div>\n<p><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<h3>Emerging applications in HPC and security are changing the computational storage landscape<span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/h3>\n<p><span data-contrast=\"auto\">Recently, Phison has pivoted and begun to look at new ways to offload some of the CPU workload to SSDs that goes a step beyond targeted accelerators. In some cases, the company is even adding CPU clusters to the storage array\u2014but the big difference here is that the CPU clusters aren\u2019t being used to do calculations, but to actually run web services or microservices. They show up as additional addressable <a href=\"https:\/\/phisonblog.com\/inside-cxl-phison-cto-sebastien-jean-shares-insights-with-ee-times\/\">CXL services<\/a> on the PCIe bus.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">Consider an AI project, for instance, which involves a lot of pipelining. That\u2019s where one large language model (LLM) produces data and outputs it, then another LLM takes it and transforms it and sends it to another LLM, and so on. An example of this would be video translation of a TED talk, in which an LLM extracts the English audio and turns it into text, another LLM translates the text into Chinese, another LLM trained on a celebrity\u2019s voice produces that audio track and so on, until the final output is a brand-new video of that celebrity delivering the talk in Chinese with synchronized lip movment.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">That complex operation involves a lot of little steps that are typically handled by the CPU or GPU and require a lot of model swaps. Why couldn\u2019t you use the SSDs to do those little steps in the background while using the main CPU to delegate tasks to these accelerators and perform other higher-level tasks? In high-performance computing (HPC) organizations, the results can be impressive.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">It&#8217;s not uncommon for an HPC cluster to have 100 petabytes of data storage (which includes double and triple redundancy), which means they could have 100,000 SSDs to spread a workload across. Suddenly, operations that were taking a day or two are now being completed in mere seconds.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">In large HPC arrays, there are so many SSDs that the SSDs\u2019 bandwidth eclipses the entire network or CPU bandwidth. This is where we realized at Phison that there is a massive untapped space where SSDs can do intelligent things.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">Where the HPC use case is all about speed and compute, we\u2019ve also looked at security use cases, which are more about rock-solid FIPS 140-3 compliant products that provide security services well beyond what TPM 2.0 achieve.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">An SSD can execute hundreds of cryptographic operations like signing and verification per second\u2014and if a server has 30 to 90 SSDs, the processing power grows accordingly. Each SSD can act as an independent hardware-based agent with a root of trust that can point back to your HSM (hardware security module) server. Collectively, all of those drives surpass what one powerful CPU can do because the CPUs are not designed to be digital signing algorithm (DSA) engines. That multi-SSD power, combined with the fact that it\u2019s hardware that\u2019s already installed in the server, provides great benefits in bolstering security.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<div class=\"banner_wrapper\" style=\"height: 83px;\"><div class=\"banner  banner-45953 bottom vert custom-banners-theme-default_style\" style=\"\"><img decoding=\"async\" width=\"1080\" height=\"150\" src=\"https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/964_4192971202.jpg\" class=\"attachment-full size-full\" alt=\"\" style=\"height: 83px;\" srcset=\"https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/964_4192971202.jpg 1080w, https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/964_4192971202-980x136.jpg 980w, https:\/\/phisonblog.com\/wp-content\/uploads\/2022\/08\/964_4192971202-480x67.jpg 480w\" sizes=\"(min-width: 0px) and (max-width: 480px) 480px, (min-width: 481px) and (max-width: 980px) 980px, (min-width: 981px) 1080px, 100vw\" \/><a class=\"custom_banners_big_link\"  href=\"https:\/\/phisonblog.com\/6-applications-driven-by-high-performance-memory-and-storage\/\"><\/a><div class=\"banner_caption\" style=\"\"><div class=\"banner_caption_inner\"><div class=\"banner_caption_text\" style=\"\">Read: 6 Applications Driven by High-Performance Memory and Storage<\/div><\/div><\/div><\/div><\/div>\n<p><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<h3>Remove complexity with specificity<\/h3>\n<p><span data-contrast=\"auto\">While Phison still believes that the traditional, generic concept of computational storage won\u2019t ultimately lead anywhere, we do see use cases where another approach to computational storage can be an asset. Targeted accelerators make specific operations less complex. And looking at the massive on-board bandwidth of SSDs and the potential benefits of leveraging that power in new ways could lead to some exciting applications in the very near future.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Several years ago, the concept of computational storage was discussed among industry insiders and touted as a potential answer to the age-old question of how to maximize CPU processing power. The idea seemed compelling at first glance. Imagine if storage devices, like SSDs, could actually do some of the processing of the information they hold, [&hellip;]<\/p>\n","protected":false},"author":16,"featured_media":85213,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","inline_featured_image":false,"footnotes":""},"categories":[23,3,116],"tags":[22],"class_list":["post-85187","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-all-posts","category-enterprise","category-featured","tag-long-content"],"acf":[],"_links":{"self":[{"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/posts\/85187","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/comments?post=85187"}],"version-history":[{"count":12,"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/posts\/85187\/revisions"}],"predecessor-version":[{"id":85226,"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/posts\/85187\/revisions\/85226"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/media\/85213"}],"wp:attachment":[{"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/media?parent=85187"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/categories?post=85187"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/phisonblog.com\/ko\/wp-json\/wp\/v2\/tags?post=85187"}],"curies":[{"name":"\uc6cc\ub4dc\ud504\ub808\uc2a4","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}