火爆全網!AI新星Groq橫空出世,真的能碾壓英偉達GPU?
火爆AI圈,刷屏互聯網!
近期,Groq引發廣泛討論,其大模型每秒能輸出750個tokens,比GPT-3.5快18倍,自研LPU推理速度是英偉達GPU的10倍。

速度快得出奇
Groq名字與馬斯克的大模型Grok讀音類似,成立於2016年,定位為一家人工智能解決方案公司。
Groq爆火主要是因為其處理速度非常快。據媒體報道,該公司的芯片推理速度較英偉達GPU提高10倍,成本只有其1/10。
運行的大模型生成速度接近每秒500 tokens,碾壓ChatGPT-3.5大約40 tokens/秒的速度。
極限情況下,Groq的Llama2 7B甚至能實現每秒750 tokens,為GPT-3.5的18倍。

在Groq的創始團隊中,有8人來自谷歌早期TPU核心設計團隊,但Groq並未選擇TPU、GPU、CPU等路線,而是自研了語言處理單元(LPU)。

Groq官網顯示,在 Groq LPU™推理引擎上運行的Meta AI的Llama 2 70B的性能優於所有其他基於雲的推理提供商,吞吐量提高了18倍。

能否取代英偉達?
不過,速度並不是AI發展的唯一決定性因素。在Groq爆火的同時,也有一些質疑聲音。
首先,Groq似乎只是看起來了便宜。Groq的一張LPU卡僅有230MB的內存,售價為2萬多美元。
有網友分析,英偉達H100的成本效益應為Groq的11倍。

更為關鍵的是,Groq LPU完全不配備高帶寬存儲器(HBM),而是僅配備了一小塊的超高速靜態隨機存取存儲器(SRAM),這種SRAM的速度比HBM3快20倍。

這也意味着,與英偉達的H200相比,在運行單個AI模型時需要配置更多的Groq LPU。
另據Groq員工透露,Groq的LLM在數百個芯片上運行。

對此,騰訊科技的芯片專家姚金鑫認為,Groq的芯片目前並不能取代英偉達。
他認為,速度是Groq的雙刃劍。Groq的架構建立在小內存、大算力上,因此有限的被處理的內容對應着極高的算力,導致其速度非常快。
另一方面,Groq極高的速度是建立在很有限的單卡吞吐能力上的,要保證和H100同樣吞吐量,就需要更多的卡。
他分析,對於Groq這種架構來講,也有其盡顯長處的應用場景,對許多需要頻繁數據搬運的場景來説再好不過。
Follow us
Find us on
Facebook,
Twitter ,
Instagram, and
YouTube or frequent updates on all things investing.Have a financial topic you would like to discuss? Head over to the
uSMART Community to share your thoughts and insights about the market! Click the picture below to download and explore uSMART app!

Disclaimers
uSmart Securities Limited (“uSmart”) is based on its internal research and public third party information in preparation of this article. Although uSmart uses its best endeavours to ensure the content of this article is accurate, uSmart does not guarantee the accuracy, timeliness or completeness of the information of this article and is not responsible for any views/opinions/comments in this article. Opinions, forecasts and estimations reflect uSmart’s assessment as of the date of this article and are subject to change. uSmart has no obligation to notify you or anyone of any such changes. You must make independent analysis and judgment on any matters involved in this article. uSmart and any directors, officers, employees or agents of uSmart will not be liable for any loss or damage suffered by any person in reliance on any representation or omission in the content of this article. The content of the article is for reference only and does not constitute any offer, solicitation, recommendation, opinion or guarantee of any securities, virtual assets, financial products or instruments. Regulatory authorities may restrict the trading of virtual asset-related ETFs to only investors who meet specified requirements. Any calculations or images in the article are for illustrative purposes only.
Investment involves risks and the value and income from securities may rise or fall. Past performance is not indicative of future performance. Please carefully consider your personal risk tolerance, and consult independent professional advice if necessary.