LLMs Archives - High-Performance Computing News Analysis | insideHPC https://insidehpc.com/tag/llms/ At the Convergence of HPC, AI and Quantum Wed, 19 Feb 2025 19:20:25 +0000 en-US hourly 1 https://wordpress.org/?v=6.7.2 https://insidehpc.com/wp-content/uploads/2024/06/ihpc-favicon.png LLMs Archives - High-Performance Computing News Analysis | insideHPC https://insidehpc.com/tag/llms/ 32 32 57143778 Toward AGI: AI Innovation Will Be Driven by Applications, Not LLMs https://insidehpc.com/2025/02/toward-agi-ai-innovation-will-be-driven-by-applications-not-llms/ Thu, 13 Feb 2025 18:59:37 +0000 https://insidehpc.com/?p=95661

DeepSeek’s LLM has caused a stir, but ... companies like OpenAI and Anthropic are aiming higher, their sights are set on artificial general intelligence, for which LLMs will be a component. No matter how fast, powerful, or efficient they get, LLMs alone won’t be enough to achieve AGI.

The post Toward AGI: AI Innovation Will Be Driven by Applications, Not LLMs appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
95661
QCT Leverages NVIDIA AI Enterprise Software Platform to Enhance AI Powerhouses https://insidehpc.com/2024/12/qct-leverages-nvidia-ai-enterprise-software-platform-to-enhance-ai-powerhouses/ Thu, 05 Dec 2024 18:51:13 +0000 https://insidehpc.com/?p=95297

When we last took a close look at QCT (Quanta Cloud Technology), the data center, hyperscale and cloud server maker based in Taiwan, we pointed out that the company is a bigger player in the server industry than ....

The post QCT Leverages NVIDIA AI Enterprise Software Platform to Enhance AI Powerhouses appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
95297
AWS Announces EC2 UltraCluster and GA of Trainium2 Instances https://insidehpc.com/2024/12/aws-announces-ec2-ultracluster-and-ga-of-trainium2-instances/ Tue, 03 Dec 2024 21:09:19 +0000 https://insidehpc.com/?p=95279

LAS VEGAS, Dec. 3, 2024 -- At AWS re:Invent, Amazon Web Services today announced the general availability of AWS Trainium2-powered Amazon Elastic Compute Cloud (Amazon EC2) instances, introduced new Trn2 UltraServers, enabling customers to train ....

The post AWS Announces EC2 UltraCluster and GA of Trainium2 Instances appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
95279
HPC News Bytes 20241202: Do LLM’s Understand?, Agentic AI, Simulating the Universe, France Adding Reactors, TSMC 2nm Chips https://insidehpc.com/2024/12/hpc-news-bytes-20241202-do-llms-understand-agentic-ai-simulating-the-universe-france-adding-reactors-tsmc-2nm-chips/ Mon, 02 Dec 2024 19:11:45 +0000 https://insidehpc.com/?p=95266

A happy December start to you! From the world of HPC-AI, here's a rapid (7:38) romp through recent news, including: LLMs and “emergent” understanding, collaborative Agentic AI, Frontier exascale ....

The post HPC News Bytes 20241202: Do LLM’s Understand?, Agentic AI, Simulating the Universe, France Adding Reactors, TSMC 2nm Chips appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
95266
Oriole Networks Raises $22M for Photonics to Cut LLM Energy Use https://insidehpc.com/2024/10/oriole-networks-raises-22m-for-photonics-to-cut-llm-energy-use/ Mon, 21 Oct 2024 13:30:37 +0000 https://insidehpc.com/?p=94987

London, 21st October: Oriole Networks – a company using light to train Large Language Models with low energy consumption – has raised an additional $22 million from investors to scale its “super-brain” solution.  The round was led by Plural with all existing investors – UCL Technology Fund, XTX Ventures, Clean Growth Fund, and Dorilton Ventures – reinvesting. Oriole Networks addresses […]

The post Oriole Networks Raises $22M for Photonics to Cut LLM Energy Use appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
94987
HPC News Bytes 20241014: AMD Rollout, Foxconn’s Massive AI HPC, AI Drives Nobels, Are LLM’s Intelligent? https://insidehpc.com/2024/10/hpc-news-bytes-20241014-amd-rollout-foxconns-massive-ai-hpc-ai-drives-nobels-are-llms-intelligent/ Mon, 14 Oct 2024 15:55:59 +0000 https://insidehpc.com/?p=94942

A good mid-October morn to you! Here’s a brief (6:30) run-through of developments from the world of HPC-AI, including: AMD's products rollout, Foxconn's big Blackwell AI HPC in Taiwan, AI for science drives Nobel Prizes, Meta AI guru's AGI skepticism

The post HPC News Bytes 20241014: AMD Rollout, Foxconn’s Massive AI HPC, AI Drives Nobels, Are LLM’s Intelligent? appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
94942
Cerebras Claims Fastest AI Inference https://insidehpc.com/2024/08/cerebras-claims-fastest-ai-inference/ Tue, 27 Aug 2024 19:40:53 +0000 https://insidehpc.com/?p=94665

AI compute company Cerebras Systems today announced what it said is the fastest AI inference solution. Cerebras Inference delivers 1,800 tokens per second for Llama3.1 8B and 450 tokens per second for Llama3.1 70B, according to the company, making it 20 times faster than GPU-based solutions in hyperscale clouds.

The post Cerebras Claims Fastest AI Inference appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
94665
NVIDIA and Google DeepMind Collaborate on LLMs https://insidehpc.com/2024/05/nvidia-and-google-deepmind-collaborate-on-llms/ Wed, 15 May 2024 09:52:05 +0000 https://insidehpc.com/?p=94047

Intended to make it easier for developers to create AI-powered applications with world-class performance, NVIDIA and Google today announced three new collaborations at Google I/O ’24. Using TensorRT-LLM, NVIDIA is working with Google to optimize two new models it introduced at the event: Gemma 2 and PaliGemma. These models are built from the same research and […]

The post NVIDIA and Google DeepMind Collaborate on LLMs appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
94047
Amazon Adds $2.75B to Stake in GenAI Startup Anthropic https://insidehpc.com/2024/03/amazon-adds-2-75b-to-stake-in-genai-startup-anthropic/ Wed, 27 Mar 2024 19:29:48 +0000 https://insidehpc.com/?p=93743

Amazon announced it has made its biggest-ever investment, $2.75 billion, in OpenAI/Chat-GPT competitor Anthropic, another indication that the generative AI phenomenon continues to heat up. Today’s news follows Amazon and Anthropic announcing an earlier $1.25 billion investment last September – the announcement today brings the total investment to $4 billion. “We have a notable history with […]

The post Amazon Adds $2.75B to Stake in GenAI Startup Anthropic appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
93743
Oriole Networks Raises £10m for Faster LLM Training https://insidehpc.com/2024/03/oriole-networks-raises-10m-for-faster-llm-training/ Wed, 27 Mar 2024 16:28:08 +0000 https://insidehpc.com/?p=93738

London, 27 March 2024: Oriole Networks – a startup using light to train LLMs faster with less power – has raised £10 million in seed funding to improve AI performance and adoption, and solve AI’s energy problem. The round, which the company said is one of the UK’s largest seed raises in recent years, was co-led […]

The post Oriole Networks Raises £10m for Faster LLM Training appeared first on High-Performance Computing News Analysis | insideHPC.

]]>
93738