AI Accelerator Groq Adapts and Runs LLaMA, the Meta Chatbot Model and Competitor to ChatGPT, for Its Systems

Groq, a leading artificial intelligence (AI) and machine learning (ML) systems innovator, last week announced it adapted a new large language model (LLM) to run on its systems: LLaMA, chatbot technology from Meta and a proposed alternative to ChatGPT.

Facebook® parent Meta released LLaMA, which chatbots can use to generate human-like text, on February 24th. Three days later the Groq team downloaded the model, and within a few more days had it running on a production GroqNode™ server containing eight GroqChip™ inference processors. This is a rapid time-to-functionality: a bring-up of this kind often takes a larger team of engineers weeks to months to complete, yet Groq executed it with just a small group from its compiler team.

Jonathan Ross, CEO and founder of Groq, said, “This speed of development at Groq validates that our generalizable compiler and software-defined hardware approach is keeping up with the accelerating pace of LLM innovation, something traditional kernel-based approaches struggle with.”

The rapid LLaMA bring-up is a particularly noteworthy milestone because Meta researchers originally developed LLaMA for NVIDIA™ chips. By successfully running a cutting-edge model on Groq technology, the engineers demonstrated GroqChip as a ready-to-use alternative to incumbent technology. Generative AI is carving out a place for itself in the market, and as transformers continue to accelerate the pace of LLM development, customers will need solutions that provide tangible time-to-production advantages and reduce developer complexity for fast iteration.

Bill Xing, Tech Lead Manager, ML Compiler at Groq, said, “The complexity of computing platforms is permeating into user code and slowing down innovation. Groq is reversing this trend. Since we’re working on models that were trained on Nvidia GPUs, the first step of porting customer workloads to Groq is removing non-portable code targeted at specific vendors and architectures. This might include replacing vendor-specific kernel calls, removing manual parallelism or memory semantics, and so on. The resulting code ends up looking a lot simpler and more elegant. Imagine not having to do all that ‘performance engineering’ in the first place to achieve stellar performance! This also helps by not locking a business down to a specific vendor.”
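To make the idea concrete, here is a minimal, hypothetical NumPy sketch (not Groq code, and not from any actual LLaMA port) of the kind of simplification described above: a version with manual tiling baked in to match one accelerator's kernel shape, next to a portable version that just expresses the math and leaves the mapping to hardware up to the compiler.

```python
import numpy as np

def attention_scores_tuned(q, k, tile=2):
    """Hypothetical 'vendor-tuned' version: the loop nest is hand-tiled
    to mirror a specific accelerator's kernel launch geometry."""
    n, d = q.shape
    out = np.empty((n, n), dtype=q.dtype)
    for i in range(0, n, tile):          # manual parallelism hints
        for j in range(0, n, tile):      # baked into the code
            out[i:i + tile, j:j + tile] = q[i:i + tile] @ k[j:j + tile].T
    return out / np.sqrt(d)

def attention_scores(q, k):
    """Portable version: state the computation once; scheduling and
    memory placement are left to the compiler."""
    return (q @ k.T) / np.sqrt(q.shape[1])
```

Both functions compute the same scaled dot-product scores; the portable form simply carries no architecture-specific structure to strip out or rewrite when retargeting.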

PRNewswire

PR Newswire, a Cision company, is the premier global provider of multimedia platforms and distribution that marketers, corporate communicators, sustainability officers, public affairs and investor relations officers leverage to engage key audiences. Having pioneered the commercial news distribution industry over 60 years ago, PR Newswire today provides end-to-end solutions to produce, optimize and target content, and then distribute and measure results. Combining the world's largest multi-channel, multi-cultural content distribution and optimization network with comprehensive workflow tools and platforms, PR Newswire powers the stories of organizations around the world. PR Newswire serves tens of thousands of clients from offices in the Americas, Europe, Middle East, Africa and Asia-Pacific regions.
