Small is big: Meta bets on AI models for mobile devices

Meta researchers say small language models for mobile with less than a billion parameters could be as effective as large language models.

Credit: Skorzewiak / Shutterstock

Facebook-parent Meta has been working on developing a new small language model (SLM) compatible with mobile devices with the aim of running on-device applications while mitigating energy consumption during model inferencing tasks, a paper published by company researchers showed.

To set the context, large language models (LLMs) have a lot more parameters. For instance, Mistral-22B has 22 billion parameters while GPT-4 has 1.76 trillion parameters. In contrast, smaller language models have relatively fewer parameters, such as Microsoft’s Phi-3 family of SLMs, which have different versions starting from 3.8 billion parameters.

A parameter helps an LLM decide between different answers it can provide to queries — the more the number of parameters, the more the need for a larger computing infrastructure.

However, Meta researchers believe that effective SLMs with less than a billion parameters can be developed and it would unlock the adoption of generative AI across use cases involving mobile devices, which have relatively less compute infrastructure than a server or a rack.

The researchers, according to the paper, ran experiments with models, architected differently, having 125 million and 350 million parameters, and found that smaller models prioritizing depth over width enhance model performance.

“Contrary to prevailing belief emphasizing the pivotal role of data and parameter quantity in determining model quality, our investigation underscores the significance of model architecture for sub-billion scale LLMs,” the researchers wrote.

“Leveraging deep and thin architectures, coupled with embedding sharing and grouped-query attention mechanisms, we establish a strong baseline network denoted as MobileLLM, which attains a remarkable 2.7%/4.3% accuracy boost over preceding 125M/350M state-of-the-art models,” they added.

The 125 and 350 million models, dubbed MobileLLM, according to the researchers, were as effective as large language models, such as Llama 2, in handling chat and several API calling tasks, highlighting the capability of small models for common on-device use cases. While MobileLLM is not available across any of Meta’s products for public use, the researchers have made the code and data for the experiment available along with the paper.

More Meta news:

Americas

Asia

Europe

Oceania

Topics

About

Policies

Our Network

More

Small is big: Meta bets on AI models for mobile devices

Meta researchers say small language models for mobile with less than a billion parameters could be as effective as large language models.

More from this author

Mistral’s new Codestral Mamba to aid longer code generation

OpenAI whistleblowers seek SEC probe into ‘restrictive’ NDAs with staffers

Amazon Q Business now available with new app-builder capabilities

Elon Musk sues OpenAI alleging breach of founding agreement

Google calls Microsoft’s cloud practices in the EU anti-competitive

Meta to label AI-generated images from Google, OpenAI and Adobe

Generative AI boosts cloud revenue for Microsoft, Google

Italian watchdog says ChatGPT breached data privacy norms

Most popular authors

Show me more

Why going online is no longer fun

Nerdio enables remote work across the Canadian wilderness for the Government of Alberta

3 ways Nerdio simplifies Microsoft Azure Virtual Desktop operations and management for IT

Is it fair to call the Vision Pro a flop?

Podcast: Why a TikTok ban makes sense

Podcast: Are audio AI companies infringing on musicians' rights?

New hacks keep summer heat on businesses

Does AI need to be for 'everyone'?

It's OK to call Apple Vision Pro a flop

Small is big: Meta bets on AI models for mobile devices

Meta researchers say small language models for mobile with less than a billion parameters could be as effective as large language models.

Related content

CrowdStrike failure: What you need to know

Android security checkup: 18 steps to a safer phone

Is Copilot for Microsoft 365 a lying liar?

8 ways to prep your Windows PC for disaster

From our editors straight to your inbox

More from this author

Mistral’s new Codestral Mamba to aid longer code generation

OpenAI whistleblowers seek SEC probe into ‘restrictive’ NDAs with staffers

Amazon Q Business now available with new app-builder capabilities

Elon Musk sues OpenAI alleging breach of founding agreement

Google calls Microsoft’s cloud practices in the EU anti-competitive

Meta to label AI-generated images from Google, OpenAI and Adobe

Generative AI boosts cloud revenue for Microsoft, Google

Italian watchdog says ChatGPT breached data privacy norms

Most popular authors

Show me more

Why going online is no longer fun

Nerdio enables remote work across the Canadian wilderness for the Government of Alberta

3 ways Nerdio simplifies Microsoft Azure Virtual Desktop operations and management for IT

Is it fair to call the Vision Pro a flop?

Podcast: Why a TikTok ban makes sense

Podcast: Are audio AI companies infringing on musicians' rights?

New hacks keep summer heat on businesses

Does AI need to be for 'everyone'?

It's OK to call Apple Vision Pro a flop