Gian Marco Iodice

Aug 20, 2025 • World Congress 2025

Mobile AI Just Got Faster: What’s Coming for Developers on Arm

What if you could get a 6x performance boost for on-device AI with zero code changes? See how Arm's new SME2 instructions make it a reality for developers.

#1about 3 minutes

Exploring generative AI use cases on mobile devices

Generative AI on mobile enables powerful, local-first applications like group chat summarization and high-quality audio generation without an internet connection.

#2about 3 minutes

Why you should run AI workloads on the Arm CPU

The Arm CPU offers scalability, security, and an "optimize once, deploy everywhere" model, making it ideal for high-performance, low-latency AI applications.

#3about 2 minutes

Navigating the diverse mobile AI framework ecosystem

A wide range of open-source frameworks, each with unique strengths, are available for deploying AI models on Arm-powered mobile devices.

#4about 3 minutes

How the KleidiAI library unifies AI performance

The KleidiAI library provides highly optimized, low-level routines that integrate directly into popular AI frameworks to ensure the best performance on Arm CPUs.

#5about 3 minutes

A deep dive into the on-device AudioGen pipeline

The AudioGen pipeline runs locally by combining multiple models and processing steps, requiring data type flexibility like FP32 and FP16 for optimal quality.

#6about 2 minutes

Building a private, fully on-device smart assistant

Generative AI enables smart speakers to run entirely locally, combining speech-to-text, LLM, and text-to-speech models for a private user experience.

#7about 3 minutes

Introducing SME2 for next-generation AI acceleration

The Scalable Matrix Extension 2 (SME2) for Armv9 CPUs uses the Matrix Outer Product Accumulate (MPA) instruction to dramatically accelerate matrix multiplication.

#8about 1 minute

Measuring performance gains with SME2 acceleration

SME2 delivers over six times better performance for key generative AI models like Gemma and Whisper, enabling real-time text summarization and audio generation.

#9about 2 minutes

How Android developers can prepare for SME2

With SME2 support coming to Android, developers using AI frameworks with KleidiAI integration will automatically receive significant performance boosts without any code changes.

Andrew Comp
Berlin, Germany

Intermediate

Java

JavaScript

Admir Comp

Remote

Intermediate

DevOps

Accelerating AI and machine learning with Arm C-Cloudy

01:50 MIN

Accelerating AI and machine learning with Arm C-Cloudy

Unleashing the Full Potential of the Arm Architecture – Write Once, Deploy Anywhere

Unlock full access

Log in or set up an account to access this feature and more.

The future of on-device AI hardware and APIs

02:08 MIN

The future of on-device AI hardware and APIs

From ML to LLM: On-device AI in the Browser

Unlock full access

Log in or set up an account to access this feature and more.

Achieving high performance on low-power devices

01:24 MIN

Achieving high performance on low-power devices

Focoos AI: Building the Future of Computer Vision

Unlock full access

Log in or set up an account to access this feature and more.

Using open source Gemma for local AI processing

03:49 MIN

Using open source Gemma for local AI processing

What’s New with Google Gemini?

Unlock full access

Log in or set up an account to access this feature and more.

The future of on-device AI in web development

01:14 MIN

The future of on-device AI in web development

Generative AI power on the web: making web apps smarter with WebGPU and WebNN

Unlock full access

Log in or set up an account to access this feature and more.

Leveraging hardware like the CPU, GPU, and NPU

04:03 MIN

Leveraging hardware like the CPU, GPU, and NPU

Privacy-first in-browser Generative AI web apps: offline-ready, future-proof, standards-based

Unlock full access

Log in or set up an account to access this feature and more.

Addressing the complexity of modern software development on Arm

03:12 MIN

Addressing the complexity of modern software development on Arm

Unleashing the Full Potential of the Arm Architecture – Write Once, Deploy Anywhere

Unlock full access

Log in or set up an account to access this feature and more.

Leveraging open software and AI for code development

05:03 MIN

Leveraging open software and AI for code development

The Future of Computing: AI Technologies in the Exascale Era

Unlock full access

Log in or set up an account to access this feature and more.

Featured Partners

Unleashing the Full Potential of the Arm Architecture – Write Once, Deploy Anywhere

Unleashing the Full Potential of the Arm Architecture – Write Once, Deploy Anywhere

Andrew Waafa

about 2 years ago • World Congress 2024

From Model to Metal: An Open Source Stack for Accelerating Intelligence

From Model to Metal: An Open Source Stack for Accelerating Intelligence

Andrew Wafaa

about 6 months ago • World Congress 2025

Prompt API & WebNN: The AI Revolution Right in Your Browser

Prompt API & WebNN: The AI Revolution Right in Your Browser

Christian Liebel

about 6 months ago • World Congress 2025

Generative AI power on the web: making web apps smarter with WebGPU and WebNN

Generative AI power on the web: making web apps smarter with WebGPU and WebNN

Christian Liebel

about 2 years ago • World Congress 2024

From ML to LLM: On-device AI in the Browser

From ML to LLM: On-device AI in the Browser

Nico Martin

about a year ago • WeAreDevelopers LIVE

Your Next AI Needs 10,000 GPUs. Now What?

Your Next AI Needs 10,000 GPUs. Now What?

Anshul Jindal & Martin Piercy

about 6 months ago • World Congress 2025

Bringing AI Everywhere

Bringing AI Everywhere

Stephan Gillich

about 2 years ago • World Congress 2024

WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA

WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA

Ankit Patel

about 2 years ago • World Congress 2024

Related Articles

View all articles

CH

Chris Heilmann

Exploring AI: Opportunities and Risks for Developers

In today's rapidly evolving tech landscape, the integration of Artificial Intelligence (AI) in development presents both exciting opportunities and notable risks. This dynamic was the focus of a recent panel discussion featuring industry experts Kent...

Exploring AI: Opportunities and Risks for Developers

LM

Luis Minvielle

13 AI Tools for Developers

Artificial intelligence has rapidly transitioned from a hype item to a must-have tool for devs. Its adoption rate had seen a dramatic increase even before LLMs hit desktop computers, with AI in companies surging by 270% between 2015 and 2019.Develope...

13 AI Tools for Developers

BB

Benedikt Bischof

How we Build The Software of Tomorrow

Welcome to this issue of the WeAreDevelopers Live Talk series. This article recaps an interesting talk by Thomas Dohmke who introduced us to the future of AI – coding.This is how Thomas describes himself:I am the CEO of GitHub and drive the company’s...

How we Build The Software of Tomorrow

DC

Daniel Cranney

Stephan Gillich - Bringing AI Everywhere

In the ever-evolving world of technology, AI continues to be the frontier for innovation and transformation. Stephan Gillich, from the AI Center of Excellence at Intel, dove into the subject in a recent session titled "Bringing AI Everywhere," sheddi...

Stephan Gillich - Bringing AI Everywhere

From learning to earning

Jobs that call for the skills explored in this talk.

AI & Embedded ML Engineer (Real-Time Edge Optimization)

autonomous-teaming

Remote

GIT

Linux

PyTorch

Principal Software Engineer (Productization)

Arm

GIT

Gitlab

Embedded C

Full Stack Developer focused on AI Development

SBI GmbH

DevOps

Gitlab

Pandas

Docker

PyTorch

+8

Product Engineer | AI Developer Automation

Neural Concept

DevOps

Continuous Integration

AIML - Machine Learning Research (Speech), DMLI

Apple Firmenprofil
Aachen, Germany

Confluence

Machine Learning

Backend Developer - Generative AI

INNIO Group

NoSQL

Docker

PyTorch

FastAPI

GraphQL

+3

Software Engineer with a focus on AI

Power Reply GmbH & Co. KG

Remote

NoSQL

Docker

PyTorch

FastAPI

+4

AI Developer*

PULS GmbH

GraphQL

Continuous Integration

AI Developer Conversational AI & Azure

APRIORI - business solutions AG

Machine Learning