Why it matters
AI Engineer session on Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX, presented by Adrien Grondin, Locally AI. It adds practical context for how teams are building and operating AI systems in production.
My takeaway: Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX — Adrien Grondin, Locally AI is an enterprise-adoption signal. The practical read is to watch how deployment scale, data boundaries, operational ownership, and platform controls change as AI moves out of experiments.