Consulting / Foundation Models & Apple Intelligence

Foundation Models & Apple Intelligence

Apple's Foundation Models framework gives your app an on-device LLM, Private Cloud Compute for the heavier prompts, image input, and custom tools the model can call. It also leaves out every device that cannot run the on-device model, which on iOS 27 is a large share of your users. I integrate the framework and build the fallback for the users it ignores, because that fallback is the part that decides whether the feature ships.

  • Foundation Models integration: on-device generation, Private Cloud Compute for bigger prompts, image input, and custom tool-calling
  • the device-tier fallback for the majority of iOS 27 devices that cannot run Apple's on-device model
  • structured output with @Generable so the model returns typed data your app can use, instead of a string you have to parse and pray over
Recognition
App Store Best New Apps 2026 Product Hunt Product of the Day 2025 CES Best of Innovation 2021 CES Innovation Award 2021 Webby Honoree 2021 Google Material Design 2020
Credentials
Member of British Computer Society 2024 BEng (Hons) 2017 Apple WWDC Scholarship 2015

Tell me what you're working on. I reply within 48 hours.


"Vadim was instrumental to the success Epsy enjoyed on iOS, taking it from an idea on a Miro board to the highest rated and most downloaded app of its kind on the store."

James C. · Mobile Engineering Lead, Epsy

"We had a strict deadline, and Vadim managed to complete the job in time. He gave us meaningful feedback and suggested better approaches, not trying to blindly stick to our specification."

Founder · Pre-seed streaming service

"I can say with confidence that it will be difficult to find a better developer. Vadim is achievement-oriented, highly organized, with very good communication skills."

Alex Z. · Co-Founder, eda.so




Advisory
£110
per hour

Architecture reviews, hiring help, second opinions on that thing that's been bugging you.

Retainer
£4,000
per month

Priority support: review agency code, join architecture calls, catch problems before they ship.


Is Apple Intelligence free? Does this kill our OpenAI bill?

The on-device model is free to call and runs without a network. The server side, Private Cloud Compute, does not have published developer pricing yet, so nobody can tell you it zeroes your bill. What is safe to say today: prompts that fit the on-device model stop costing you per-token money on supported devices, and you get a privacy and latency win. For the prompts that need the big model, the honest move is to wait for Apple's pricing before you rip out your current vendor, and that judgment is part of what I bring.

What share of our users can actually run this?

Fewer than you would like. iOS 27 installs back to the iPhone 11, but Apple's on-device model needs much newer silicon, so a large part of your base is on a device that cannot run it. That gap is exactly why the fallback is the real work. If you only build the happy path, you have shipped a feature a chunk of your paying users will never see, and you will hear about it in the reviews.

Can the model read images now, or is it text only?

It takes images as of this year's release, which opens up features that look at a photo and reason about it: read a label, check a garment, parse a page of a cookbook. The same availability rules apply, so the image path needs its own fallback. And do not assume the image generation side is fully open to developers yet; the input side is the part you can build on with confidence today, and knowing that boundary stops you designing around a capability that may not ship to you.

We have our own model. Should we use Foundation Models instead?

Maybe for part of it. Foundation Models is a fixed Apple model: you do not control its weights, its updates, or its exact behaviour, and it will change under you with OS releases. If your value is a model you trained, keep it and run it through Core ML. If you are calling a general LLM for summaries, rewriting, or extraction, the Apple model probably covers it on-device for free. Splitting the feature along that line, rather than going all-in on either, is the decision I help you get right.

How quickly can you start?

Advisory calls can happen within days. For project work, I typically need 1-2 weeks notice to clear the calendar, though I keep some buffer for urgent firefighting. Check the availability badges above for current openings.

Do you work with early-stage startups?

Yes, from pre-seed to Series C and beyond. For very early teams, the advisory tier often makes more sense than project work: you get architecture guidance without committing to a large engagement before you've validated the product.

What's included in the day rate?

Everything: code, architecture decisions, code review, documentation, async Slack availability during working hours. No surprise add-ons. I bill for time spent working on your project, not for "thinking about it in the shower."

We're in a different timezone. Will that slow things down?

I'm currently in Vancouver (PST), with full overlap for North American teams. For UK and Europe, I'm online by their afternoon. For Gulf or APAC, we'd agree on overlap hours and handle the rest async. I've worked with teams from San Francisco to Dubai.


Foundation Models frameworkLanguageModelSession, prompt design, and the availability discipline that keeps the feature from crashing on the hardware it was never going to run on.
On-device vs Private Cloud Computewhich prompts run locally for privacy and speed, and which go to PCC for capability. That split has data-handling consequences most teams notice too late.
Structured output (@Generable)typed results from the model so your code gets a struct back instead of a string to wrestle, with the parsing and validation already handled.
Image inputthe multimodal path, for features that read a photo (a garment, a label, a page) and reason about it. The fallback question applies here too, and it is easy to forget.
Custom toolsgiving the model functions it can call into your app, and the guardrails that stop it doing something the user never asked for, which is where this gets dangerous if rushed.
Device-tier fallbackthe on-device, server, or degraded path for the large share of iOS 27 devices that cannot run Apple's model at all. This is the billable work hiding behind the demo.
Cost & privacy trade-offwhether moving off your current LLM vendor saves money or only buys you privacy and integration. Apple has not published server pricing yet, so anyone promising you a cost cut is guessing, and I would rather tell you that than sell you a number.

Where I've worked CV LinkedIn

Drobinin Limited Founder · 2025 - present 12+ apps from idea to App Store. Featured by Apple in EMEA & Americas.
LivaNova (NASDAQ: LIVN) Senior iOS · 2020-2025 Epsy, an epilepsy management app. Shipped inside an FDA-regulated medical-device company. HIPAA, CES Innovation Award.
Sphere (acquired by Twitter/X) Senior iOS · 2017-2020 Early Employee. $30M funding to acquisition.
VK.com iOS Consultant · 2016-2017 Authored & delivered an onsite course on iOS development.
ToBox Lead iOS · 2015-2016 Built team, MVVM architecture, full Swift rewrite.

Integrating Foundation Models?

Tell me what you're working on. I reply within 48 hours.

work@drobinin.com