feat(constants)!: switch URLs to v0.9.0 layout + add MODEL_REGISTRY by msluszniak · Pull Request #1148 · software-mansion/react-native-executorch

msluszniak · 2026-05-13T14:31:10Z

Description

Refreshes every URL constant to the restructured HF layout under
resolve/v0.9.0 and adds the typed models accessor.

URL refresh

All URLs follow <model>_<size>_<backend>_<precision>.pte, files sit
under per-size and per-backend directories on HF.

modelUrls.ts — every URL rewritten; multi-backend URLs hoisted here so the registry stays declarative. The lfm2_5_350m_xnnpack_8w4da.pte typo is corrected to _8da4w.pte.
ocr/models.ts, tts/models.ts, tts/voices.ts — paths updated to the new shape.
versions.ts — VERSION_TAG → resolve/v0.9.0; PREVIOUS_VERSION_TAG = resolve/v0.8.0 retained for the @deprecated Llama QLoRA aliases.

`models` accessor

New constants/modelRegistry.ts exports models, a typed accessor grouped one-to-one with hooks:

Group	Hook
`llm`	`useLLM` (includes vision-capable LLMs like `lfm2_5_vl_*`)
`classification`	`useClassification`
`privacy_filter`	`usePrivacyFilter`
`object_detection`	`useObjectDetection`
`pose_estimation`	`usePoseEstimation`
`semantic_segmentation`	`useSemanticSegmentation`
`instance_segmentation`	`useInstanceSegmentation`
`style_transfer`	`useStyleTransfer`
`speech_to_text`	`useSpeechToText`
`text_to_speech`	`useTextToSpeech`
`text_embedding`	`useTextEmbeddings`
`image_embedding`	`useImageEmbeddings`
`image_generation`	`useTextToImage`
`vad`	`useVAD`
`ocr`	`useOCR` / `useVerticalOCR`

Each entry is a function — call it (optionally with { quant, backend }) to get the resolved config:

models.llm.llama3_2_3b()                       // default (quantized, platform-default backend)
models.llm.llama3_2_3b({ quant: false })       // base
models.llm.lfm2_5_vl_1_6b()                    // vision-capable LLM (same hook)
models.text_embedding.distiluse_base_multilingual_cased_v2({ backend: 'coreml' })
models.ocr({ language: 'en' })                 // OCR is parameterized by language

The backend parameter is typed to exactly the backends each model ships with — models.llm.llama3_2_3b({ backend: 'coreml' }) is a compile-time error (xnnpack-only).
Defaults to the quantized variant when { quant } is omitted.
text_to_speech exposes kokoro_small/kokoro_medium plus plain voice configs under voices.*.

ESLint's camelcase rule is relaxed to properties: 'never' so the snake_case property keys pass while bindings/functions stay camelCase.

Migration

// Before                                          // After
LLAMA3_2_1B_SPINQUANT                            → models.llm.llama3_2_1b()
LFM2_5_1_2B_INSTRUCT                             → models.llm.lfm2_5_1_2b_instruct({ quant: false })
LFM2_5_VL_1_6B_QUANTIZED                         → models.llm.lfm2_5_vl_1_6b()
EFFICIENTNET_V2_S                                → models.classification.efficientnet_v2_s()
PRIVACY_FILTER_OPENAI                            → models.privacy_filter.openai()
YOLO26N_POSE                                     → models.pose_estimation.yolo26n()
WHISPER_TINY_EN                                  → models.speech_to_text.whisper_tiny_en()
{ model: KOKORO_MEDIUM, voice: KOKORO_VOICE_AF_HEART }
                                                 → { model: models.text_to_speech.kokoro_medium(),
                                                     voice: models.text_to_speech.voices.af_heart }
OCR_ENGLISH                                      → models.ocr({ language: 'en' })
MODEL_REGISTRY.LLM.LLAMA3_2_3B                   → models.llm.llama3_2_3b()

Individual constant imports (LLAMA3_2_1B_SPINQUANT, KOKORO_MEDIUM, etc.) still work — the new accessor is the recommended path. The flat MODEL_REGISTRY = { ALL_MODELS: {...} } export from modelUrls.ts is removed; the internal getModelNameForUrl lookup is preserved.

Example apps + docs

All example apps migrated to models.*(). Heavily-used groups are destructured at the top of the file (const segmentation = models.semantic_segmentation;).
Picker entries compared by modelName to handle accessor-function values.
bare-rn LLM demo switched to LFM-2.5.
Every documentation code snippet that selected a model via a named constant is rewritten to use the typed models.<group>.<entry>() accessor across 03-hooks/**, 04-typescript-api/**, 01-fundamentals/**, etc. The webrtc-integration page intentionally keeps the named-constant style — the snippet reads cleaner alongside imports from other libraries.
Model Registry docs page rewritten for the new accessor; the 0.8.x version's anchor is repointed to its own version to survive the rename.

Deprecations

LLAMA3_2_3B_QLORA, LLAMA3_2_1B_QLORA — @deprecated; the .pte files stay at v0.8.0 and the constants still resolve those URLs. Use LLAMA3_2_*_SPINQUANT going forward.

Introduces a breaking change?

Yes
No

URL paths under ${VERSION_TAG} change — code that hardcoded resolve/v0.8.0 URLs through the constants keeps working only if it read them at runtime. The flat MODEL_REGISTRY export is removed in favour of the new models accessor.

Type of change

New feature (change which adds functionality)
Other (chores, tests, code style improvements etc.)

Tested on

iOS
Android

yarn typecheck and yarn lint clean across the monorepo. Every example app runs against the v0.9.0 HF state.

Testing instructions

yarn typecheck
yarn lint

In application code:

import { models } from 'react-native-executorch';

const llm = useLLM({ model: models.llm.llama3_2_3b() });
const emb = useTextEmbeddings({
  model: models.text_embedding.distiluse_base_multilingual_cased_v2({ backend: 'coreml' }),
});

Related issues

#431
#612

Checklist

I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have updated the documentation accordingly
My changes generate no new warnings

barhanc · 2026-05-14T12:07:34Z

There are some problems with HF repos:

https://huggingface.co/software-mansion/react-native-executorch-clip-vit-base-patch32/tree/v0.9.0/xnnpack doesn't have text encoder and the exported url points to the image encoder. The example app fails with {"code": 18, "message": "The model's forward function did not succeed. Ensure the model input is correct."}
https://huggingface.co/software-mansion/react-native-executorch-distiluse-base-multilingual-cased-v2/tree/v0.9.0 not all models conform to the naming convention (the fp32 variants use "-" instead of "_")
https://huggingface.co/software-mansion/react-native-executorch-paraphrase-multilingual-MiniLM-L12-v2/tree/v0.9.0 not all models conform to the naming convention (the fp32 variants use "-" instead of "_")
https://huggingface.co/software-mansion/react-native-executorch-deeplab-v3/tree/v0.9.0 doesn't have all the variants that v0.8.0 had
https://huggingface.co/software-mansion/react-native-executorch-fcn/tree/v0.9.0 doesn't have all the variants
https://huggingface.co/software-mansion/react-native-executorch-fast-sam/tree/v0.9.0 silently drops the xnnpack from fastsam-x/ and fastsam-s/ and adds xnnpack/ and coreml/ without the size distinction

https://huggingface.co/software-mansion/react-native-executorch-qwen-3.5/tree/v0.9.0 has both the 0_8b and Qwen3.5-0.8B (and the same for 2B) directories
https://huggingface.co/software-mansion/react-native-executorch-detector-craft/tree/v0.9.0/xnnpack has both craft_xnnpack_int8.pte and xnnpack_craft.pte files

That's what I managed to find, probably someone else should also have a look at the updated HF repos. I will be testing example apps now.

barhanc

~~Regarding HF repos, the DeepLab and FCN are not categorized by size (the same case as FastSAM before).~~ Other HF repos look fine :)

msluszniak · 2026-05-18T07:19:57Z

Regarding HF repos, the DeepLab and FCN are not categorized by size (the same case as FastSAM before). Other HF repos look fine :)

@barhanc I don't think these are sizes per se in this case. You have specified only the backbone of the final model. These models might be smaller or bigger but semantically do not indicate sizes immediately (in comparison where models from the same family have different number of parameters and the name derives from this number of is explicitly named s,m,l, xl etc.).

modelRegistry.ts duplicated the same `${URL_PREFIX}-…/${VERSION_TAG}/…` strings that modelUrls.ts already had inline in each Platform.OS branch. Hoist a single set of per-backend URL constants into modelUrls.ts and have both consumers reference them, so each URL string lives in exactly one place. - Add per-backend exports for efficientnet-v2-s, ssdlite320-mobilenet-v3- large, rfdetr-nano-detector, rfdetr-nano-segmentation, fast-sam {s,x}, distiluse-base-multilingual-cased-v2. - Add `styleTransferUrls(display, slug)` helper for the 4 style-transfer styles; the registry's `styleTransferVariants` now consumes it. - Drop the now-unused `URL_PREFIX, VERSION_TAG` import from modelRegistry.ts. Addresses #1148 (comment)

…al/tts/ocr Adopts Bartek's feedback on #1148 — the accessor is no longer dual-shaped (value AND function). Each leaf is a pure function: call it (optionally with \`{ quant, backend }\`) to get the resolved config. This eliminates the \`useState\` lazy-init footgun and \`useMemo\`/\`useCallback\` dep hazards, so pickers fall back to plain \`===\` reference equality (drops the \`sameValue\` workaround across four \`ModelPicker.tsx\` files). Renames: - \`MODEL_REGISTRY\` → \`models\` (lowercase top-level) - group keys lowercased: \`LLM\` → \`llm\`, etc. - per Kuba: \`vlm\` → \`multimodal\` (anticipates audio-capable LMs like Gemma 4) Adds: - \`models.text_to_speech\` group: \`kokoro_small\`, \`kokoro_medium\`, plus voices as plain configs under \`voices\` (no quant/backend axis). - \`models.ocr({ language })\` parameterized accessor — covers all ISO language tokens via a runtime map built from the existing \`OCR_<LANGUAGE>\` exports. Example apps (22 files, ~150 substitutions) migrated by script. bare-rn demo swapped from \`llama3_2_1b\` to \`lfm2_5_1_2b_instruct\` per Kuba's note. Docs rewritten with the new syntax + TTS + OCR sections. Relaxes the project's \`camelcase\` rule with \`properties: 'never'\` so the lowercase snake_case keys in \`models\` (which mirror the \`.pte\` filename convention) pass without per-file disables. Variable and function names still require camelCase.

Per Kuba's review on #1148 — hoist a camelCase alias for any group used ≥ 2 times in a file, e.g. const instanceSegmentation = models.instance_segmentation; const objectDetection = models.object_detection; Then \`models.instance_segmentation.yolo26n_seg()\` becomes \`instanceSegmentation.yolo26n_seg()\`. Applied to 14 files where it actually reduces noise. Skips aliasing when the camelCase name would shadow an existing local identifier — common in the LLM/STT/embeddings screens where \`llm\`, \`speechToText\`, \`imageEmbedding\` etc. already name hook return values or temporaries.

barhanc

In docs we have many snippets that use the old API for selecting the model, these should probably be changed as well.

msluszniak · 2026-05-19T11:37:39Z

The fact that currently models from both llm and lmm are used by useLLM and analogical for modules seems to be a bit off, what do you think, @barhanc @chmjkb?

barhanc · 2026-05-19T12:17:10Z

The fact that currently models from both llm and lmm are used by useLLM and analogical for modules seems to be a bit off, what do you think, @barhanc @chmjkb

We can just put all these models under llm and leave the hooks and modules unchanged, as the multimodal models are still language models at their core just with added capabilities for other modalities (in this option as a user I would probably want some easy way to see what modalities a given model supports). The other option would be to add useLMM hook and the rest of API mirroring the LLM, but this would lead to a lot of duplicated code, so imo the first option is better.

msluszniak · 2026-05-19T12:20:21Z

Agreed, I'm also in favour of moving them under llm. Also regarding this one:

in this option as a user I would probably want some easy way to see what modalities a given model supports

Do you have anything particular solution on your mind?

barhanc · 2026-05-19T12:27:16Z

I guess we already have something like this in place, since the user can check it like this

const LFM2_5_VL = models.llm.lfm2_5_vl_1_6b()
console.log(LFM2_5_VL.capabilities)

msluszniak · 2026-05-19T14:56:12Z

@barhanc @chmjkb I will rebase this PR once change to TTS will land on main. But you can review it and then only the rebase part.

barhanc

LGTM 🚀

Conflict-resolution slip flipped "need to" -> "need ot" in the LLMController delete-while-generating error. The voice_chat screen was deleted in #1132 but the drawer entry was left behind.

The two-step install layout from #1146 (separate core / resource-fetcher sections with per-package-manager tabs) was lost during the rebase. Restore main's version verbatim.

Keep the registry-accessor useLLM example but bring back the npm / pnpm / yarn Tabs around the resource-fetcher install commands that were lost during the rebase.

The trailing block linking to the typedoc-generated selectByPoint / selectByBox / selectByText pages was dropped during the rebase.

The reference section's explanation that `code` is typed `RnExecutorchErrorCode | number` (and the guidance to include a `default` branch when switching on it) was lost during the rebase, along with two row wordings in the input/runtime error tables.

Rebase artifact — this file shouldn't have been touched by the PR at all.

The previous link pointed at /docs/next/api-reference/variables/MODEL_REGISTRY, which now 404s because this PR removes the MODEL_REGISTRY export from main. Switch to the 0.8.x snapshot's own api-reference page using the canonical .md-suffixed relative form so docusaurus resolves it through the source-file URL map.

DownloadInterrupted, ModuleNotLoaded and ModelGenerating already resolve to the exact same strings through DefaultErrorMessages in errorUtils.ts, so the per-call-site duplicates were dead weight that #1141 was meant to remove.

…on accessors The instance_segmentation namespace already implies segmentation, so the _seg suffix on yolo26* and rf_detr_nano was redundant and inconsistent with pose_estimation.yolo26n (no _pose suffix). Drop the suffix and update demo apps and docs accordingly.

…entation FastSAM is consumed via useInstanceSegmentation, so its accessor belongs in models.instance_segmentation. Move fastsam_s/fastsam_x and update demo apps and docs accordingly.

The craft() OCR accessor was throwing a plain Error when called with an unpublished language; use RnExecutorchError with LanguageNotSupported so consumers can switch on code like every other error path.

The three sanity checks in firstBackend/resolveCell/resolveVariant were the last plain Error throws in modelRegistry. The first two are truly-internal invariants -> Internal; the backend-missing one is reachable from untyped callers -> InvalidConfig.

## Description Tightens the LFM2.5 quickstart in the root README and all five translated readmes: - Flags the quickstart as Expo-targeted and links bare React Native users to the [Getting Started guide](https://docs.swmansion.com/react-native-executorch/docs/fundamentals/getting-started) instead of duplicating the bare install steps inline. - Drops the `react-native-executorch-bare-resource-fetcher` / `@dr.pogodin/react-native-fs` / `@kesha-antonov/react-native-background-downloader` lines from the install snippet now that the bare path lives in the docs. - Switches the sample to the model-registry accessor (`models.llm.lfm2_5_1_2b_instruct()`), matching the apps and the new MODEL_REGISTRY flow added in #1148. - Normalizes `yarn <ios|android>` placeholders — no spaces inside the angle brackets. ### Introduces a breaking change? - [ ] Yes - [x] No ### Type of change - [ ] Bug fix (change which fixes an issue) - [ ] New feature (change which adds functionality) - [x] Documentation update (improves or adds clarity to existing documentation) - [ ] Other (chores, tests, code style improvements etc.) ### Tested on - [ ] iOS - [ ] Android ### Testing instructions Docs-only change. Render the affected readmes on GitHub and confirm: - Quickstart heading is followed by the Expo / bare-RN doc-link sentence. - Install block lists only the Expo fetcher trio. - Code sample imports `models` and calls `models.llm.lfm2_5_1_2b_instruct()`. - No `< ios | android >` (with spaces) remains. ### Screenshots N/A ### Related issues N/A ### Checklist - [x] I have performed a self-review of my code - [x] I have commented my code, particularly in hard-to-understand areas - [x] I have updated the documentation accordingly - [x] My changes generate no new warnings ### Additional notes The translated readmes carry through the same edits with the disclaimer + doc link translated into each language; the docs URL is shared across all of them.

msluszniak assigned msluszniak and unassigned msluszniak May 13, 2026

msluszniak added feature PRs that implement a new feature labels May 13, 2026

msluszniak marked this pull request as ready for review May 13, 2026 15:16

barhanc self-requested a review May 13, 2026 15:22

msluszniak requested review from NorbertKlockiewicz and chmjkb May 14, 2026 09:14

msluszniak force-pushed the @ms/model-registry branch from 667d6b3 to fc5eeb0 Compare May 14, 2026 09:36

barhanc reviewed May 14, 2026

View reviewed changes

Comment thread apps/llm/components/ModelPicker.tsx Outdated

Comment thread packages/react-native-executorch/src/constants/modelRegistry.ts Outdated

msluszniak requested a review from mkopcins May 14, 2026 11:48

barhanc reviewed May 14, 2026

View reviewed changes

Comment thread packages/react-native-executorch/src/constants/modelRegistry.ts Outdated

msluszniak linked an issue May 18, 2026 that may be closed by this pull request

Model constants grouped by type #612

Closed

chmjkb requested changes May 18, 2026

View reviewed changes

barhanc reviewed May 18, 2026

View reviewed changes

Comment thread packages/react-native-executorch/src/constants/modelRegistry.ts Outdated

msluszniak requested review from barhanc and chmjkb May 19, 2026 14:51

mkopcins reviewed May 19, 2026

View reviewed changes

Comment thread apps/llm/app/llm/index.tsx Outdated

barhanc approved these changes May 19, 2026

View reviewed changes

msluszniak force-pushed the @ms/model-registry branch from c87ba4a to 6950512 Compare May 21, 2026 09:20

feat!: typed models accessor + v0.9.0 model layout

5e10f9a

msluszniak force-pushed the @ms/model-registry branch from 6950512 to 5e10f9a Compare May 21, 2026 09:38

barhanc reviewed May 21, 2026

View reviewed changes

Comment thread packages/react-native-executorch/src/constants/modelUrls.ts Outdated

Comment thread packages/react-native-executorch/src/constants/modelUrls.ts Outdated

Comment thread packages/react-native-executorch/src/constants/modelUrls.ts Outdated

fix(urls): make legacy whisper exports prefer coreml on iOS

7336f58

chmjkb requested changes May 21, 2026

View reviewed changes

Comment thread docs/docs/05-utilities/model-registry.md Outdated

docs(model-registry): fix migration section to reflect actual prior API

ade50c4

msluszniak requested a review from chmjkb May 21, 2026 10:31

msluszniak added 2 commits May 21, 2026 12:54

fix: restore typo and drop stale voice_chat drawer entry

6752228

Conflict-resolution slip flipped "need to" -> "need ot" in the LLMController delete-while-generating error. The voice_chat screen was deleted in #1132 but the drawer entry was left behind.

docs: restore getting-started install section from main

1769f4e

The two-step install layout from #1146 (separate core / resource-fetcher sections with per-package-manager tabs) was lost during the rebase. Restore main's version verbatim.

chmjkb requested changes May 21, 2026

View reviewed changes

msluszniak added 7 commits May 21, 2026 13:12

docs: restore loading-models package-manager tabs from main

a917564

Keep the registry-accessor useLLM example but bring back the npm / pnpm / yarn Tabs around the resource-fetcher install commands that were lost during the rebase.

docs: restore selectBy* API reference links in useInstanceSegmentation

fe8aa41

The trailing block linking to the typedoc-generated selectByPoint / selectByBox / selectByText pages was dropped during the rebase.

docs: restore stray blank line in custom-adapter

9cc5749

Rebase artifact — this file shouldn't have been touched by the PR at all.

fix(llm-controller): drop redundant message args on throws

c32623c

DownloadInterrupted, ModuleNotLoaded and ModelGenerating already resolve to the exact same strings through DefaultErrorMessages in errorUtils.ts, so the per-call-site duplicates were dead weight that #1141 was meant to remove.

msluszniak requested a review from chmjkb May 21, 2026 12:13

barhanc reviewed May 21, 2026

View reviewed changes

Comment thread docs/docs/03-hooks/02-computer-vision/useInstanceSegmentation.md Outdated

refactor(models): move FastSAM from object_detection to instance_segm…

4d9ce61

…entation FastSAM is consumed via useInstanceSegmentation, so its accessor belongs in models.instance_segmentation. Move fastsam_s/fastsam_x and update demo apps and docs accordingly.

chmjkb reviewed May 21, 2026

View reviewed changes

Comment thread packages/react-native-executorch/src/constants/modelRegistry.ts

Comment thread packages/react-native-executorch/src/constants/modelRegistry.ts Outdated

fix(models): throw RnExecutorchError on unsupported OCR language

f63b5d1

The craft() OCR accessor was throwing a plain Error when called with an unpublished language; use RnExecutorchError with LanguageNotSupported so consumers can switch on code like every other error path.

msluszniak requested a review from chmjkb May 21, 2026 13:33

chmjkb approved these changes May 21, 2026

View reviewed changes

msluszniak merged commit 3314cf5 into main May 21, 2026
5 checks passed

msluszniak deleted the @ms/model-registry branch May 21, 2026 13:58

msluszniak mentioned this pull request May 22, 2026

docs(readme): streamline LFM2.5 quickstart for Expo #1171

Merged

12 tasks

Conversation

msluszniak commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

URL refresh

models accessor

Migration

Example apps + docs

Deprecations

Introduces a breaking change?

Type of change

Tested on

Testing instructions

Related issues

Checklist

Uh oh!

Uh oh!

Uh oh!

barhanc commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

barhanc left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

msluszniak commented May 18, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

barhanc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

msluszniak commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

barhanc commented May 19, 2026

Uh oh!

msluszniak commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

barhanc commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

msluszniak commented May 19, 2026

Uh oh!

Uh oh!

barhanc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

msluszniak commented May 13, 2026 •

edited

Loading

`models` accessor

barhanc commented May 14, 2026 •

edited

Loading

barhanc left a comment •

edited

Loading

msluszniak commented May 19, 2026 •

edited

Loading

msluszniak commented May 19, 2026 •

edited

Loading

barhanc commented May 19, 2026 •

edited

Loading