Model Sources

This repo supports two model paths:

Hugging Face snapshots pulled into the local mere.run model store with mere.run model pull
Explicit local paths passed to commands with --model, --model-root, or equivalent command-specific options

The public repo no longer supports private model archives, R2 credentials, or a packaged central model host.

The canonical local model store is:

text

~/Library/Application Support/MereRun/models

Override that with MERERUN_MODELS_DIR or --models-root.

Canonical Managed Model IDs

mere.run model pull works for catalog entries that have a Hugging Face source:

Category	Hugging Face pull IDs
Image	`image-klein-nano`, `image-klein-base`, `image-klein-max`, `image-zimage-nano`, `image-zimage-base`, `image-zimage-max`, `image-hidream-o1`, `image-hidream-o1-dev`
Text chat	`text-chat-gemma4`, `text-chat-gemma4-nano`, `text-chat-gemma4-max`, `text-chat-q35`, `text-chat-q35-nano`, `text-agent-deepseek-v4-flash`
Text code / agents	`text-agent-qwen35-9b`, `text-code-qwen3`
Text embed	`text-embed-qwen3-0.6b`
Text anonymize	`text-anonymize-privacy-filter`
Speech TTS	`speech-tts-qwen3-nano`, `speech-tts-qwen3-customvoice`
Speech ASR	`speech-asr-qwen3`, `speech-asr-parakeet`
Vision	`vision-ocr-lighton`, `vision-segment-sam31`, `vision-ground-falcon-perception`
Music	`music-acestep`
Video	`video-ltx-av`

Some legacy/local IDs remain in the catalog so existing installs and explicit local paths keep working:

text

image-klein-shared
text-chat-mebot
text-chat-psi-agent

image-klein-shared is an internal shared-component install shape, and the text-chat IDs listed here remain local-path-only until they have public Hugging Face sources.

text-agent-qwen35-9b is the low-memory setup-agent model. It uses the public Hugging Face source unsloth/Qwen3.5-9B-GGUF and selects Qwen3.5-9B-Q4_K_M.gguf.

text-agent-deepseek-v4-flash is the preferred managed setup-agent tier on 96 GB+ Apple Silicon Macs. Q35/Qwen setup agents are lower-memory or comparison alternatives, not upgrades from DeepSeek V4 Flash.

text-chat-gemma4 is also the default chat model in the CLI. You can install it explicitly with mere.run model pull text-chat-gemma4, or let supported runtime paths resolve a Hugging Face snapshot on first use unless you point them at a local model root.

Useful environment variables for that path:

MERERUN_HUB_CACHE: override the native Hugging Face snapshot cache path

vision-segment-sam31 packages the native SAM 3.1 segmentation and tracking runtime used by mere.run vision segment, mere.run vision track, and mere.run vision track-live. Managed or local SAM roots are expected to contain:

config.json
tokenizer/
model.safetensors or model.safetensors.index.json

The manifest for this package advertises both vision_segmentation and vision_tracking capabilities.

image-hidream-o1 and image-hidream-o1-dev map to the public HiDream O1 image checkpoints:

HiDream-ai/HiDream-O1-Image
HiDream-ai/HiDream-O1-Image-Dev

HiDream O1 uses a unified pixel-transformer root layout rather than a VAE/text-encoder component tree. Managed or local roots are expected to contain:

config.json
tokenizer_config.json
tokenizer.json or vocab.json plus merges.txt
preprocessor_config.json
model.safetensors or model.safetensors.index.json

The native Swift runtime validates this layout, decodes the typed root configuration, prepares text/reference sample metadata, and runs generation through the downloaded Qwen3-VL decoder, vision tower, timestep embedder, patch embedder, generation-aware attention mask, and HiDream pixel head. Text-only generation, one-reference instruction editing, and multi-reference subject personalization share the same native path; reference modes additionally run Qwen3-VL vision preprocessing and replace chat-template image placeholders before denoising.

Runtime defaults come from the managed manifest:

image-hidream-o1-dev: 28 steps, CFG 0.0, fixed flash FlowMatch schedule
image-hidream-o1: 50 steps, CFG 5.0, shifted Flow UniPC schedule

Both checkpoints are large BF16 unified-transformer roots, about 33 GiB on disk each before filesystem compression effects. Expect high unified-memory pressure and prefer one-step smokes before full-quality 28/50 step runs.

Hugging Face Cache

Hub snapshots use the shared cache location managed by the runtime. Override it when you want large models on an external disk:

bash

export MERERUN_HUB_CACHE=/Volumes/Models/huggingface
swift run mere.run model pull image-zimage-nano

Model pulls are resumable at the file level through the Hugging Face snapshot cache. The CLI writes a managed-model symlink from the mere.run model store to the prepared snapshot when needed.

Hardware Support Checks

Managed pulls are gated by the local capability catalog before any download. The check uses Apple Silicon macOS plus unified-memory thresholds for each model family, then blocks models that are unlikely to run reliably on the current Mac.

Inspect the local recommendation first:

bash

swift run mere.run model capabilities
swift run mere.run model capabilities --all

If you are intentionally testing an unsupported setup, pass --allow-unsupported to mere.run model pull.

Model Store Behavior

The CLI resolves models in this order:

--models-root process override
MERERUN_MODELS_DIR
persisted local model-store setting
default ~/Library/Application Support/MereRun/models

Examples:

bash

# Pull into the default model store
swift run mere.run model pull image-zimage-nano

# Pull into a custom SSD-backed store
MERERUN_MODELS_DIR=/Volumes/Models swift run mere.run model pull text-chat-q35

# Inspect what is currently installed
swift run mere.run status
swift run mere.run model list
swift run mere.run model info image-klein-max

Music And Video Layouts

Two retained surfaces have more structure than a flat model root.

`music-acestep`

The top-level model root is:

text

.../models/music-acestep

That root may contain:

music-acestep-v15-turbo/
music-acestep-5hz-lm-1.7b/ or another supported LM subdirectory
Qwen3-Embedding-0.6B/
vae/

mere.run music generate auto-discovers that layout unless you override the root with --checkpoints-root or MERERUN_MUSIC_ACESTEP_ROOT.

`video-ltx-av`

The unified AV model root is:

text

.../models/video-ltx-av

mere.run video generate --variant unified-av can use that root directly or resolve it from MERERUN_VIDEO_LTX_MODEL_ROOT.

Model Sources ​

Canonical Managed Model IDs ​

Hugging Face Cache ​

Hardware Support Checks ​

Model Store Behavior ​

Music And Video Layouts ​

music-acestep ​

video-ltx-av ​

Model Sources

Canonical Managed Model IDs

Hugging Face Cache

Hardware Support Checks

Model Store Behavior

Music And Video Layouts

`music-acestep`

`video-ltx-av`