Appearance
Model Sources
This repo supports two model paths:
- Hugging Face snapshots pulled into the local mere.run model store with
mere.run model pull - Explicit local paths passed to commands with
--model,--model-root, or equivalent command-specific options
The public repo no longer supports private model archives, R2 credentials, or a packaged central model host.
The canonical local model store is:
text
~/Library/Application Support/MereRun/modelsOverride that with MERERUN_MODELS_DIR or --models-root.
Canonical Managed Model IDs
mere.run model pull works for catalog entries that have a Hugging Face source:
| Category | Hugging Face pull IDs |
|---|---|
| Image | image-klein-nano, image-klein-base, image-klein-max, image-zimage-nano, image-zimage-base, image-zimage-max, image-hidream-o1, image-hidream-o1-dev |
| Text chat | text-chat-gemma4, text-chat-gemma4-nano, text-chat-gemma4-max, text-chat-q35, text-chat-q35-nano, text-agent-deepseek-v4-flash |
| Text code / agents | text-agent-qwen35-9b, text-code-qwen3 |
| Text embed | text-embed-qwen3-0.6b |
| Text anonymize | text-anonymize-privacy-filter |
| Speech TTS | speech-tts-qwen3-nano, speech-tts-qwen3-customvoice |
| Speech ASR | speech-asr-qwen3, speech-asr-parakeet |
| Vision | vision-ocr-lighton, vision-segment-sam31, vision-ground-falcon-perception |
| Music | music-acestep |
| Video | video-ltx-av |
Some legacy/local IDs remain in the catalog so existing installs and explicit local paths keep working:
text
image-klein-shared
text-chat-mebot
text-chat-psi-agentimage-klein-shared is an internal shared-component install shape, and the text-chat IDs listed here remain local-path-only until they have public Hugging Face sources.
text-agent-qwen35-9b is the low-memory setup-agent model. It uses the public Hugging Face source unsloth/Qwen3.5-9B-GGUF and selects Qwen3.5-9B-Q4_K_M.gguf.
text-agent-deepseek-v4-flash is the preferred managed setup-agent tier on 96 GB+ Apple Silicon Macs. Q35/Qwen setup agents are lower-memory or comparison alternatives, not upgrades from DeepSeek V4 Flash.
text-chat-gemma4 is also the default chat model in the CLI. You can install it explicitly with mere.run model pull text-chat-gemma4, or let supported runtime paths resolve a Hugging Face snapshot on first use unless you point them at a local model root.
Useful environment variables for that path:
MERERUN_HUB_CACHE: override the native Hugging Face snapshot cache path
vision-segment-sam31 packages the native SAM 3.1 segmentation and tracking runtime used by mere.run vision segment, mere.run vision track, and mere.run vision track-live. Managed or local SAM roots are expected to contain:
config.jsontokenizer/model.safetensorsormodel.safetensors.index.json
The manifest for this package advertises both vision_segmentation and vision_tracking capabilities.
image-hidream-o1 and image-hidream-o1-dev map to the public HiDream O1 image checkpoints:
HiDream-ai/HiDream-O1-ImageHiDream-ai/HiDream-O1-Image-Dev
HiDream O1 uses a unified pixel-transformer root layout rather than a VAE/text-encoder component tree. Managed or local roots are expected to contain:
config.jsontokenizer_config.jsontokenizer.jsonorvocab.jsonplusmerges.txtpreprocessor_config.jsonmodel.safetensorsormodel.safetensors.index.json
The native Swift runtime validates this layout, decodes the typed root configuration, prepares text/reference sample metadata, and runs generation through the downloaded Qwen3-VL decoder, vision tower, timestep embedder, patch embedder, generation-aware attention mask, and HiDream pixel head. Text-only generation, one-reference instruction editing, and multi-reference subject personalization share the same native path; reference modes additionally run Qwen3-VL vision preprocessing and replace chat-template image placeholders before denoising.
Runtime defaults come from the managed manifest:
image-hidream-o1-dev: 28 steps, CFG 0.0, fixed flash FlowMatch scheduleimage-hidream-o1: 50 steps, CFG 5.0, shifted Flow UniPC schedule
Both checkpoints are large BF16 unified-transformer roots, about 33 GiB on disk each before filesystem compression effects. Expect high unified-memory pressure and prefer one-step smokes before full-quality 28/50 step runs.
Hugging Face Cache
Hub snapshots use the shared cache location managed by the runtime. Override it when you want large models on an external disk:
bash
export MERERUN_HUB_CACHE=/Volumes/Models/huggingface
swift run mere.run model pull image-zimage-nanoModel pulls are resumable at the file level through the Hugging Face snapshot cache. The CLI writes a managed-model symlink from the mere.run model store to the prepared snapshot when needed.
Hardware Support Checks
Managed pulls are gated by the local capability catalog before any download. The check uses Apple Silicon macOS plus unified-memory thresholds for each model family, then blocks models that are unlikely to run reliably on the current Mac.
Inspect the local recommendation first:
bash
swift run mere.run model capabilities
swift run mere.run model capabilities --allIf you are intentionally testing an unsupported setup, pass --allow-unsupported to mere.run model pull.
Model Store Behavior
The CLI resolves models in this order:
--models-rootprocess overrideMERERUN_MODELS_DIR- persisted local model-store setting
- default
~/Library/Application Support/MereRun/models
Examples:
bash
# Pull into the default model store
swift run mere.run model pull image-zimage-nano
# Pull into a custom SSD-backed store
MERERUN_MODELS_DIR=/Volumes/Models swift run mere.run model pull text-chat-q35
# Inspect what is currently installed
swift run mere.run status
swift run mere.run model list
swift run mere.run model info image-klein-maxMusic And Video Layouts
Two retained surfaces have more structure than a flat model root.
music-acestep
The top-level model root is:
text
.../models/music-acestepThat root may contain:
music-acestep-v15-turbo/music-acestep-5hz-lm-1.7b/or another supported LM subdirectoryQwen3-Embedding-0.6B/vae/
mere.run music generate auto-discovers that layout unless you override the root with --checkpoints-root or MERERUN_MUSIC_ACESTEP_ROOT.
video-ltx-av
The unified AV model root is:
text
.../models/video-ltx-avmere.run video generate --variant unified-av can use that root directly or resolve it from MERERUN_VIDEO_LTX_MODEL_ROOT.