Architecture Reading Map
This repo is easiest to understand by starting at the CLI surface and then following one runtime family at a time. The goal of this map is not to explain every model detail, but to give contributors a reliable reading order.
For the broader documentation set, start at mere.run Documentation.
Start here
- CLI entrypoint: Sources/MereRunCLI/MereRunCLI.swift
- Shared CLI helpers: Sources/MereRunCLI/Support
- Model paths and manifests:
  - Sources/MereRunCore/MereRunModelPaths.swift
  - Sources/MereRunCore/MereRunModelManifest.swift
  - Sources/MereRunCore/ModelResolver.swift
If you want to understand what a command does end to end, start at the command file, then jump to the family entrypoint listed below.
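As a rough illustration of that two-step lookup, the sketch below pairs a few command files with the runtime entrypoints this map lists for them. The `ReadingPath` type, the `readingPaths` table, and the modality keys are hypothetical names invented for this example and are not part of the repo; only the file paths are taken from this map.

```swift
// Hypothetical helper type for this illustration only; not part of the repo.
struct ReadingPath {
    let command: String
    let runtimeEntrypoint: String
}

// Paths copied from this reading map; the dictionary keys are made up here.
let readingPaths: [String: ReadingPath] = [
    "image generation": ReadingPath(
        command: "Sources/MereRunCLI/Commands/ImageGenerateCommand.swift",
        runtimeEntrypoint: "Sources/MereRunCore/Flux2Klein/Flux2KleinGenerator.swift"),
    "speech synthesis": ReadingPath(
        command: "Sources/MereRunCLI/Commands/SpeechSynthesizeCommand.swift",
        runtimeEntrypoint: "Sources/AudioTTS/Qwen3TTS/Qwen3TTSGenerator.swift"),
    "ocr": ReadingPath(
        command: "Sources/MereRunCLI/Commands/VisionOCRCommand.swift",
        runtimeEntrypoint: "Sources/MereRunCore/LightOnOCR/LightOnOCRGenerator.swift"),
    "music generation": ReadingPath(
        command: "Sources/MereRunCLI/Commands/MusicGenerateCommand.swift",
        runtimeEntrypoint: "Sources/MereRunCore/ACEStep/ACEStepPipeline.swift"),
]

// Returns the files to read, command first, or an empty list for an unknown key.
func readingOrder(for modality: String) -> [String] {
    guard let path = readingPaths[modality] else { return [] }
    return [path.command, path.runtimeEntrypoint]
}

for file in readingOrder(for: "music generation") {
    print(file)
}
```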
Image families
Klein image generation:
- CLI: Sources/MereRunCLI/Commands/ImageGenerateCommand.swift
- Runtime entrypoint: Sources/MereRunCore/Flux2Klein/Flux2KleinGenerator.swift
- Read next:
  - Sources/MereRunCore/Flux2Klein/Flux2KleinGenerator+ModelLoading.swift
  - Sources/MereRunCore/Flux2Klein/Flux2KleinGenerator+Generation.swift
  - Sources/MereRunCore/Flux2Klein/Flux2KleinGenerator+Chat.swift
ZImage generation:
- Runtime entrypoint: Sources/MereRunCore/ZImageTurbo/ZImageTurboGenerator.swift
- Read next:
  - Sources/MereRunCore/ZImageTurbo/ZImageTurboGenerator+ModelLoading.swift
  - Sources/MereRunCore/ZImageTurbo/ZImageTurboGenerator+Inference.swift
  - Sources/MereRunCore/ZImageTurbo/ZImageTurboGenerator+LoRA.swift
Qwen image editing:
- Runtime entrypoint: Sources/MereRunCore/QwenImageEdit/QwenImageEditGenerator.swift
- Read next:
  - Sources/MereRunCore/QwenImageEdit/QwenImageEditGenerator+ModelLoading.swift
  - Sources/MereRunCore/QwenImageEdit/QwenImageEditGenerator+Encoding.swift
Shared text encoder stack used by image models:
- Public entrypoint: Sources/MereRunCore/ZImageTurbo/Model/TextEncoder/TextEncoder.swift
- Architecture internals:
  - Sources/MereRunCore/ZImageTurbo/Model/TextEncoder/TextEncoder+RoPE.swift
  - Sources/MereRunCore/ZImageTurbo/Model/TextEncoder/TextEncoder+Blocks.swift
Speech stack
Speech synthesis command path:
- CLI: Sources/MereRunCLI/Commands/SpeechSynthesizeCommand.swift
- Runtime entrypoint: Sources/AudioTTS/Qwen3TTS/Qwen3TTSGenerator.swift
- Read next:
  - Sources/AudioTTS/Qwen3TTS/Qwen3TTSGenerator+Loading.swift
  - Sources/AudioTTS/Qwen3TTS/Qwen3TTSGenerator+Generation.swift
  - Sources/AudioTTS/Qwen3TTS/Qwen3TTSGenerator+Support.swift
Speech tokenizer internals:
- Public tokenizer surface: Sources/AudioTTS/Qwen3TTS/Qwen3TTSSpeechTokenizer.swift
- Decoder stack: Sources/AudioTTS/Qwen3TTS/Qwen3TTSSpeechTokenizer+Decoder.swift
- Encoder stack: Sources/AudioTTS/Qwen3TTS/Qwen3TTSSpeechTokenizer+Encoder.swift
Speech transcription:
- CLI: Sources/MereRunCLI/Commands/SpeechTranscribeCommand.swift
- Runtime roots:
  - Sources/AudioSTT/Qwen3ASR/Qwen3ASRGenerator.swift
  - Sources/AudioSTT/Parakeet/ParakeetGenerator.swift
OCR and vision
OCR:
- CLI: Sources/MereRunCLI/Commands/VisionOCRCommand.swift
- Runtime entrypoint: Sources/MereRunCore/LightOnOCR/LightOnOCRGenerator.swift
- Read next:
  - Sources/MereRunCore/LightOnOCR/LightOnOCRGenerator+Loading.swift
  - Sources/MereRunCore/LightOnOCR/LightOnOCRGenerator+Inference.swift
  - Sources/MereRunCore/LightOnOCR/LightOnOCRSupport.swift
Captioning and inspect flows:
- CLI:
  - Sources/MereRunCLI/Commands/VisionCaptionCommand.swift
  - Sources/MereRunCLI/Commands/VisionInspectCommand.swift
  - Sources/MereRunCLI/Commands/VisionSegmentCommand.swift
- Runtime roots:
  - Sources/MereRunCore/VLM/QwenVLCaptioner.swift
  - Sources/MereRunCore/VLM/Qwen3VLAutoCaptioner.swift
  - Sources/MereRunCore/VLM/QwenVLEncoder.swift
  - Sources/MereRunCore/ZImageTurbo/Model/TextEncoder/Vision/QwenVisionAttention.swift
Segmentation and tracking runtime:
- CLI:
  - Sources/MereRunCLI/Commands/VisionSegmentCommand.swift
  - Sources/MereRunCLI/Commands/VisionTrackCommand.swift
  - Sources/MereRunCLI/Commands/VisionTrackLiveCommand.swift
- Native runtime:
  - Sources/MereRunCore/SAM3/SAM31Config.swift
  - Sources/MereRunCore/SAM3/SAM31Resources.swift
  - Sources/MereRunCore/SAM3/SAM31Tokenizer.swift
  - Sources/MereRunCore/SAM3/SAM31Model.swift
  - Sources/MereRunCore/SAM3/SAM31InteractiveSAM.swift
  - Sources/MereRunCore/SAM3/SAM31Prompts.swift
  - Sources/MereRunCore/SAM3/SAM31ImageSegmenter.swift
  - Sources/MereRunCore/SAM3/SAM31VideoIO.swift
  - Sources/MereRunCore/SAM3/SAM31VideoTracker.swift
  - Sources/MereRunCore/SAM3/SAM31CameraCapture.swift
Music and video
Music generation:
- CLI: Sources/MereRunCLI/Commands/MusicGenerateCommand.swift
- Runtime entrypoint: Sources/MereRunCore/ACEStep/ACEStepPipeline.swift
- Read next:
  - Sources/MereRunCore/ACEStep/ACEStepPipeline+Prompting.swift
  - Sources/MereRunCore/ACEStep/ACEStepPipeline+Generation.swift
Video generation:
- CLI: Sources/MereRunCLI/Commands/VideoCommand.swift (VideoGenerate and VideoExportLatents)
- Runtime root: Sources/MereRunCore/LTX/LTXDistilledLatentGenerator.swift
- Read next:
  - Sources/MereRunCore/LTX/LTXGemmaTextEncoder.swift
  - Sources/MereRunCore/LTX/LTXVideoMP4Writer.swift
The LTX file is still large because it also contains low-level model definitions. Read it in this order:
- public generation types and LTXDistilledLatentGenerator
- diffusion helpers such as denoiseLoop, latent conditioning, and image loading
- decoder and encoder support modules
- unified AV generator types near the end
Validation and diagnostics
The advanced image validator is intentionally separate from generation:
- CLI root: Sources/MereRunCLI/Commands/ImageValidateCommand.swift
- Read next:
  - Sources/MereRunCLI/Commands/ImageValidateCommand+VAE.swift
  - Sources/MereRunCLI/Commands/ImageValidateCommand+EncoderTransformer.swift
  - Sources/MereRunCLI/Commands/ImageValidateCommand+Pipeline.swift
Suggested reading order for new contributors
- Start with Sources/MereRunCLI/MereRunCLI.swift.
- Pick one modality and read the matching command file.
- Jump to that modality’s runtime entrypoint.
- Follow companion files in order: loading or preparation, then generation, then output.
- Only then drop into the larger model-definition files if you need implementation detail.
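The companion files in this map follow the common Swift convention of naming a file Type+Concern.swift when it holds an extension of Type. The sketch below shows that layout in miniature, assuming a made-up `ExampleGenerator` type; its members are hypothetical and do not mirror any generator in the repo. In a real package each extension would sit in its own file, as noted in the comments.

```swift
// Would live in ExampleGenerator.swift: the entrypoint type and its stored state.
// ExampleGenerator is hypothetical and exists only for this illustration.
struct ExampleGenerator {
    var steps: [String] = []
}

// Would live in ExampleGenerator+ModelLoading.swift: loading or preparation.
extension ExampleGenerator {
    mutating func loadModel() {
        steps.append("load")
    }
}

// Would live in ExampleGenerator+Generation.swift: the generation path.
extension ExampleGenerator {
    mutating func generate(prompt: String) -> String {
        steps.append("generate")
        return "output for \(prompt)"
    }
}

var generator = ExampleGenerator()
generator.loadModel()
let result = generator.generate(prompt: "a red fox")
print(result)
print(generator.steps)
```

Reading the base file first and then each extension file in call order follows the same loading, generation, output sequence suggested above.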