Understand memory requirements for Mixture-of-Experts (MoE) models versus standard models
SelfHostLLM distinguishes between total parameters and active parameters for MoE models (Mixtral, DeepSeek V3, Qwen3 MoE, GLM-4.7), showing users they only need RAM for active experts, not the full model size, enabling more efficient deployments.
