vllm.model_executor.layers.fused_moe.experts ¶

Modules:

Name	Description
`aiter_mxfp4_w4a8_moe`
`batched_deep_gemm_moe`
`cpu_moe`	CPU FP8 W8A16 and MXFP4 W4A16 fused MoE experts.
`cutlass_moe`	CUTLASS based Fused MoE kernels.
`deep_gemm_moe`
`fallback`
`flashinfer_cutedsl_batched_moe`
`flashinfer_cutedsl_moe`
`flashinfer_cutlass_moe`
`fused_batched_moe`	Fused batched MoE kernel.
`fused_humming_moe`	Fused MoE utilities for Humming.
`gpt_oss_triton_kernels_moe`
`lora_context`
`lora_experts_mixin`
`marlin_moe`	Fused MoE utilities for GPTQ.
`nvfp4_emulation_moe`	NVFP4 quantization emulation for MoE.
`ocp_mx_emulation_moe`	OCP MX quantization emulation for MoE.
`rocm_aiter_moe`
`triton_cutlass_moe`
`triton_deep_gemm_moe`
`triton_moe`	Triton-based MoE expert implementations.
`trtllm_bf16_moe`
`trtllm_fp8_moe`
`trtllm_mxfp4_moe`
`trtllm_nvfp4_moe`