maxtext.models.qwen3_5 module

maxtext.models.qwen3_5 module#

Qwen3.5 family of model decoder layers.

class maxtext.models.qwen3_5.Qwen3_5GatedDeltaNet(*args, **kwargs)[source]#

Qwen3.5 GatedDeltaNet layer that is identical to Qwen3-Next GatedDeltaNet

Parameters:

Return type:

Any

class maxtext.models.qwen3_5.Qwen3_5FullAttention(*args, **kwargs)[source]#

Qwen3.5 Gated Attention layer that is identical to Qwen3-Next

Parameters:

Return type:

Any

class maxtext.models.qwen3_5.Qwen3_5SparseMoEBlock(*args, **kwargs)[source]#

Shares same MoE code as Qwen3-Next

Parameters:

Return type:

Any

class maxtext.models.qwen3_5.Qwen3_5ScannableBlock(*args, **kwargs)[source]#

Bases: Module

Scanned Structure for Text-only Architecture, explicitly invoking Qwen3_5 layers.

Parameters:

Return type:

Any

class maxtext.models.qwen3_5.Qwen3_5DecoderLayer(*args, **kwargs)[source]#

Bases: Module

This layer is a hybrid, capable of functioning as either: 1. A standard attention + MoE layer. 2. A linear attention + MoE layer.

Parameters:

Return type:

Any