maxtext.layers.encoders module#
Module for encoder layers.
- class maxtext.layers.encoders.VisionEncoder(*args, **kwargs)[source]#
Bases:
ModuleVision encoder to encode images into soft tokens.
- Parameters:
args (Any)
kwargs (Any)
- Return type:
Any
- class maxtext.layers.encoders.AudioEncoder(*args, **kwargs)[source]#
Bases:
ModuleAudio encoder to encode audio features into soft tokens.
- Parameters:
args (Any)
kwargs (Any)
- Return type:
Any