maxtext.models.mistral module

maxtext.models.mistral module#

Transformer model definition.

class maxtext.models.mistral.MistralDecoderLayer(*args, **kwargs)[source]#

Bases: Module

Transformer decoder layer that attends to the encoder.

Parameters:
  • args (Any)

  • kwargs (Any)

Return type:

Any