Sat.Nov 02, 2024

article thumbnail

Jamba 1.5: Hybrid Mamba-Transformer Model for Advanced NLP

Analytics Vidhya

Jamba 1.5 is an instruction-tuned large language model that comes in two versions: Jamba 1.5 Large with 94 billion active parameters and Jamba 1.5 Mini with 12 billion active parameters. It combines the Mamba Structured State Space Model (SSM) with the traditional Transformer architecture. This model, developed by AI21 Labs, can process a 256K effective […] The post Jamba 1.5: Hybrid Mamba-Transformer Model for Advanced NLP appeared first on Analytics Vidhya.

Modeling 244