Today News

MiniMax M3 open-weight AI model features million-token context window

 

The technical architecture, known as MiniMax Sparse Attention, optimizes GPU performance by processing data blocks sequentially to reduce memory access overhead. Internal benchmarks indicate that M3 excels in autonomous software development tasks, including independent debugging and kernel optimization for Nvidia Hopper hardware. During testing, the model successfully increased hardware utilization from 7.6 to over 71 percent during complex matrix multiplication tasks.

MiniMax M3 open-weight AI model features million-token context window
MiniMax M3 open-weight AI model features million-token context window (image from the-decoder.com)

MiniMax plans to release the model weights and a technical report on Hugging Face and GitHub within the next ten days. While the model is currently accessible via an API with tiered pricing, the company is also open-sourcing its associated agent application, MiniMax Code. Despite these technical milestones, the company is reportedly facing a lawsuit in California from major media entities regarding alleged intellectual property theft.

     
                    WhatsApp Channel                             Join Now            
   
                    Telegram Channel                             Join Now            
   
                    Instagram follow us                             Join Now