MiniMax Releases M2 Technical Report; Forge System Achieves Up to 40x Training Speedup

According to Beating, MiniMax has released its M2 technical report on arXiv, detailing its flagship MoE (mixture-of-experts) architecture and its Agent training system, Forge. The report describes how Forge optimizes long-context Agent reinforcement learning through windowed FIFO scheduling and prefix-tree merging, achieving a training speedup of up to 40x.
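The summary above does not say how Forge implements prefix-tree merging, so the following is only a generic illustration of the underlying idea, not MiniMax's method. It assumes that several agent rollouts share a common token prefix (for example, the same system prompt and task description) and shows how storing them in a prefix tree lets the shared part be counted once. The names `TrieNode`, `count_tokens_after_merge`, and the sample `rollouts` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class TrieNode:
    # Maps a token to the child node that continues the sequence.
    children: dict = field(default_factory=dict)


def count_tokens_after_merge(sequences):
    """Insert token sequences into a prefix tree and return the number of
    distinct tree nodes, i.e. tokens that would need processing once."""
    root = TrieNode()
    unique = 0
    for seq in sequences:
        node = root
        for tok in seq:
            if tok not in node.children:
                node.children[tok] = TrieNode()
                unique += 1  # a new node means this token is not shared
            node = node.children[tok]
    return unique


# Hypothetical rollouts that share the prefix ["sys", "task"].
rollouts = [
    ["sys", "task", "read", "edit"],
    ["sys", "task", "read", "test"],
    ["sys", "task", "plan"],
]

naive = sum(len(r) for r in rollouts)         # every rollout processed separately
merged = count_tokens_after_merge(rollouts)   # shared prefixes counted once
print(f"naive tokens: {naive}, merged tokens: {merged}")
```

In this toy case the three rollouts contain 11 tokens in total but only 6 distinct prefix-tree nodes. The longer the shared context relative to the divergent suffixes, the larger the saving, which is presumably why the technique matters for long-context Agent training.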

M2.7 demonstrated autonomous agent self-evolution capabilities, completing over 100 cycles of analysis, code revision, and testing. On benchmarks, M2.7 scored 56.22% on SWE-Pro and 52.7% on Multi-SWE-bench, and posted a 66.6% average reward rate on MLE Bench, approaching the performance of Gemini 3.1.
