×
Community Blog Alibaba Cloud Launches Open-Source Math LLMs that Can Solve Complex Math Problems

Alibaba Cloud Launches Open-Source Math LLMs that Can Solve Complex Math Problems

Alibaba Cloud has launched Qwen2-Math, an advanced open-source LLM designed to solve complex math problems.

Alibaba Cloud’s new Qwen2-Math is an LLM capable of solving complex mathematical problems, even those from the International Mathematical Olympiad.

Until recently, large language models often struggle with solving mathematical problems due to less robust reasoning skills. To overcome this, Qwen2-Math was trained on large-scale, high-quality mathematical web texts, books, codes, and exam questions.

As a result, the models achieved strong performance in linguistically diverse grade school math word problems and even Olympiad-level bilingual multimodal scientific problems.

They also demonstrated strong results in Chinese mathematical benchmarks, such as the Chinese college entrance exam known as Gaokao. The largest math-specific model in the series, Qwen2-Math-72B-Instruct, outperformed state-of-the-art models on the MATH benchmark—a dataset of 12,500 challenging competition mathematics problems.

Alibaba_Cloud_Qwen2_MATH
Qwen2-Math-72B-Instruct outcompetes other state-of-the-art models on the MATH Benchmark

Developers, researchers and enterprises can access the models, including base models and their instruction-tuned versions trained on more specialized datasets on open-source communities including GitHub, Hugging Face and Modelscope. The models come in a variety of sizes, including 1.5 billion, 7 billion, and 72 billion parameters.

English is the primary language supported by the models at this time, though bilingual versions supporting English and Chinese are in the pipeline, according to Alibaba Cloud.


This article was originally published on Alizila, written by Elizabeth Utley.

0 1 0
Share on

Alibaba Cloud Community

1,062 posts | 262 followers

You may also like

Comments