|
|
|
|
|
<br>DeepSeek open-sourced DeepSeek-R1, an LLM [fine-tuned](https://git.jiewen.run) with [reinforcement learning](https://nmpeoplesrepublick.com) (RL) to enhance thinking capability. DeepSeek-R1 attains outcomes on par with [OpenAI's](http://ufidahz.com.cn9015) o1 design on several criteria, including MATH-500 and SWE-bench.<br> |