Open Reproduction of DeepSeek-R1
DeepSeek-R1reproductionreasoningsupervised fine-tuninggroup relative policy optimizationAImachine learningnatural language processingcode generationreasoningproblem-solving.
Author: yogthos
Date: 6/11/2026
Article Summary:
This repository is a fully open reproduction of DeepSeek-R1, a project that aims to replicate the reasoning capabilities of DeepSeek-R1 using a combination of supervised fine-tuning and group relative policy optimization.