On SWE-Bench Verified, the model achieved a score of 70.6%. This performance is notably competitive when placed alongside significantly larger models; it outpaces DeepSeek-V3.2, which scores 70.2%, ...
To complete the above system, the author’s main research work includes: 1) Office document automation based on python-docx. 2) Use the Django framework to develop the website.