Glm 4.7 sft ms-swift #52

Draft
pawalt wants to merge 2 commits into main from cursor/glm-4-7-sft-ms-swift-4f23
pawalt (Member) commented Feb 13, 2026

This PR adds a new example demonstrating supervised fine-tuning (SFT) of the GLM-4.7 model with ms-swift on Modal. It mirrors the existing Megatron SFT example: it uses the same donmaclean/LongMIT-128K dataset and replicates that example's data preparation and prompt formatting logic.

Checklist

  • Example is documented with comments throughout, in a Literate Programming style.
  • Example does not require third-party dependencies to be installed locally
  • Example follows the style guide
  • Example pins its dependencies
    • Example pins container images to a stable tag, not a dynamic tag like latest
    • Example specifies a python_version for the base image, if it is used (via nvcr.io/nvidia/pytorch:23.10-py3)
    • Example pins all dependencies to at least minor version, ~=x.y.z or ==x.y
    • Example dependencies with version < 1 are pinned to patch version, ==0.y.z
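
The pinning rules in the checklist above can be sketched as a small check. This is only an illustration under stated assumptions: `is_pinned` is a hypothetical helper (not part of this repo or of Modal's tooling), it handles only simple `name==version` and `name~=version` strings, and it reads "at least minor version" as `==x.y`, `==x.y.z`, or `~=x.y.z`.

```python
import re

# Hypothetical helper: accepts only "name==version" or "name~=version".
_SPEC = re.compile(
    r"^(?P<name>[A-Za-z0-9_.\-\[\]]+)(?P<op>==|~=)(?P<version>\d+(?:\.\d+)*)$"
)


def is_pinned(requirement: str) -> bool:
    """Return True if the requirement string satisfies the checklist's pin rules."""
    match = _SPEC.match(requirement.strip())
    if match is None:
        # Unpinned ("pkg") or range specifiers (">=", "<") are rejected.
        return False
    op = match.group("op")
    parts = match.group("version").split(".")
    if parts[0] == "0":
        # Versions below 1 must be pinned to an exact patch: ==0.y.z
        return op == "==" and len(parts) >= 3
    if op == "==":
        # ==x.y or ==x.y.z fixes at least the minor version.
        return len(parts) >= 2
    # ~=x.y.z allows only patch updates; ~=x.y would allow minor updates.
    return len(parts) >= 3
```

For example, `is_pinned("somepkg==2.5")` and `is_pinned("somepkg~=2.5.1")` pass, while `is_pinned("somepkg~=2.5")` and `is_pinned("otherpkg==0.14")` do not (the package names here are placeholders).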

(Modal's internal guide page for this repo is Multi-node examples guidance.)

Outside contributors

You're great! Thanks for your contribution.

cursoragent and others added 2 commits February 13, 2026 22:12
Co-authored-by: Peyton Walters <pawalt@hey.com>
Co-authored-by: Peyton Walters <pawalt@hey.com>

cursor Bot commented Feb 13, 2026

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
