๐Ÿ‘‹ Training support for transformers/megatron backends

#20
by study-hjt - opened

๐Ÿš€ ms-swift/mcore-bridge supports training gemma-4-12B-it using the transformers/megatron backend.
PR: https://github.com/modelscope/ms-swift/pull/9487, https://github.com/modelscope/mcore-bridge/pull/108
shell: https://github.com/modelscope/ms-swift/tree/main/examples/models/gemma4

Hi @study-hjt -

Thanks for sharing this! It's Great to see support for training gemma-4-12B-it with the Transformers/Megatron backend. The PRs will be helpful for the community.

Sign up or log in to comment