Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
7
30
30
Sangwoo Park
Sangsang
Follow
Jackson0018's profile picture
YuminChoi's profile picture
jiongdao's profile picture
15 followers
·
30 following
swgger
AI & ML interests
I do LLM post-training research (KAIST AI)
Recent Activity
updated
a model
6 days ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p75_fw0p25_ema0p999_ep30
published
a model
6 days ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p75_fw0p25_ema0p999_ep30
updated
a model
6 days ago
Sangsang/feedback_asymmetric_kl_fixed_ema_Qwen2.5-7B-Instruct_bw0p25_fw0p75_ema0p999_ep30
View all activity
Organizations
None yet
Sangsang
's models
218
Sort: Recently updated
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-64-pm-e1
Text Generation
•
Updated
Jan 17
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-32-pm-e3
Text Generation
•
Updated
Jan 17
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-32-pm-e2
Text Generation
•
Updated
Jan 17
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-16-pm-e3
Text Generation
•
Updated
Jan 17
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-16-pm-e2
Text Generation
•
Updated
Jan 17
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-16-pm-e1
Text Generation
•
Updated
Jan 17
Sangsang/qwen3-8B-star-1-41K-32-pm
Text Generation
•
Updated
Jan 16
Sangsang/qwen3-4B-star-1-41K-32-pm
Text Generation
•
Updated
Jan 16
•
1
Sangsang/qwen3-1.7B-star-1-41K-32-pm
Text Generation
•
Updated
Jan 16
•
2
Sangsang/qwen3-0.6B-star-1-41K-32-pm
Text Generation
•
Updated
Jan 16
Sangsang/qwen3-8B-star-1-8B-modified-32-pm
Text Generation
•
Updated
Jan 16
Sangsang/qwen3-4B-star-1-4B-modified-32-pm
Text Generation
•
Updated
Jan 16
Sangsang/qwen3-1.7B-star-1-1.7B-modified-32-pm
Text Generation
•
Updated
Jan 16
•
3
Sangsang/qwen3-0.6B-star-1-0.6B-modified-32-pm
Text Generation
•
Updated
Jan 16
Sangsang/thinksafe-1.7B-n1-ablation_R32_BZ64_Gen8
Text Generation
•
Updated
Jan 14
Sangsang/DS_CT_20250811_LIN_1024_LOUT_4096_N100_NH100_B8_LEN_NAIVE
Text Generation
•
Updated
Jan 13
Sangsang/thinksafe-0.6B-n1-ablation_R32_BZ32_Gen8
Updated
Jan 10
Sangsang/DS_CT_20250811_LIN_1024_LOUT_4096_N200_NH800_B8_LEN_NAIVE
Text Generation
•
Updated
Jan 7
Sangsang/DS_CT_20250811_LIN_1024_LOUT_4096_N200_NH800_B8_LEN_PRIOR
Text Generation
•
Updated
Jan 7
Sangsang/DS_CT_20250811_N200_NH800_B2_LEN_PRIOR
Text Generation
•
Updated
Jan 6
Sangsang/DS_CT_20250811_N0_NH1000_B2_LEN_PRIOR
Text Generation
•
Updated
Jan 6
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-32-pm-kl
Text Generation
•
Updated
Jan 2
Sangsang/R1-8B-thinksafe-r1-8B-ablation-32-pm-kl
Text Generation
•
Updated
Jan 2
Sangsang/R1-7B-thinksafe-r1-7B-ablation-32-pm-kl
Text Generation
•
Updated
Jan 2
Sangsang/R1-8B-thinksafe-r1-8B-32-ep3-pm
Text Generation
•
Updated
Jan 2
Sangsang/R1-8B-thinksafe-r1-8B-32-ep2-pm
Text Generation
•
Updated
Jan 2
Sangsang/R1-7B-thinksafe-r1-7B-32-ep3-pm
Text Generation
•
Updated
Jan 2
•
1
Sangsang/R1-7B-thinksafe-r1-7B-32-ep2-pm
Text Generation
•
Updated
Jan 2
Sangsang/R1-1.5B-thinksafe-r1-1.5B-32-ep3-pm
Text Generation
•
Updated
Jan 2
Sangsang/qwen3-8B-thinksafe-8B-n1-32-ep2-pm
Text Generation
•
Updated
Jan 2
Previous
1
...
3
4
5
6
7
8
Next