
Jiwon Song

jiwonsong

https://jiwonsong-dev.github.io/

AI & ML interests

Efficient AI | Ph.D. Student @ SNU-VLSI

Recent Activity

authored a paper 19 days ago

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

Paper • 2605.16839 • Published 22 days ago • 13

upvoted a paper 19 days ago

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

Paper • 2605.16839 • Published 22 days ago • 13

submitted a paper to Daily Papers 19 days ago

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

Paper • 2605.16839 • Published 22 days ago • 13

upvoted a paper 25 days ago

Teaching Language Models to Think in Code

Paper • 2605.07237 • Published 27 days ago • 30

updated a model about 1 month ago

jiwonsong/SeerAttention-Gemma3-12B-AttnGates

Text Generation • Updated Apr 27 • 5

published a model about 1 month ago

jiwonsong/SeerAttention-Gemma3-12B-AttnGates

Text Generation • Updated Apr 27 • 5

updated a model 2 months ago

jiwonsong/SeerAttention-Qwen3-8B-AttnGates

Text Generation • Updated Apr 6 • 1 • 1

published a model 2 months ago

jiwonsong/SeerAttention-Qwen3-8B-AttnGates

Text Generation • Updated Apr 6 • 1 • 1

upvoted 3 papers 3 months ago

authored a paper 4 months ago

RelayGen: Intra-Generation Model Switching for Efficient Reasoning

Paper • 2602.06454 • Published Feb 6 • 12

liked a model 4 months ago

dongwonjo/Llama-1-13B-BinaryMoS-E4

13B • Updated Sep 9, 2024 • 4 • 1

upvoted 2 papers 4 months ago

Squeezing Large-Scale Diffusion Models for Mobile

Paper • 2307.01193 • Published Jul 3, 2023 • 2

RelayGen: Intra-Generation Model Switching for Efficient Reasoning

Paper • 2602.06454 • Published Feb 6 • 12

submitted a paper to Daily Papers 4 months ago

RelayGen: Intra-Generation Model Switching for Efficient Reasoning

Paper • 2602.06454 • Published Feb 6 • 12

authored a paper 4 months ago

Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection

Paper • 2602.03216 • Published Feb 3 • 13

upvoted a paper 4 months ago

L4Q: Parameter Efficient Quantization-Aware Training on Large Language Models via LoRA-wise LSQ

Paper • 2402.04902 • Published Feb 7, 2024 • 5

published a Space 4 months ago

README

🏢

upvoted a paper 4 months ago

LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents

Paper • 2602.01053 • Published Feb 1 • 8
