3 8 2

Zhi Zheng

zz1358m

https://zz1358m.github.io/zhizheng.github.io/

AI & ML interests

LLM reasoning, Trustworthy LLM, LLM application, Neural combinatorial optimization.

Recent Activity

upvoted a paper 10 days ago

Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR

upvoted a paper 11 days ago

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

upvoted a paper 13 days ago

Inference-Time Attribute Distribution Alignment for Unconditional Diffusion

View all activity

Organizations

upvoted a paper 10 days ago

Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR

Paper • 2605.19282 • Published May 19 • 9

upvoted a paper 11 days ago

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

Paper • 2606.06036 • Published 22 days ago • 73

upvoted a paper 13 days ago

Inference-Time Attribute Distribution Alignment for Unconditional Diffusion

Paper • 2605.07456 • Published May 8 • 2

upvoted a paper 14 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 15 days ago • 140

authored a paper 15 days ago

One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA

Paper • 2606.10572 • Published 17 days ago • 16

upvoted a paper 16 days ago

One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA

Paper • 2606.10572 • Published 17 days ago • 16

submitted a paper to Daily Papers 16 days ago

One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA

Paper • 2606.10572 • Published 17 days ago • 16

published a model 16 days ago

zz1358m/Latent-Memory-Master

Updated May 8

upvoted a paper 17 days ago

Why Muon Outperforms Adam: A Curvature Perspective

Paper • 2606.04662 • Published 23 days ago • 10

updated a model about 2 months ago

zz1358m/Latent-Memory-Master

Updated May 8

liked a dataset about 2 months ago

jiayingwu19/DeceptionDecoded

Preview • Updated Apr 30 • 23 • 2

updated a model 5 months ago

zz1358m/ATP-Latent-Master

Updated Jan 31 • 1

published a model 5 months ago

zz1358m/ATP-Latent-Master

Updated Jan 31 • 1

authored a paper 5 months ago

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Paper • 2601.21598 • Published Jan 29 • 10

upvoted a paper 5 months ago

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Paper • 2601.21598 • Published Jan 29 • 10

submitted a paper to Daily Papers 5 months ago

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Paper • 2601.21598 • Published Jan 29 • 10

liked a model 7 months ago

zz1358m/SofT-GRPO-master

Updated Nov 13, 2025 • 8

updated a model 7 months ago

zz1358m/SofT-GRPO-master

Updated Nov 13, 2025 • 8

authored a paper 8 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published Nov 9, 2025 • 18

commented a paper 8 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published Nov 9, 2025 • 18 •

Zhi Zheng

AI & ML interests

Recent Activity

Organizations

zz1358m's activity