17 20 17

Alex Jinpeng Wang

Awiny

https://fingerrec.github.io

FingerRec

AI & ML interests

Multi-Modality Pre-training, Data-Centric AI, Video Self-supervised Learning

Recent Activity

liked a dataset about 1 month ago

CSU-JPG/IESBench

published a dataset about 1 month ago

CSU-JPG/IESBench

upvoted a paper about 1 month ago

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models

View all activity

Organizations

liked a dataset about 1 month ago

CSU-JPG/IESBench

Updated Feb 12 • 1.68k • 7

published a dataset about 1 month ago

CSU-JPG/IESBench

Updated Feb 12 • 1.68k • 7

upvoted a paper about 1 month ago

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models

Paper • 2602.10179 • Published Feb 10 • 6

submitted a paper to Daily Papers about 1 month ago

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models

Paper • 2602.10179 • Published Feb 10 • 6

upvoted a paper about 1 month ago

Olaf-World: Orienting Latent Actions for Video World Modeling

Paper • 2602.10104 • Published Feb 10 • 27

liked a model 4 months ago

CSU-JPG/Glance

Text-to-Image • Updated Dec 15, 2025 • 39 • • 14

upvoted 2 papers 4 months ago

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published Dec 2, 2025 • 30

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published Nov 14, 2025 • 47

liked a Space 4 months ago

VCode

🐨

Convert images to SVG code

updated a Space 5 months ago

README

📈

liked a dataset 5 months ago

CSU-JPG/Chart2Code

Updated Jan 21 • 830 • 5

updated a collection 5 months ago

🔱 Sailor2 Language Models

Collection

Sailing in South-East Asia with Inclusive Multilingual LLMs • 32 items • Updated 20 days ago • 30

upvoted 3 papers 5 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 103

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback

Paper • 2511.01678 • Published Nov 3, 2025 • 38

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Paper • 2510.17932 • Published Oct 20, 2025 • 8

New activity in deepseek-ai/DeepSeek-OCR 5 months ago

Clarifying Prior Research on Visual Compression of Textual Contexts

❤️👍 14

#18 opened 5 months ago by

Awiny

upvoted a paper 6 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 120

liked a dataset 9 months ago

CSU-JPG/MVPBench

Viewer • Updated May 15, 2025 • 4.7k • 38 • 1

liked a model 9 months ago

showlab/show-o2-1.5B-HQ

Any-to-Any • Updated Sep 5, 2025 • 75 • 3

authored a paper 12 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8, 2025 • 13

Alex Jinpeng Wang

AI & ML interests

Recent Activity

Organizations

Awiny's activity

VCode

README

Clarifying Prior Research on Visual Compression of Textual Contexts