2 19 29

Xie

QianqianXie1994

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images

upvoted a paper 3 months ago

Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation

published a dataset 4 months ago

TheFinAI/german-credit-benchmark

View all activity

Organizations

upvoted a paper about 2 months ago

Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images

Paper • 2604.07338 • Published Apr 8 • 5

upvoted a paper 3 months ago

Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation

Paper • 2602.16990 • Published Feb 19 • 11

published a dataset 4 months ago

TheFinAI/german-credit-benchmark

Viewer • Updated Jan 27 • 1k • 70

upvoted an article 6 months ago

Article

Introducing the Open FinLLM Leaderboard

QianqianXie1994, jiminHuang, Effoula, yanglet, alejandroll10, Benyou, ldruth, xiangr, Me1oy, ShirleyY, mirageco, blitzionic, clefourrier

•

Oct 4, 2024

• 80

upvoted a paper 6 months ago

MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment

Paper • 2512.09636 • Published Dec 10, 2025 • 26

authored a paper 6 months ago

MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment

Paper • 2512.09636 • Published Dec 10, 2025 • 26

upvoted a paper 8 months ago

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

Paper • 2510.09116 • Published Oct 10, 2025 • 97

updated a Space 8 months ago

README

🧠

upvoted a paper 8 months ago

FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs

Paper • 2510.08886 • Published Oct 10, 2025 • 20

published a Space 8 months ago

FinAnnotation

🟧

Label data with an open‑source platform

upvoted a paper 9 months ago

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

Paper • 2508.13491 • Published Aug 19, 2025 • 59

authored a paper 10 months ago

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

Paper • 2508.13491 • Published Aug 19, 2025 • 59

upvoted a paper 12 months ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published Jun 16, 2025 • 94

liked a Space about 1 year ago

Open Financial LLM Leaderboard

🏆

Evaluating LLMs on Multilingual Multimodal Financial Tasks

updated a collection about 1 year ago

MultiFinBen

Collection

6 items • Updated Apr 18 • 4

updated 2 datasets about 1 year ago

TheFinAI/MultiFinBen-EnglishOCR

Viewer • Updated Jun 18, 2025 • 7.96k • 89 • 4

TheFinAI/MultiFinBen-SpanishOCR

Viewer • Updated Jun 18, 2025 • 12.9k • 95

Xie