Jinsei Shiraishi's picture

Jinsei Shiraishi

OsakanaTeishoku

·

Osakana7777777

AI & ML interests

Large Language Models, Computer Vision, AI/ML application to medical settings

Recent Activity

updated a dataset 14 days ago

OsakanaTeishoku/Magpie-Tanuki-8B-CoT-formatted

published a dataset 14 days ago

OsakanaTeishoku/Magpie-Tanuki-8B-CoT-formatted

liked a model 16 days ago

tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2

View all activity

Organizations

upvoted a paper 16 days ago

On the Optimal Reasoning Length for RL-Trained Language Models

Paper • 2602.09591 • Published 26 days ago • 5

upvoted an article 18 days ago

Article

NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル

18 days ago

•

23

upvoted an article 25 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

Dec 1, 2025

•

304

upvoted an article 5 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

Sep 11, 2025

•

184

upvoted an article 6 months ago

Article

The 4 Things Qwen-3’s Chat Template Teaches Us

Apr 30, 2025

•

84

upvoted an article 8 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

760

upvoted a collection 8 months ago

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 1 day ago • 46

upvoted a collection 9 months ago

Any-to-Any Models, Datasets, Spaces

19 items • Updated 27 days ago • 30

upvoted an article 9 months ago

Article

Vision Language Models Explained

Apr 11, 2024

•

522

upvoted a collection 11 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated 6 days ago • 698

upvoted a collection about 1 year ago

Asagi-VLM

Asagi is a Japanese Vision & Language model, trained on a large-scale synthetic dataset. • 4 items • Updated Nov 27, 2025 • 7