yeji-8b-dora-v6 (Deprecated)

โš ๏ธ ์ด ๋ชจ๋ธ์€ ๋” ์ด์ƒ ์‚ฌ์šฉ๋˜์ง€ ์•Š์Šต๋‹ˆ๋‹ค. tellang/yeji-8b-rslora-v7-AWQ๋ฅผ ์‚ฌ์šฉํ•˜์„ธ์š”.

Why Deprecated?

์ด ๋ชจ๋ธ์€ QDoRA (DoRA + rsLoRA) ๋ฐฉ์‹์œผ๋กœ ํ•™์Šต๋˜์—ˆ์œผ๋‚˜ ๋‹ค์Œ ๋ฌธ์ œ๋กœ ์ธํ•ด ํ๊ธฐ๋˜์—ˆ์Šต๋‹ˆ๋‹ค:

1. vLLM ๋ฏธ์ง€์›

ValueError: LoRA adapter 'tellang/yeji-8b-dora-v6' uses DoRA which is not supported by vLLM
  • vLLM์€ DoRA (Weight-Decomposed Low-Rank Adaptation) ์–ด๋Œ‘ํ„ฐ๋ฅผ ์ง€์›ํ•˜์ง€ ์•Š์Œ
  • ํ”„๋กœ๋•์…˜ ๋ฐฐํฌ ์‹œ vLLM์ด ํ•„์ˆ˜์ด๋ฏ€๋กœ ์น˜๋ช…์ ์ธ ์ œ์•ฝ

2. ๊ทน๋„๋กœ ๋А๋ฆฐ ํ•™์Šต ์†๋„

  • ํ•™์Šต ์†๋„: 0.03 it/s (์ดˆ๋‹น 0.03 ์ƒ˜ํ”Œ)
  • ๋น„๊ต: ์ผ๋ฐ˜ rsLoRA๋Š” 0.5-1.0 it/s
  • ์›์ธ: DoRA์˜ ๋ณต์žกํ•œ weight decomposition ์—ฐ์‚ฐ

3. ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ๋ณต์žก๋„

# DoRA ์ „์šฉ ํŒŒ๋ผ๋ฏธํ„ฐ
use_dora: true
lora_r: 32-64  # DoRA๋Š” ๋” ํฐ rank ํ•„์š”
lora_alpha: 64-128
  • DoRA๋Š” ์ผ๋ฐ˜ LoRA๋ณด๋‹ค 2๋ฐฐ ํฐ rank๊ฐ€ ํ•„์š”
  • ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹ ๋‚œ์ด๋„ ์ฆ๊ฐ€

Technical Details

  • ๋ฒ ์ด์Šค ๋ชจ๋ธ: Qwen/Qwen3-8B-Base
  • ํŒŒ์ธํŠœ๋‹ ๋ฐฉ์‹: QDoRA (Quantized DoRA + rsLoRA)
  • ํ•™์Šต ๋ฐ์ดํ„ฐ: ์šด์„ธ/ํ‹ฐํ‚คํƒ€์นด ๋ฐ์ดํ„ฐ 5,000 ์ƒ˜ํ”Œ
  • Rank: 32-64
  • Alpha: 64-128

Recommended Alternative

ํ”„๋กœ๋•์…˜ ์‚ฌ์šฉ

  • ๋ชจ๋ธ: tellang/yeji-8b-rslora-v7-AWQ
  • ์žฅ์ : AWQ ์–‘์žํ™”๋กœ 3๋ฐฐ ๋น ๋ฅธ ์ถ”๋ก , vLLM ์™„์ „ ์ง€์›
  • ์ •ํ™•๋„: v6 ๋Œ€๋น„ 5% ํ–ฅ์ƒ
from vllm import LLM, SamplingParams

llm = LLM(
    model="tellang/yeji-8b-rslora-v7-AWQ",
    quantization="awq",
    gpu_memory_utilization=0.9,
)

์ตœ์‹  ๋ฒ„์ „ (2026-02-01)

  • 4B ๋ชจ๋ธ: tellang/yeji-4b-rslora-v8.1 (๋ฆฌ์†Œ์Šค ์ ˆ์•ฝ)
  • 8B ๋ชจ๋ธ: tellang/yeji-8b-rslora-v7-AWQ (์ตœ๊ณ  ์„ฑ๋Šฅ)

Migration Guide

Before (v6)

# DoRA ์–ด๋Œ‘ํ„ฐ - vLLM ๋ฏธ์ง€์›
llm = LLM(model="tellang/yeji-8b-dora-v6")  # โŒ ValueError

After (v7-AWQ)

# AWQ ์–‘์žํ™” - vLLM ๋„ค์ดํ‹ฐ๋ธŒ ์ง€์›
llm = LLM(
    model="tellang/yeji-8b-rslora-v7-AWQ",
    quantization="awq",
)

Performance Comparison

์ง€ํ‘œ v6 (DoRA) v7-AWQ (rsLoRA)
vLLM ์ง€์› โŒ ๋ฏธ์ง€์› โœ… ์™„์ „ ์ง€์›
์ถ”๋ก  ์†๋„ N/A (๋ฐฐํฌ ๋ถˆ๊ฐ€) 50 tokens/s
ํ•™์Šต ์†๋„ 0.03 it/s 0.8 it/s
๋ฉ”๋ชจ๋ฆฌ 16GB 5.3GB (AWQ)
์ •ํ™•๋„ Baseline +5%

References

License

Apache 2.0

Citation

@misc{yeji-8b-dora-v6,
  title={YEJI Fortune Telling Model (DoRA v6 - Deprecated)},
  author={SSAFY YEJI Team},
  year={2026},
  note={Deprecated: Use yeji-8b-rslora-v7-AWQ instead}
}
Downloads last month
1
Safetensors
Model size
8B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for tellang/yeji-8b-dora-v6

Finetuned
(376)
this model

Paper for tellang/yeji-8b-dora-v6