Trinity Nano Hermes QLoRA

Fine-tuned MoE Adapter — Trinity Nano (Trinity-Nano-Preview) fine-tuned on Hermes function-calling data via QLoRA + Axolotl.

Model Details

  • Base Model: arcee-ai/Trinity-Nano-Preview (AfmoeForCausalLM, MoE)
  • Format: QLoRA LoRA adapter (safetensors)
  • Fine-tune Data: NousResearch/hermes-function-calling-v1
  • Training Stack: Axolotl + Akash Network
  • Fine-tuned by: arealicehole
  • License: Apache 2.0

LoRA Configuration

{
  "peft_type": "LORA",
  "r": 64,
  "lora_alpha": 128,
  "use_rslora": true,
  "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj", "gate_up_proj"],
  "lora_dropout": 0.0
}

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

tokenizer = AutoTokenizer.from_pretrained("arealicehole/trinity-nano-hermes-qlora")
model = AutoModelForCausalLM.from_pretrained(
    "arealicehole/trinity-nano-hermes-qlora",
    torch_dtype=torch.bfloat16,
    device_map="auto"
)

messages = [{"role": "user", "content": "What's 84 * 3 / 2?"}]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to("cuda")
outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0]))

Merge with Base (Transformers)

from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "arcee-ai/Trinity-Nano-Preview",
    torch_dtype=torch.bfloat16,
    device_map="auto"
)
model = PeftModel.from_pretrained(base, "arealicehole/trinity-nano-hermes-qlora")
merged = model.merge_and_unload()

Model Sources

Cite

@misc{trinity-nano-hermes-qlora,
  author = {arealicehole},
  title = {Trinity Nano Hermes QLoRA},
  year = {2026},
  url = {https://huggingface.co/arealicehole/trinity-nano-hermes-qlora}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for arealicehole/trinity-nano-hermes-qlora