OmniGene-4-SFT-v3-GGUF

GGUF format models for OmniGene-4-SFT-v3 (instruction-tuned version)

GGUF format quantized versions of OmniGene-4 for efficient inference on consumer GPUs and CPUs using llama.cpp, llama-cpp-python, Ollama, LM Studio, and other GGUF-compatible runtimes.

Performance

Benchmark Accuracy
Standard Homology 99.95% (6,000 pairs)
Remote Homology 59.50% (2,000 pairs)
BixBench Knowledge 93.66%

vs. ESM-2 (650M): OmniGene-4 59.5% vs ESM-2 50.5% (+9 pp on same 2,000 pairs)

Available Quantizations

Quantization File Size RAM Required Quality
F16 OmniGene-4-SFT-v3-f16.gguf 50.6 GB ~52 GB Best quality
Q4_K_M OmniGene-4-SFT-v3-Q4_K_M.gguf 16 GB ~17 GB Recommended balance

Hardware Requirements

Quantization GPU CPU + RAM
F16 RTX A6000 (48GB) 64GB+ system RAM
Q4_K_M RTX 5090 (32GB) / RTX 4090 (24GB) / RTX 3090 (24GB) 32GB+ system RAM

Quick Start

Option 1: llama-cpp-python

pip install llama-cpp-python
from llama_cpp import Llama

llm = Llama(
    model_path="OmniGene-4-SFT-v3-Q4_K_M.gguf",
    n_ctx=4096,
    n_gpu_layers=-1,  # Offload all layers to GPU
)

# Example: Protein homology detection
prompt = """### Instruction:
Determine if the two protein sequences below are structurally related (homologous).

### Sequence 1:
MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQAPILSRVGDGTQDNLSGAEKAVQVKVKALPDAQFEVVHSLAKWKRQTLGQHDFSAGEGLYTHMKALRPDEDRLSPLHSVYVDQWDWERVMGDGERQFSTLKSTVEAIWAGIKATEAAVSEEFGLAPFLPDQIHFVHSQELLSRYPDLDAKGRERAIAKDLGAVFLVGIGGKLSDGHRHDVRAPDYDDWSTPSELGHAGLNGDILVWNPVLEDAFELSSMGIRVDADTLKHQLALTGDEDRLELEWHQALLRGEMPQTIGGGIGQSRLTMLLLQLPHIGQVQAGVWPAAVRESVPSLL

### Sequence 2:
MKKFDRGEQVVKVKALPQAQFEEVHSLAKWKRQTLGQHDFSAGEGLYTHMKALRPDEDRLSPLHSVYVDQWDWERVMGDGERQFSTLKSTVEAIWAGIKATEAAVSEEFGLAPFLPDQIHFVHSQELLSRYPDLDAKGRERAIAKDLGAVFLVGIGGKLSDGHRHDVRAPDYDDWSTPSELGHAGLNGDILVWNPVLEDAFELSSMGIRVDADTLKHQLALTGDEDRLELEWHQALLRGEMPQTIGGGIGQSRLTMLLLQLPHIGQVQAGVWPAAVRESVPSLL

### Answer:
"""

output = llm(prompt, max_tokens=5, temperature=0)
print(output['choices'][0]['text'])  # Expected: "Yes" or "No"

Option 2: llama.cpp Command Line

./llama-cli -m OmniGene-4-SFT-v3-Q4_K_M.gguf \
    -p "### Instruction:\nDetermine if the two protein sequences are homologous.\n### Sequence 1:\nMKTAYIAK...\n### Answer:\n" \
    -n 5 -ngl -1

Option 3: Ollama

# Create Modelfile
cat > Modelfile <<EOF
FROM ./OmniGene-4-SFT-v3-Q4_K_M.gguf

TEMPLATE """### Instruction:
{{ .System }}

{{ .Prompt }}

### Answer:
"""
EOF

ollama create omnigene-4-sft -f Modelfile
ollama run omnigene-4-sft

Option 4: LM Studio

  1. Download OmniGene-4-SFT-v3-Q4_K_M.gguf
  2. Place in LM Studio models folder
  3. Load in LM Studio
  4. Use the instruction template below

Supported Tasks

The model is instruction-tuned on 8 task families:

Task Examples Proportion
Protein homology 49,894 25.0%
Literature (UniProtQA) 39,915 20.0%
Mutation (MutaDescribe) 29,936 15.0%
Cell biology 29,936 15.0%
Molecule (SMILES) 25,945 13.0%
Structure (3D) 19,958 10.0%
DNA homology 3,992 2.0%

Example Tasks

1. Protein Homology Detection

### Instruction:
Determine if the two protein sequences below are structurally related (homologous).

### Sequence 1:
[protein sequence 1]

### Sequence 2:
[protein sequence 2]

### Answer:

2. Protein Function Prediction

### Instruction:
Predict the biological function of the following protein sequence.

### Protein Sequence:
[protein sequence]

### Answer:

3. Mutation Effect Prediction

### Instruction:
Describe the effect of the mutation on protein function.

### Wild-type:
[wild-type sequence]

### Mutant:
[mutant sequence]

### Answer:

4. Cell Type Identification

### Instruction:
Identify the cell type based on the gene expression profile.

### Gene Expression:
CD4: high, CD8: low, IL2: high

### Answer:

5. SMILES to Properties

### Instruction:
Predict the drug-likeness of the following molecule.

### SMILES:
CC(C)Cc1ccc(cc1)C(C)C(O)=O

### Answer:

Model Description

OmniGene-4-SFT-v3 is the final instruction-tuned biological foundation model with:

  • Base: Gemma-4-26B-A4B-Instruct + Bio CPT + Bio SFT
  • Vocabulary: 290,048 tokens
  • SFT data: 199,576 instruction examples
  • Precision: GGUF F16 / Q4_K_M

Other Versions

Citation

@article{wang2026omnigene4,
  title={OmniGene-4: A Unified Bio-Language MoE Model with Router-Level Interpretability},
  author={Wang, Liang},
  journal={bioRxiv},
  year={2026}
}

Paper

Full paper: https://github.com/maris205/omnigene4

License

Apache 2.0

Contact

Liang Wang (wangliang.f@gmail.com)
School of Artificial Intelligence and Automation
Huazhong University of Science and Technology

Downloads last month
177
GGUF
Model size
25B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for dnagpt/OmniGene-4-SFT-v3-GGUF

Quantized
(1)
this model