Text Generation
Safetensors
English
qwen2
reward-model
scientific-writing
evaluation
reinforcement-learning
grpo
conversational
UKPLab/SciRM-7B can be used with libraries, inference providers, notebooks, and local apps.