- Summary: 9.87
- 8.35
- Therapy: 9.05
- Finance: 8.88
- Health: 9.27
- Balanced performer: Most consistent across five metrics with no wild swings.
- Excellent factual grounding: Barely hallucinates and hugs the source context.
- Compact and efficient: Nearly matches 7B models while running cheaper and faster.
- Clean delivery: Keeps summaries precise without extra fluff.
- Slightly rigid phrasing that can feel terse on creative prompts.
- Stays conservative even when light speculation would help.
Default baseline when you need consistent, factual scoring without burning GPU hours.