Multiple answers for a context

RFTSystems · January 2, 2026, 1:58pm

Hey
this is a very common with extractive QA.

RoBERTa (fine-tuned for extractive QA like SQuAD) is fundamentally trained to return one contiguous span per (question, context). So if your real intent is “given this context, return all matching spans (endorsement/code pairs)”, you’re trying to force a single-span head to do a multi-span extraction job — it will usually look “confused” because the supervision is contradictory (same input → different single-span labels).

There are basically 3 options:

If you only need one answer at a time → make the question unambiguous
- Example questions:
  • “What is the code for Endorsement 1?” → answer “code1”
  • “What is the code for Endorsement 2?” → answer “code2”
- Same context is fine; the question must disambiguate.
If you truly need multiple answers from one question → treat it as extraction / tagging, not classic QA
- Best fit: token classification / sequence labeling (BIO tags) to mark all spans in the context, then collect the tagged spans.
- This is the standard approach for “multi-span QA” in practice.
If you want the model to output a list/JSON of all pairs → use a generative (seq2seq) model
- e.g., T5/BART style: prompt like “Extract all endorsement-code pairs as JSON.”
- This avoids the “single span” limitation entirely.

Small note:
“multiple answers” can also mean “multiple acceptable ground-truth strings for the same answer” (synonyms / aliases). That is supported by the SQuAD-style answers = { "text": [...], "answer_start": [...] } format — but the model still predicts one span; the multiple answers are mainly for evaluation and robustness, not for returning all spans.

Links:

HF Transformers QA task guide:
Question answering
HF course section showing the expected answers.text / answers.answer_start list format:
Question answering - Hugging Face LLM Course
Token classification guide (good fit for multi-span extraction):
Token classification
HF forum thread on multi-span QA → “cast it as token classification”:
How to do multi-span question answering?
Example reference paper for multi-span QA as sequence tagging:
https://aclanthology.org/2020.emnlp-main.248.pdf

If you share what your question looks like (and whether you need “all pairs” vs “one specific pair”), I can suggest what 1 of the 3 approaches above is best,
hope this helps, Liam

Topic		Replies	Views
How to do multi-span question answering? Beginners	1	3076	March 31, 2022
Question-Answering/Text-generation/Summarizing: Fine-tune on multiple answers Beginners	8	5393	November 20, 2021
Any ways for the QnA model tp highlight the content of a answer? 🤗Transformers	1	193	February 24, 2023
Masked language modelling with specific entities or POS 🤗Transformers	0	226	July 21, 2021
Https://huggingface.co/allenai/longformer-large-4096-finetuned-triviaqa Model cards	0	1162	March 28, 2022

Multiple answers for a context

Related topics