phi-4 is the only one I'm aware of that was deliberately trained to refuse rather than hallucinate. It's mind-blowing to me that that isn't standard; everyone is trying to maximize benchmarks at all costs.
I wonder if diffusion LLMs will hallucinate less, since they inherently have error correction built into their inference process.
Even that won't be truly effective. It's all marketing at this point.
The problem of hallucination really is fundamental to the technology. If there's a way to prevent it, it won't be as simple as training the model differently.