From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models — arXiv2