Acknowledgments

The following research informed our benchmark design, evaluation methodologies, and practices for measuring LLM capabilities. Sansa Bench features original queries and proprietary evaluation code, but these works shaped our approach and helped us build on established research in the field.