A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks
Published: in Software Engineering 2030 Workshop
Position paper identifying gaps in LLM Testing tools and strategies.
Recommended citation: Hudson, Sinclair & Jit, Sophia & Hu, Boyue & Chechik, Marsha. (2024). A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks. 10.48550/arXiv.2406.08216.
Download Paper