A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks
Hudson, Sinclair & Jit, Sophia & Hu, Boyue & Chechik, Marsha. (2024). A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks. 10.48550/arXiv.2406.08216.