A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks

Published in Software Engineering 2030 Workshop, 2024

A software engineering paper, spun out of a term research project I did for the Software Engineering for Machine Learning class.

Recommended citation: Hudson, Sinclair & Jit, Sophia & Hu, Boyue & Chechik, Marsha. (2024). A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks. 10.48550/arXiv.2406.08216.
Download Paper