Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2 
Published as an arXiv preprint, 2025
We construct E2EDevBench and a hybrid evaluation framework to benchmark LLM-based agent systems for end-to-end software development.