[1]
“Efficient LLM Serving Systems: A Survey of Model Placement, Batching, and Resource Optimization Techniques”, JGREC, vol. 2, no. 5, pp. 43–47, May 2026, doi: 10.5281/.