(1)
Efficient LLM Serving Systems A Survey of Model Placement, Batching, and Resource Optimization Techniques. JGREC 2026, 2 (5), 43-47. https://doi.org/10.5281/.