1. Efficient LLM Serving Systems: A Survey of Model Placement, Batching, and Resource Optimization Techniques. JGREC [Internet]. 2026 May 29 [cited 2026 Jun. 27];2(5):43-7. Available from: https://jgrec.info/index.php/jgrec/article/view/146