Return to Article Details Efficient LLM Serving Systems A Survey of Model Placement, Batching, and Resource Optimization Techniques Download Download PDF