1.
Sullivan M, He B, Evans P. Learning When to Reason: Gating LLM Inference for Cost-Efficient Serverless Function Scheduling at Scale. AJAS. 2026;2(1):39-45. doi:10.54097/gwmv0761