(1) Sullivan, M.; He, B.; Evans, P. Learning When to Reason: Gating LLM Inference for Cost-Efficient Serverless Function Scheduling at Scale. AJAS 2026, 2 (1), 39-45. https://doi.org/10.54097/gwmv0761.