Sullivan, M., He, B., & Evans, P. (2026). Learning When to Reason: Gating LLM Inference for Cost-Efficient Serverless Function Scheduling at Scale. Academic Journal of Applied Sciences, 2(1), 39-45. https://doi.org/10.54097/gwmv0761