Sullivan, M., He, B. and Evans, P. (2026) “Learning When to Reason: Gating LLM Inference for Cost-Efficient Serverless Function Scheduling at Scale”, Academic Journal of Applied Sciences, 2(1), pp. 39–45. doi:10.54097/gwmv0761.