[1] Sullivan, M. et al. 2026. Learning When to Reason: Gating LLM Inference for Cost-Efficient Serverless Function Scheduling at Scale. Academic Journal of Applied Sciences. 2, 1 (Jun. 2026), 39–45. DOI:https://doi.org/10.54097/gwmv0761.