Sullivan, Megan, Boyang He, and Patrick Evans. 2026. “Learning When to Reason: Gating LLM Inference for Cost-Efficient Serverless Function Scheduling at Scale”. Academic Journal of Applied Sciences 2 (1): 39-45. https://doi.org/10.54097/gwmv0761.