Elevance Health Interview Question

How to optimize API call usage to AI models under different constraints?