Small models matter again, and not just because they are cheaper
Small models are not simply budget options. They fit specific parts of the stack well: fast response paths, constrained deployments, narrow tasks, and local workflows where a larger model would add cost and complexity without much extra value.
The real shift is architectural. Teams are getting better at placing different model sizes in different roles instead of asking one large model to do everything.