Small offline models with limited context windows should be used as system components for specialized tasks rather than as primary workhorses.