Optimizing Data Pipelines for AI Workflows