Project overview
About the Role
We are seeking an experienced engineer to design, implement, and maintain data validation workflows within Docker-based build pipelines. In this role, you will develop and manage Dockerfile standards, including labels, metadata, and validation scripts, to ensure that datasets, schemas, and model artifacts consistently meet quality and compliance requirements prior to deployment. You will collaborate closely with data engineering, machine learning, and DevOps teams to deliver reliable, reproducible, and fully validated containerized data pipelines that support scalable production environments.

Key Responsibilities
Design and optimize Dockerfiles with integrated data validation and quality assurance steps.
Implement standardized LABEL metadata to track dataset versions, schemas, and data lineage.
Develop validation scripts (Python/Bash) to perform schema enforcement, data integrity checks, and quality control.
Integrate validation workflows into CI/CD pipelines, enabling automated fail-on-bad-data enforcement.
Document and maintain standards for Dockerfile labeling, validation logic, and data governance practices.

Qualifications/Requirements
Experienced DevOps engineer with hands-on expertise in containerized environments.
Strong proficiency in Docker and Dockerfile development, including image optimization and best practices.
Skilled in Python or Bash scripting for data validation, automation, and build integration.
Solid understanding of data formats, schemas, and validation frameworks.
Familiarity with CI/CD systems and container registries, ensuring automated and compliant build pipelines.

Nice to Have:
Previous participation in LLM research or evaluation projects.
Experience building or testing developer tools or automation agents.
Experience with MLOps workflows, data versioning, or Great Expectations.
Knowledge of Kubernetes or container security tools.

Benefits
Work in a fully remote environment.
Opportunity to work on cutting-edge AI projects with leading LLM companies.

Details:
Commitment required: at least 4 hours per day and a minimum of 20 hours per week, with 4 hours of overlap with PST. Three time-commitment options are available: 20, 30, or 40 hours per week.
Employment type: Contractor assignment (no medical/paid leave)
Duration of contract: 1 month; [expected start date is next week]