Job Description
We are looking for a Data Architect responsible for automated data pipelines, infrastructure as code, deployment patterns, and architecture governance for an ML-based demand forecasting system in the public transportation sector.
Key Responsibilities
- Create and implement AWS data pipeline designs using AWS Glue, Step Functions, and Amazon S3.
- Develop architecture for analytics workloads, including data lake patterns and event-driven pipelines.
- Establish security architecture ensuring IAM policies, encryption, and least-privilege access.
- Utilize AWS CloudFormation and/or AWS CDK (TypeScript or Python) for infrastructure automation.
- Deliver Infrastructure as Code (IaC) solutions in production environments.
- Design and deploy ML model deployment patterns using Amazon SageMaker, including endpoints, batch transforms, and inference pipelines.
- Manage the CI/CD processes for ML pipelines and infrastructure deploym...