We’re looking for a Senior Software Engineer who views data through the lens of high-quality software engineering. You will design sophisticated data architectures and implement rigorous best practices to ensure that our systems are resilient and scalable. This is a multi-faceted role where you will build the software necessary to manage and move data across our organization.
What your day could consist of:
Architecting and Building Pipelines: Design, develop, and maintain high-throughput data pipelines to move and transform data across our ecosystem
Data Modeling and Schema Design: Execute DDL and DML operations to manage and optimize our data structures
Engineering Excellence: Implement rigorous software engineering best practices, including automated testing, CI/CD pipelines, managing infrastructure as code, and monitoring of system health
Collaboration: Partner with Data Science and AI teams to engineer feature stores and automated MLOps Pipelines, enabling AI models to move from research to production
Technical environment:
Databases: MySQL, Amazon Aurora, Amazon RDS, PostgreSQL
ata Storage and Analytics: AWS S3, Apache Iceberg, Snowflake, Apache Druid, Looker
Data Processing and Transformation: Apache Spark, DBT
Languages: Python (primary)
Infrastructure as Code: Terraform
Containerization and Orchestration: Kubernetes, Docker, Airflow
What is needed:
Engineering Experience: 4+ years professional software engineering experience, with a proven track record of building and maintaining production-grade data systems
Data Architecture Experience: Deep understanding of modern data stack components. You should be comfortable designing and building schemas that balance performance with scalability
Infrastructure: Proficient with managing your own infrastructure in code, using tools like Terraform
Software rigor: Strong background in software engineering best practices, including automated testing and CI/CD
Advanced SQL and Optimization: Expert-level SQL skills (especially MySQL); you don’t just write queries, you optimize for performance and scalability
Communication: You are able to have technical discussions internally as well as translate these ideas into non-technical conversations with stakeholders