Posted 13 months ago
Lead Data Engineer - AWS
About this role
Tiger Analytics is a fast-growing advanced analytics consulting firm. Our consultants bring deep expertise in Data Science, Machine Learning, and AI. We are the trusted analytics partner for multiple Fortune 500 companies, enabling them to generate business value from data. Our business value and leadership have been recognized by various market research firms, including Forrester and Gartner. We are looking for top-notch talent as we continue to build the best analytics consulting team in the world.
Tiger Analytics is seeking an experienced Lead Data Engineer to join our team, focused on building scalable Generative AI architectures within the AWS ecosystem. You will architect the data foundations that power LLMs and autonomous agents for our Fortune 500 partners.
Key Responsibilities:
- GenAI Infrastructure: Architect data pipelines using Amazon Bedrock and Amazon SageMaker to build, deploy, and scale Generative AI applications.
- Vector Foundations: Implement and optimize vector search capabilities using Amazon OpenSearch Serverless or specialized vector engines for RAG (Retrieval-Augmented Generation).
- Serverless Data Engineering: Build highly scalable, event-driven ETL pipelines using AWS Lambda, AWS Glue, and Amazon Kinesis.
- Modern Data Stack: Manage large-scale data lakehouses leveraging Amazon S3, AWS Lake Formation, and Amazon Redshift.
- LLM Ops: Integrate AWS Step Functions and SageMaker Pipelines to automate the fine-tuning and deployment of foundation models.
Requirements
- Experience: 8-12 years in Data Engineering with a heavy focus on the AWS Cloud stack.
- AWS Expertise: Deep hands-on experience with Glue, Athena, EMR, and Redshift.
- AI/ML Tools: Proficiency in LangChain or LlamaIndex integrated with AWS services to handle unstructured data (text, images, PDFs).
- DevOps & IaC: Experience deploying infrastructure using AWS CDK or Terraform.
- Core Skills: Advanced SQL, Python, and PySpark skills tailored for distributed processing on AWS.
Benefits
This position offers an excellent opportunity for significant career development in a fast-growing and challenging entrepreneurial environment with a high degree of individual responsibility.
_**Tiger Analytics provides equal employment opportunities to applicants and employees without regard to race, color, religion, age, sex, sexual orientation, gender identity/expression, pregnancy, national origin, ancestry, marital status, protected veteran status, disability status, or any other basis as protected by federal, state, or local law.**_