Jobless Developer
SOSi logo
SOSi

Posted 3 months ago

Open

Data Lake Engineer

Doral, FL, United StatesRemoteFull-time

AI Summary

Data Lake Engineer Founded in 1989, SOSi is among the largest private, founder-owned technology and services integrators in the defense and government services industry.

About this role

Data Lake Engineer

Founded in 1989, SOSi is among the largest private, founder-owned technology and services integrators in the defense and government services industry. We deliver tailored solutions, tested leadership, and trusted results to enable national security missions worldwide.

SOSi is seeking a Data Lake Engineer to support mission requirements for a structured approach to further develop, integrate, and sustain a scalable, federated data ecosystem that enhances interoperability, governance, and mission-driven analytics for a DoD customer. The primary objective of the program is to bridge the operational gaps between DoD, IC, interagency, and non-traditional international partners to enable real-time information sharing, dynamic data integration, and mission-tailored analytical capabilities.

Essential Job Duties:

  • The contractor shall design, implement, and maintain scalable Data Lake architectures to support structured and unstructured data ingestion, ensuring efficient data access and retrieval.
  • The contractor shall configure and manage the integration interface between the Data Lake and the knowledge graph platform (Stardog), including SPARQL endpoint access, metadata federation, and catalog alignment.
  • The contractor shall follow access control policies and usage scope defined by the Government and other coordinated Work Orders.
  • The contractor shall confirm compliance with access policies on a quarterly basis and document the results in the Data Governance & Compliance Report.
  • The contractor shall optimize ETL pipelines for high-volume data transformation, ensuring compliance with DoD IL-4/IL-5 security standards.
  • The contractor shall implement storage tiering strategies and access controls, ensuring data is properly classified, retained, and accessed per DoD governance requirements.
  • The contractor shall submit the Data Lake Performance & Optimization Report, detailing ingestion efficiency, access control improvements, and storage utilization metrics.

Qualifications

  • Active TS/SCI Clearance.
  • Master’s degree or higher (e.g., Ph.D.) in Computer Science, Information Technology, Systems Engineering, Data Science, Business Administration, Engineering Management, or a closely related field, or 
    • a minimum of eleven (11) years of experience managing complex technical projects in enterprise data architecture, Databricks administration, and cloud-based data platforms.
  • Knowledge and capability to support Data Lake platform administration and enterprise data architecture for DoD data-driven projects.
  • Skilled in Data Lake platform administration, including workspace management and configuration, cluster optimization and performance tuning, cloud integration, and Unity Catalog integration for secure data governance.
  • Proficient in ETL/ELT pipeline development, Delta Lake architecture and optimization, AI/ML workflow integration, and Data Lakehouse optimization for DoD analytics and mission-critical data workflows.
  • Experienced in SysEngOps, DevSecOps, version control systems (Git), and CI/CD pipelines to streamline Data Lake development and deployment.
  • Knowledgeable in identity and access management (IAM), role-based access control (RBAC), and cloud security best practices across AWS, Azure, and GCP.
  • Hands-on expertise in Python, SQL/NoSQL, Apache Spark, Databricks SQL, Terraform, and cloud-native data services for large-scale data processing and analytics.

Additional Information

Work Environment

  • Normal office conditions

Working at SOSi

All interested individuals will receive consideration and will not be discriminated against for any reason.

Explore related jobs

Browse these categories