Jobless Developer
Carnegie Affiliates logo
Carnegie Affiliates

Posted 120 months ago

Open

DATA ENGINEER

New York, NY, United StatesRemoteFull-time

AI Summary

DATA ENGINEER Major Corporation Responsibilities:Design and develop high-throughput, low-latency data processing pipelines to quickly ingest and make data available on the platform across various distributed data storesAnalyze large datasets to identify opportunities to tune and improve the systemExperiment with various Hadoop frameworks like Hive, Pig and Scalding to identify the optimal approach for extracting valuable insights from massive datasetsTool We Used:ScalaHadoop (Hive, Pig, Scaldin

About this role

DATA ENGINEER

Major Corporation

Responsibilities:

  • Design and develop high-throughput, low-latency data processing pipelines to quickly ingest and make data available on the platform across various distributed data stores
  • Analyze large datasets to identify opportunities to tune and improve the system
  • Experiment with various Hadoop frameworks like Hive, Pig and Scalding to identify the optimal approach for extracting valuable insights from massive datasets

Tool We Used:

  • Scala
  • Hadoop (Hive, Pig, Scalding, Spark)
  • Kafka
  • MySQL, Redis, Vertica, Aerospike

Qualifications

Requirements:

  • Bachelors or Masters in Computer Science or related field
  • 3+ years of experience ingesting, processing, storing and querying large datasets
  • Professional Hadoop ecosystem experience, including storage optimization and job performance tuning
  • Expertise in Java, Python or similar language(s). Functional programming experience is a plus
  • Passion for code correctness and intuition about which values in data are to be expected in a business context

Additional Information

All your information will be kept confidential according to EEO guidelines.

Explore related jobs

Browse these categories