Jobless Developer
Hyphen Connect Limited logo
Hyphen Connect Limited

Posted 1 month ago

Open

AI Specialist (AI Engineering)

AustraliaRemoteFull-time

AI Summary

AI Specialist Engineer focusing on compressing and deploying large language and vision models for on-device inference, including hardware-specific optimization and benchmarking.

About this role

We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.

Responsibilities:

  • Compress and optimize large language and vision models for on-device inference.
  • Develop pipelines for model distillation and hardware-specific compilation.
  • Benchmark performance across various NPU/GPU architectures.

Qualifications:

  • Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
  • Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
  • Strong C++ and Python skills.

Skills

BenchmarkingC++Edge DeploymentHardware-specific CompilationModel DistillationONNX RuntimePipeline DevelopmentPruningPythonQuantization 4-bit/8-bitTensorRT

Explore related jobs

Browse these categories