Posted 1 month ago

AI Specialist (AI Engineering)

Singapore, On-site, Full-time

AI Summary

AI Specialist Engineer focused on compressing and optimizing large language and vision models for on-device inference, including distillation, pruning, quantization, and hardware-specific deployment across NPU/GPU architectures.

About this role

We are looking for an AI Specialist Engineer to improve the on-device inference performance of large language and vision models. You will develop and deploy these models so that they run efficiently across diverse hardware architectures, including NPUs and GPUs.

Responsibilities:

Compress and optimize large language and vision models for on-device inference.
Develop pipelines for model distillation and hardware-specific compilation.
Benchmark performance across various NPU/GPU architectures.

Qualifications:

Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
Strong C++ and Python skills.

Skills

Benchmarking Across NPU/GPU Architectures
C++
Edge Deployment
Hardware-specific Compilation
Model Distillation
Model Pruning
On-device Inference
ONNX Runtime
Python
Quantization 4-bit/8-bit
TensorRT
