Hyphen Connect Limited
Posted 1 month ago
AI Specialist (AI Engineering)
Oregon, USARemote
AI Summary
AI Specialist Engineer focused on compressing and optimizing large language and vision models for on-device inference, including distillation, quantization, and hardware-specific compilation.
About this role
We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities:
- Compress and optimize large language and vision models for on-device inference.
- Develop pipelines for model distillation and hardware-specific compilation.
- Benchmark performance across various NPU/GPU architectures.
Qualifications:
- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.
Skills
BenchmarkingC++Edge DeploymentGPUHardware ArchitecturesHardware-specific CompilationModel DistillationModel PruningNPUOn-device InferenceONNX RuntimePipeline DevelopmentPythonQuantization (4-bit, 8-bit)TensorRT
Explore related jobs
More jobs at Hyphen Connect Limited
- Sales Operations ManagerRemote - Hangzhou, China
- Compliance Officer/ Money Laundering Reporting Officer (CO/MLRO)Hong Kong
- DeFi Product Owner (Bilingual: English & Mandarin)Remote - Global
- Senior 2D Game Artist (Characters & Animation) - Taiwan/ Hong KongAPAC
- Social Media Content Creator (Instagram Reels | AI-Driven)APAC
- Founding Engineer/ Tech Lead (Stablecoin Cross-border Payments Infrastructure)APAC