Company: AI71
Role: Lead ML Engineer
Location: Abu Dhabi, UAE
About Us:
AI71 is an applied research team dedicated to creating helpful and responsible AI agents for knowledge workers.
Working closely with our industry partners, our cross-functional teams of AI experts build products grounded in the cutting-edge research of our colleagues from the Technology Innovation Institute (TII).
Job Description:
Are you a seasoned ML Engineer with a passion for AI and a track record of deploying, monitoring, and maintaining machine learning models in production environments for optimal performance? As our Lead ML Engineer at AI71, you'll play a critical role in shaping and delivering cutting-edge solutions that redefine industries and create transformative impact.
What You'll Do:
- Deploy and maintain machine learning models in production environments.
- Implement monitoring solutions to track model performance and detect anomalies.
- Collaborate with data scientists and engineers to streamline the model deployment process.
- Optimize models for scalability, reliability, and real-world performance.
- Implement and manage model versioning and rollback procedures.
- Troubleshoot and resolve issues related to model inference and data pipelines.
- Stay up to date on the latest developments in MLOps and implement best practices for model lifecycle management.
What You'll Bring:
- 5+ years of experience in ML / data science.
- Bachelor's or higher degree in Computer Science, Machine Learning, or a related field.
- Solid understanding of machine learning concepts and model deployment.
- Proficiency in programming languages such as Python.
- Experience with model deployment frameworks (e.g., TensorFlow Serving, ONNX Runtime).
- Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes).
- Familiarity with monitoring and logging tools for MLOps.
- Strong collaboration and communication skills.
Why AI71:
- Proven performance of our large language models.
- Strong traction and adoption from the open-source community.
- Secured proprietary data to build specialized, distinctive models.
- Locked in large compute power to support our roadmap.
- Signed anchor clients to develop proofs of concept (POCs) and demonstrate our solutions.