G42

Infrastructure Architect

Job Description

Overview

As the Infrastructure Architect at Inception, you will play a pivotal role in architecting, developing, and optimizing infrastructure for Inception's advanced AI platforms. With a robust background in cloud and on-premises infrastructure design, you'll focus on ensuring the scalability, security, and high performance of our AI systems. Collaborating closely with engineering and operations teams, you'll work to enhance the infrastructure that supports AI workloads, with an emphasis on cloud optimization, networking, and storage.

Inception is the UAE's national-scale enabler in AI Research and Development. Partnering with Microsoft's AI SaaS, we offer a domain-specific Agentic AI Orchestrator platform that uses reasoning agents to deliver precise and cost-effective services. Our focus includes AI incubation, IP creation, applied AI R&D, and AI investment products. By creating models tailored to specific domains and languages, we ensure superior accuracy and efficiency. We collaborate with top universities and industry giants to drive significant advancements in AI technology within the region.

Responsibilities

  • Architect Infrastructure Solutions: Design and develop scalable, secure, and highly available infrastructure that supports Inception's AI platforms and applications.
  • Cloud & On-Premises Architecture: Develop cloud-based and on-premises infrastructure to meet AI/ML workload requirements for performance, scalability, and security.
  • Collaboration with Key Teams: Work closely with engineering, DevOps, and product teams to optimize infrastructure for AI training, inference, and data processing.
  • High Availability & Disaster Recovery: Build high availability and disaster recovery solutions to ensure the resilience of critical AI platforms.
  • Networking Architecture: Design and implement robust networking solutions for secure and reliable connectivity across AI systems.
  • Automation & Infrastructure as Code: Champion the use of automation and IaC, leveraging tools like Terraform, Ansible, and Kubernetes for streamlined infrastructure deployment and management.
  • Cloud Efficiency & Cost Management: Implement best practices for cloud resource efficiency and cost-effectiveness while maintaining stringent security standards.
  • Technology Evaluation & Integration: Continuously assess and integrate new technologies to enhance infrastructure capabilities in storage, networking, and cloud-native services.
  • Compliance & Security: Ensure infrastructure adheres to regulatory and security requirements, focusing on data privacy, encryption, and access control.
  • Technical Guidance: Provide mentorship and technical leadership to the infrastructure team, promoting best practices in infrastructure quality and efficiency.
  • Industry Trends & Innovation: Stay current on industry advancements and drive infrastructure innovation to support evolving AI and technology trends.

Qualifications


  • Experience: Minimum of 12 years in infrastructure architecture, cloud platforms, or systems engineering, with extensive experience in large-scale, mission-critical environments.
  • Proven Cloud Expertise: Solid track record in architecting and managing cloud infrastructure (AWS, Azure, GCP) for AI/ML workloads, with a focus on scalability and resilience.
  • Technical Proficiency: Expertise in cloud computing, networking, and storage, and experience with IaC tools like Terraform, Ansible, or CloudFormation.
  • Containerization & Orchestration: Proficient in Docker and Kubernetes, with experience deploying and managing AI/ML workloads.
  • Networking & Security Knowledge: Advanced understanding of networking protocols, VPNs, firewalls, and security best practices for hybrid and cloud environments.
  • Storage Systems: Familiarity with storage solutions for AI/ML applications, including distributed storage and high-performance computing architectures.
  • Resilience Strategies: Strong background in designing disaster recovery, business continuity, and high-availability plans.
  • Cross-Functional Collaboration: Ability to work with DevOps, software engineering, and security teams to align infrastructure goals.

Skills and attributes for success

  • Strategic Insight: Ability to design and implement infrastructure that aligns with both technical and business objectives.
  • Problem Solving: Strong analytical skills, particularly in addressing challenges related to scalability, performance, and security.
  • Leadership & Mentorship: Effective communicator with leadership capabilities to guide cross-functional teams and mentor junior engineers.
  • Innovative Mindset: Passion for staying at the forefront of infrastructure advancements to support high-impact AI applications.

Ideally, you'll also have

  • Education: Advanced degree in Computer Science, Engineering, or a related field.
  • Certifications: Cloud architecture certifications (e.g., AWS Solutions Architect, Google Cloud Architect) preferred.
  • AI/ML Infrastructure Experience: Knowledge of high-performance computing, data pipelines, and AI model deployment at scale.
  • Cost Optimization Skills: Proficient in cloud cost management strategies that ensure efficiency without compromising performance.

What we look for

If you have a performance-driven, inquisitive mind and the agility to adapt to ambiguity, you will fit right in. You should be eager to explore opportunities to build meaningful collaborations with stakeholders and aspire to create unique customer-centric solutions. Bias for action and a passion to conquer new frontiers in the AI space are at the heart of the Inception community.

What working at Inception offers

Culture: An open, diverse and inclusive environment with a global vision that encourages personal growth and focuses on ground-breaking, industry-first innovations.

Career: Outstanding learning, development & growth opportunities via structured training programs and innovative, high-tech projects.

Work-Life: A hybrid work policy to strike the perfect balance between office and home.

Rewards: A competitive remuneration package with a host of perks including healthcare, education support, leave benefits and more.

If you can confidently demonstrate that you meet the criteria above, please contact us as soon as possible.







More Info

Industry: Other

Function: Technology

Job Type: Permanent Job

Date Posted: 14/11/2024

Job ID: 100329425

Last Updated: 25-11-2024 06:56:21 PM