Overview:
The Senior Data Scientist will be highly skilled and experienced in Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) techniques. Additionally, they will have a strong background in data science, machine learning, and natural language processing (NLP). Proficiency in Arabic is optional but will be considered a significant advantage.
Responsibilities:
Design, develop, and deploy large-scale language models and RAG systems to solve complex problems.
Optimize and fine-tune models to improve performance and accuracy for various applications.
Conduct research to advance the state-of-the-art in language models and retrieval-augmented generation.
Stay updated with the latest advancements in NLP, machine learning, and artificial intelligence.
Collect, preprocess, and analyze large datasets to support model training and evaluation.
Implement robust data pipelines to ensure efficient data flow and processing.
Evaluate model performance using appropriate metrics and methodologies.
Identify areas for improvement and implement strategies to enhance model effectiveness.
Collaborate with cross-functional teams including engineers, product managers, and researchers.
Mentor and guide junior data scientists and machine learning engineers.
Document methodologies, experiments, and results comprehensively.
Prepare and present reports and findings to stakeholders and team members.Qualifications:
Masters or Ph.D. in Computer Science, Data Science, Machine Learning, or a related field
Minimum of 5 years of experience in data science and machine learning.
Proven experience working with large language models (e.g., GPT, BERT) and RAG systems.
Experience with natural language processing and understanding tasks.
Proficiency in programming languages such as Python, and familiarity with machine learning frameworks (e.g., TensorFlow, PyTorch).
Strong understanding of machine learning algorithms, deep learning, and statistical methods.
Experience with data visualization tools and techniques.
Familiarity with cloud platforms and services (e.g., AWS, GCP, Azure).
Proficiency in Arabic language (reading, writing, speaking) is a plus.
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration abilities.
Ability to work independently and in a team-oriented environment.
Continuous learning mindset and a passion for innovation.