AI Engineer Jobs
Jobgether

AI Research Engineer - Pre training

Jobgether

Remote (US) Senior Level
Posted Today

Perks

  • Fully Remote
  • Advanced Computing
  • GPU Clusters

Skills

Large Language Models Transformer Architectures PyTorch TensorFlow JAX Distributed Machine Learning GPU Optimization Python Deep Learning Optimization Model Scaling Multi-modal AI Parallel Training Strategies

About the Role

This position is posted by Jobgether on behalf of a partner company. We are currently looking for an AI Research Engineer - Pre training in United States.

This is an exceptional opportunity to contribute to the next generation of artificial intelligence systems within a highly innovative and globally distributed environment. In this role, you will work on large-scale pre-training initiatives for advanced language and multi-modal models, helping shape breakthroughs in model intelligence, efficiency, and scalability. You will collaborate with world-class engineers and researchers while leveraging cutting-edge infrastructure powered by thousands of GPUs. The position is ideal for candidates who enjoy deep technical challenges, experimentation, and pushing the boundaries of AI research. You’ll play a key role in developing novel architectures, optimizing training methodologies, and solving complex computational bottlenecks. The environment is fast-moving, research-driven, and built around autonomy, collaboration, and continuous innovation.

\n


Accountabilities:
  • Design, develop, and optimize large-scale AI model pre-training pipelines across distributed GPU infrastructures.
  • Research and prototype advanced architectures for large language models and multi-modal AI systems.
  • Conduct experiments independently and collaboratively, analyze training results, and iterate on methodologies to improve model quality and efficiency.
  • Identify, investigate, and resolve bottlenecks related to model performance, scalability, and computational optimization.
  • Contribute to data curation strategies and baseline improvements to strengthen model training outcomes.
  • Enhance distributed training systems to ensure seamless scalability, reliability, and operational efficiency across large compute environments.
  • Collaborate with cross-functional engineering and research teams to accelerate innovation and deliver high-impact AI capabilities.
  • Stay informed on emerging trends and advancements in AI research, machine learning systems, and large-scale model training.

Requirements:

  • Strong experience working with large language models, transformer architectures, and AI pre-training methodologies.
  • Deep understanding of distributed machine learning systems and large-scale GPU-based training environments.
  • Proven expertise in machine learning frameworks such as PyTorch, TensorFlow, JAX, or similar technologies.
  • Solid background in deep learning optimization, model scaling, and performance tuning.
  • Experience designing and executing research experiments with strong analytical and problem-solving capabilities.
  • Familiarity with distributed computing, parallel training strategies, and infrastructure optimization.
  • Strong programming skills in Python and experience building scalable AI training systems.
  • Research-oriented mindset with curiosity, innovation, and the ability to explore novel techniques and architectures.
  • Excellent communication and collaboration skills within remote and international teams.
  • Advanced degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field is preferred.

Benefits:

  • Fully remote work environment with the flexibility to work from anywhere.
  • Opportunity to work on cutting-edge AI research and large-scale pre-training systems.
  • Access to advanced computing infrastructure and high-performance GPU clusters.
  • Collaborative global culture with exposure to world-class engineering and research talent.
  • Fast-paced and innovation-driven environment focused on pushing technological boundaries.
  • Opportunity to contribute to impactful products shaping the future of digital finance, AI, and decentralized technologies.
  • Professional growth opportunities within rapidly evolving AI and blockchain ecosystems.
  • Flexible and autonomous work culture that values creativity, experimentation, and ownership.


\n

How Jobgether works:

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

 Why Apply Through Jobgether? 

 

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

 

 

#LI-CL1

Similar Jobs

Apply Now