Artificial Intelligence (AI) Engineer / Developer (Remote)
About UsStatheros is a small DEFTECH firm focused on developing cutting-edge AI and autonomy systems for the US Department of Defense. Our team is passionate about building intelligent systems that solve complex problems. We are looking for a talented AI Engineer specializing in Proximal Policy Optimization (PPO) to lead the development of AI-enabled algorithms that automate the operation of air traffic radar systems.
Job Responsibilities
• Design, implement, and optimize Proximal Policy Optimization (PPO) algorithms for domain-specific use cases.
• Develop and train reinforcement learning models for real-world applications, focusing on efficiency and scalability.
• Collaborate with cross-functional teams to integrate PPO models into production systems.
• Analyze model performance and experiment with hyperparameter tuning to achieve optimal results.
• Stay up-to-date with the latest research and advancements in reinforcement learning and apply them to enhance existing solutions.
• Build robust pipelines for training, evaluation, and deployment of RL models.
• Document workflows, methodologies, and code for reproducibility and knowledge sharing.
Qualifications
• Educational Background: Bachelor's or Master's degree in Computer Science, Machine Learning, AI, Mathematics, or related fields. Ph.D. is a plus.
• Experience:
• 4+ years of professional experience in machine learning, with a focus on reinforcement learning.
• Demonstrated expertise in implementing and optimizing PPO or similar reinforcement learning algorithms.
• Hands-on experience with frameworks like TensorFlow, PyTorch, or JAX.
• Technical Skills:
• Strong programming skills in Python; familiarity with Rust or other languages is a plus.
• Proficiency in designing and running RL experiments in simulated or real-world environments.
• Experience with distributed training systems for reinforcement learning.
• Solid understanding of policy gradient methods and reinforcement learning theory.
• Soft Skills:
• Excellent problem-solving skills and the ability to work in a collaborative, fast-paced environment.
• Strong communication skills for presenting findings and collaborating with interdisciplinary teams.
Preferred Qualifications
• Experience in applying PPO to [specific domain, e.g., robotics, gaming, finance, etc.]
• Familiarity with OpenAI Gym, RLlib, or other RL development environments
• Knowledge of parallel computing and GPU acceleration for large-scale RL tasks
What We Offer
• Remote work location.
• Competitive salary.
• Flexible work schedule.
• Opportunities for professional development and research contributions
• Access to state-of-the-art resources and tools for AI development.
• The chance to work on groundbreaking projects with a talented and passionate team.
Employment Type: CONTRACTOR
Apply tot his job
Apply To this Job