Back to Jobs

Associate Data Scientist - Environmental Modeling

Remote, USA Full-time Posted 2025-11-24

Bayer is a global company focused on science and innovation in agriculture. They are seeking an Associate Data Scientist specializing in Environmental Modeling to design and build statistical and machine learning models for crop yield testing, automate analytics workflows, and develop methodologies for integrating various data types. The role involves collaboration to provide data-driven solutions to business problems and requires a strong foundation in quantitative fields.


Responsibilities

  • Design & build statistical, machine learning and deep learning models to quantify subfield-scale yield testing environments of crops
  • Automate analytics workflows
  • Develop next generation methodologies for integrative usage of genomic, phenomic & environmental data
  • Determine environmental correlations among testing locations & global regions
  • Design statistical modeling frameworks & prediction models to drive product placement recommendations and yield predictions
  • Collaborate to provide data-driven statistical solutions to business problems
  • Using object-oriented programming techniques to write Python packages to analyze high dimensional environmental data with Gap Statistics
  • Developing & selecting unsupervised learning algorithms to analyze high-dimensional environmental data, including K-means, agglomerative hierarchical clustering, and/or Gaussian mixture models
  • Using statistical & machine learning packages, including Tensorflow, Pandas, Multiprocessing, Joblib, Numpy, SciPy, Scikit-Learn, Keras, PyTorch, PySpark, and/or Dask, to develop discovery and production ready models for analysis of phenotypic and geospatial data
  • Adhering to and/or enforcing coding best practices
  • Using code management tools, including GitHub, to ensure the reproducibility of data science
  • Aggregating & summarizing complex datasets using GCP BigQuery, Presto, Superset, and AWS RedShift
  • Building heat, drought, and cold stress models over global regions using high dimensional environmental data
  • Automating workflows using AWS Sagemaker, Google Cloud Platform, Airflow, & Docker
  • Performing data operations, including spatial joins, zonal statistics, & re-projecting
  • Quantifying similarity scores between different environments & using distance metrics to compare multivariate time series environmental data related to major row crops
  • Visualizing geospatial data, including vector & raster files, using QGIS, Google BigQuery, and/or Python libraries
  • Performing data quality checks using deep learning-based anomaly detection on time-series data
  • Designing, training & optimizing neural networks for generating embeddings using AutoEncoder for multivariate time series-based data

Skills

  • Master's in Statistics, Mathematics, or closely related quantitative field
  • 1 yr experience using object-oriented programming techniques to write Python packages to analyze high dimensional environmental data with Gap Statistics
  • Developing & selecting unsupervised learning algorithms to analyze high-dimensional environmental data, including K-means, agglomerative hierarchical clustering, and/or Gaussian mixture models
  • Using statistical & machine learning packages, including Tensorflow, Pandas, Multiprocessing, Joblib, Numpy, SciPy, Scikit-Learn, Keras, PyTorch, PySpark, and/or Dask, to develop discovery and production ready models for analysis of phenotypic and geospatial data
  • Adhering to and/or enforcing coding best practices
  • Using code management tools, including GitHub, to ensure the reproducibility of data science
  • Aggregating & summarizing complex datasets using GCP BigQuery, Presto, Superset, and AWS RedShift
  • Building heat, drought, and cold stress models over global regions using high dimensional environmental data
  • Automating workflows using AWS Sagemaker, Google Cloud Platform, Airflow, & Docker
  • Performing data operations, including spatial joins, zonal statistics, & re-projecting
  • Quantifying similarity scores between different environments & using distance metrics to compare multivariate time series environmental data related to major row crops
  • Visualizing geospatial data, including vector & raster files, using QGIS, Google BigQuery, and/or Python libraries
  • Performing data quality checks using deep learning-based anomaly detection on time-series data
  • Designing, training & optimizing neural networks for generating embeddings using AutoEncoder for multivariate time series-based data

Benefits

  • Health care
  • Vision
  • Dental
  • Retirement
  • PTO
  • Sick leave

Company Overview

  • Bayer is a life science company that specializes in the areas of health care and agriculture. It was founded in 1863, and is headquartered in Leverkusen, Nordrhein-Westfalen, DEU, with a workforce of 10001+ employees. Its website is https://www.bayer.com.

  • Company H1B Sponsorship

  • Bayer has a track record of offering H1B sponsorships, with 62 in 2025, 71 in 2024, 76 in 2023, 141 in 2022, 138 in 2021, 117 in 2020. Please note that this does not guarantee sponsorship for this specific role.

  •   Apply To This Job

    Similar Jobs

    [Remote] E-commerce Product Manager (Contract)

    Remote, USA Full-time

    SQL Developer

    Remote, USA Full-time

    AI Engineer Intern

    Remote, USA Full-time

    AI-Based Cybersecurity Research Intern

    Remote, USA Full-time

    TalentBurst | Vet Tech Services Specialist MO | saint joseph, mo

    Remote, USA Full-time

    Data Science and Analytics Senior Manager (Virtual)

    Remote, USA Full-time

    Business Analyst

    Remote, USA Full-time

    [Remote] HCAI - HEALTH INFORMATION AND ELECTRONIC RECORDS ANALYST TRAINING PROGRAM (EHR SUPPORT INTERNSHIP)

    Remote, USA Full-time

    Strategy Consultant

    Remote, USA Full-time

    Senior Manager, CRM Systems Administration

    Remote, USA Full-time

    Director, Contract Revenue Accounting (Remote Opportunity)

    Remote, USA Full-time

    QA Auditor III ISQA- Fully remote!

    Remote, USA Full-time

    Experienced Route Sales Trainee – Entry-Level Position for Aspiring Sales Professionals in the Beverage Industry

    Remote, USA Full-time

    Chronic Care Manager - LPN Remote

    Remote, USA Full-time

    bolthires Moderator Job ( Social Media Moderator ) $24/Hour

    Remote, USA Full-time

    Physician Clinical Reviewer- GI - REMOTE

    Remote, USA Full-time

    Data Entry Specialist (Weekend)

    Remote, USA Full-time

    **Experienced Customer Service Representative – Work From Home Opportunities at arenaflex**

    Remote, USA Full-time

    Work From Home Jobs / Data Entry Clerk - Typing (Remote)

    Remote, USA Full-time

    Emergency Response Paramedic – Located in Rancho Cucamonga, CA in Rancho Cucamonga, CA – (job id: 1691909756)

    Remote, USA Full-time