Offre d'emploi

Principal Data Scientist - Machine Learning for Protein Design

Principal Data Scientist - Machine Learning for Protein Design

Postuler à l'offre

Date limite de candidature
Date limite pour postuler : 31.05.22


Corps de texte

About the Company:

Sanofi is a global life sciences company committed to improving access to healthcare and supporting the people we serve throughout the continuum of care. From prevention to treatment, Sanofi transforms scientific innovation into healthcare solutions, in human vaccines, rare diseases, multiple sclerosis, oncology, immunology, infectious diseases, diabetes and cardiovascular solutions and consumer healthcare. More than 110,000 people in over 100 countries at Sanofi are dedicated to make a difference on patients’ daily life, wherever they live and enable them to enjoy a healthier life. As a company with a global vision of drug development and a highly regarded corporate culture, Sanofi is recognized as one of the best pharmaceutical companies in the world and is pioneering the application of Artificial Intelligence (AI) in the R&D organization including drug discovery, chemical manufacturing and control, translational research, clinical development, and regulatory document management and submission. Details of the organization and the company’s mission and goals can be found on our website (


Artificial Intelligence (AI) and Machine Learning (ML) algorithms can significantly speed up drug discovery and shorten drug development and identification of patients for clinical trials thereby creating better medicines that save lives. AI and Deep Analytics (AIDA) is a critical group in Digital and Data Science (DDS) organization at Sanofi R&D focused on applications of AI/ML and Deep Learning (DL) in drug design, multi-omics diseases modeling, drug development, and analysis of outcomes of clinical trials.  

Our existing research and development areas include Omics Data Science applied to single-cell RNA sequences, multi-omics data integration, and real word data (RWD); SMEs and Biologics Drug Design; Natural Language Processing (NLP); Deep Learning-based Imaging and bioimaging for digital pathology and Spatial Biology; digital signal processing (DSP) and machine learning applied to digital health and patient-generated data from wearables.    

Scientists in our team come from diverse backgrounds in computational sciences and engineering with strong expertise in AI/ML, deep learning, biostatistics and algorithms.  

We are seeking a Principal/Senior Data Scientist to join the AI and Deep Analytics (AIDA), in silico Drug Design (isDD) team. isDD closely interacts with other scientific platforms at Sanofi R&D to identify and optimize compounds with modalities ranging from small organic compounds to multi-specific biotherapeutics.

The successful candidate will work with other scientists to apply cutting-edge computation, Machine Learning/Deep Learning approaches to resolve challenges in real-world drug discovery. The successful candidate will contribute to accelerating and improving the process of design and engineering of novel biologics drug candidates and make an impact on patient lives.

The responsibilities of the senior data scientist in AI and Deep Analytics will include:  

  • Apply and develop artificial intelligence and machine learning (AI/ML) approaches (e.g. classification, clustering, machine learning, deep learning) on pharma research data sets (eg activity, function, ADME properties, physico-chemical properties…)

  • Building models from internal and external data sources, algorithms, simulations, and performance evaluation by writing code and using state-of-the art machine learning technologies.

  •  Close interactions with other data scientists as well as research scientists in core scientific platforms focusing on protein therapeutics, in an international context (US, Europe, China)

  • Update and report relevant results to interdisciplinary project teams and stakeholders

  • Maintain a keen awareness of recent developments in data science and bioinformatics and state-of-the-art of AI/ML/DL algorithms and research results

  • Active engagement in evaluation and coordination of both academic and startup collaborations as well as outsourcing partners.

Qualifications & Requirements:

  • PhD in a field related to AI/ML or Data Analytics such as: Computer Science, Mathematics, Statistics, Physics, Biophysics, Computational Biology or Engineering Sciences.

  • 3+ years of industry experience with a track record of applying ML/Deep Learning (DL) approaches to solve molecule-related problems. Familiarity with protein structure or sequence featurization/embeddings.

  • Strong familiarity with advanced statistics, ML/DL techniques including various network architectures (CNNs, GANs, RNNs, Auto-Encoders, Transformers, PLM etc.), regularization, embeddings, loss-functions, optimization strategies, or reinforcement learning techniques.

  • Proficiency in Python and deep learning libraries such as PyTorch, TensorFlow, Keras.

  • Familiarity with data visualization and dimensionality reduction algorithms

  • Ability to develop, benchmark and apply predictive algorithms to generate hypotheses

  • Comfortable working in cloud and high-performance computational environments (e.g. AWS)

  • Excellent written and verbal communication, strong tropism for teamwork

  • Understanding of pharma R&D process is a plus.

  • A change agent with a combination of business, science & technology, and diplomatic skills

At Sanofi R&D North America, we deliver meaningful solutions for patients. We transform science into breakthrough, best-in-class and first-in-class medicines and vaccines. We believe in creating a diverse and inclusive workforce – and workplace – which brings together the collective brainpower of over 2,000 colleagues and provides you with an exciting place to grow and develop. We set the bar high, and we deliver. Join us and together we will build on our trusted legacy of breakthroughs for society.

Sanofi Inc. and its U.S. affiliates are Equal Opportunity and Affirmative Action employers committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race; color; creed; religion; national origin; age; ancestry; nationality; marital, domestic partnership or civil union status; sex, gender, gender identity or expression; affectional or sexual orientation; disability; veteran or military status or liability for military status; domestic violence victim status; atypical cellular or blood trait; genetic information (including the refusal to submit to genetic testing) or any other characteristic protected by law.