About me

Hello and welcome!

I am an Associate Director of Machine Learning at Relation Therapeutics.

My research interests lie within the fields of mathematics and computing. I am particularly interested in biomedical applications of machine learning to biomedical. I focus on improving the efficiency of the drug development pipeline, working closely with experimentalists, from target identification phase.

I have a PhD in Computer Science from University College London. During my PhD studies, I focused on developing models for mining, analysing, and integrating large scale data using graph theory and machine learning approaches. I worked multi-scale molecular and clinical data to uncover new biological knowledge with an eye towards improving patient diagnosis, prognosis, and treatment.

I am a graduate from the University of Oxford, MSc. in Applied Mathematics, and from Ecole Centrale de Lyon, BEng/MEng in general engineering.

Publications

P-9

Paul Scherer, Thomas Gaudelet, Alison Pouplin, Jyothish Soman, Lindsay Edwards, Jake P Taylor-King, PyRelationAL: A Library for Active Learning Research and Development, arXiv preprint arXiv:2205.11117.

Abstract

In constrained real-world scenarios where it is challenging or costly to generate data, disciplined methods for acquiring informative new data points are of fundamental importance for the efficient training of machine learning (ML) models. Active learning (AL) is a subfield of ML focused on the development of methods to iteratively and economically acquire data through strategically querying new data points that are the most useful for a particular task. Here, we introduce PyRelationAL, an open source library for AL research. We describe a modular toolkit that is compatible with diverse ML frameworks (e.g. PyTorch, Scikit-Learn, TensorFlow, JAX). Furthermore, to help accelerate research and development in the field, the library implements a number of published methods and provides API access to wide-ranging benchmark datasets and AL task configurations based on existing literature. The library is supplemented by an expansive set of tutorials, demos, and documentation to help users get started. We perform experiments on the PyRelationAL collection of benchmark datasets and showcase the considerable economies that AL can provide. PyRelationAL is maintained using modern software engineering practices - with an inclusive contributor code of conduct - to promote long term library quality and utilisation.

P-8

Paul Bertin, Jarrid Rector-Brooks, Deepak Sharma, Thomas Gaudelet, Andrew Anighoro, Torsten Gross, Francisco Martinez-Pena, Eileen L Tang, Cristian Regep, Jeremy Hayter, Maksym Korablyov, Nicholas Valiante, Almer van der Sloot, Mike Tyers, Charles Roberts, Michael M Bronstein, Luke L Lairson, Jake P Taylor-King, Yoshua Bengio, Recover: sequential model optimization platform for combination drug repurposing identifies novel synergistic compounds in vitro, arXiv preprint arXiv:2202.04202.

Abstract

Selecting optimal drug repurposing combinations for further preclinical development is a challenging technical feat. Due to the toxicity of many therapeutic agents (e.g., chemotherapy), practitioners have favoured selection of synergistic compounds whereby lower doses can be used whilst maintaining high efficacy. For a fixed small molecule library, an exhaustive combinatorial chemical screen becomes infeasible to perform for academic and industry laboratories alike. Deep learning models have achieved state-of-the-art results in silico for the prediction of synergy scores. However, databases of drug combinations are highly biased towards synergistic agents and these results do not necessarily generalise out of distribution. We employ a sequential model optimization search applied to a deep learning model to quickly discover highly synergistic drug combinations active against a cancer cell line, while requiring substantially less screening than an exhaustive evaluation. Through iteratively adapting the model to newly acquired data, after only 3 rounds of ML-guided experimentation (including a calibration round), we find that the set of combinations queried by our model is enriched for highly synergistic combinations. Remarkably, we rediscovered a synergistic drug combination that was later confirmed to be under study within clinical trials.

P-7

T. Gaudelet, Ben Day, Arian R Jamasb, Jyothish Soman, Cristian Regep, Gertrude Liu, Jeremy BR Hayter, Richard Vickers, Charles Roberts, Jian Tang, David Roblin, Tom L Blundell, Michael M Bronstein, Jake P Taylor-King, Utilising Graph Machine Learning within Drug Discovery and Development, Briefings in Bioinformatics, 2021.

Abstract

Graph Machine Learning (GML) is receiving growing interest within the pharmaceutical and biotechnology industries for its ability to model biomolecular structures, the functional relationships between them, and integrate multi-omic datasets-amongst other data types. Herein, we present a multidisciplinary academic-industrial review of the topic within the context of drug discovery and development. After introducing key terms and modelling approaches, we move chronologically through the drug development pipeline to identify and summarise work incorporating: target identification, design of small molecules and biologics, and drug repurposing. Whilst the field is still emerging, key milestones including repurposed drugs entering in vivo studies, suggest graph machine learning will become a modelling framework of choice within biomedical machine learning.

P-6

T. Gaudelet, N. Malod-Dognin, and N. Pržulj, Integrative Data Analytic Framework to Enhance Cancer Precision Medicine, Network and systems medicine, 2021.

Abstract

With the advancement of high-throughput biotechnologies, we increasingly accumulate biomedical data about diseases, especially cancer. There is a need for computational models and methods to sift through, integrate, and extract new knowledge from the diverse available data to improve the mechanistic understanding of diseases and patient care. To uncover molecular mechanisms and drug indications for specific cancer types, we develop an integrative framework able to harness a wide range of diverse molecular and pan-cancer data. We show that our approach outperforms competing methods and can identify new associations. Furthermore, through the joint integration of data sources, our framework can also uncover links between cancer types and molecular entities for which no prior knowledge is available. Our new framework is flexible and can be easily reformulated to study any biomedical problems.

P-5

J. Lugo-Martinez, D. Zeiberg, T. Gaudelet, N. Malod-Dognin, N. Pržulj, and P. Radivojac, Classification in biological networks with hypergraphlet kernels, Bioinformatics , September 2020, .

Abstract

Biological and cellular systems are often modeled as graphs in which vertices represent objects of interest (genes, proteins, drugs) and edges represent relational ties between these objects (binds-to, interacts-with, regulates). This approach has been highly successful owing to the theory, methodology and software that support analysis and learning on graphs. Graphs, however, suffer from information loss when modeling physical systems due to their inability to accurately represent multi-object relationships. Hypergraphs, a generalization of graphs, provide a framework to mitigate information loss and unify disparate graph-based methodologies. We present a hypergraph-based approach for modeling biological systems and formulate vertex classification, edge classification and link prediction problems on (hyper)graphs as instances of vertex classification on (extended, dual) hypergraphs. We then introduce a novel kernel method on vertex- and edge-labeled (colored) hypergraphs for analysis and learning. The method is based on exact and inexact (via hypergraph edit distances) enumeration of hypergraphlets; i.e., small hypergraphs rooted at a vertex of interest. We empirically evaluate this method on fifteen biological networks and show its potential use in a positive-unlabeled setting to estimate the interactome sizes in various species.

P-4

T. Gaudelet, N. Malod-Dognin, J. Sànchez-Valle, V. Pancaldi, A. Valencia, and N. Pržulj, Unveiling new disease, pathway, and gene associations via multi-scale neural networks, PloS one, Volume 15, Issue 4, April 2020, p.e0231059.

Abstract

Diseases involve complex modifications to the cellular machinery. The gene expression profile of the affected cells contains characteristic patterns linked to a disease. Hence, new biological knowledge about a disease can be extracted from these profiles, improving our ability to diagnose and assess disease risks. This knowledge can be used for drug re-purposing, or by physicians to evaluate a patient's condition and co-morbidity risk.

Here, we consider differential gene expressions obtained by microarray technology for patients diagnosed with various diseases. Based on these data and cellular multi-scale organization, we aim at uncovering disease--disease, disease--gene and disease--pathway associations. We propose a neural network with structure based on the multi-scale organization of proteins in a cell into biological pathways. We show that this model is able to correctly predict the diagnosis for the majority of patients. Through the analysis of the trained model, we predict disease--disease, disease--pathway, and disease--gene associations and validate the predictions by comparisons to known interactions and literature search, proposing putative explanations for the predictions.

P-3

T. Gaudelet, and N. Pržulj, Introduction to Graph and Network Theory, Analyzing Network Data in Biology and Medicine, Cambridge University Press, 2019, Pages 111-150.

Abstract

Hardly any entity exists in absolute isolation of others. It is usually a part of a larger system of greater complexity. This remains true at any scale. For instance, stars and planets interact via different forces of attraction and repulsion that shape the universe. Species interact tightly through predator-prey relationships to form an often fragile ecosystem. Neurons in a brain interact to carry out complex tasks ranging from limb motion to information processing. Studying single entities in isolation of others is crucial to our understanding. However, it is also essential to study their roles as part of wider systems of interacting components. The organisation of such systems is far from random and reflects the function of the system. Understanding the organisational principles of the systems can shed light on the impact of changes on the system or on its environment.

Graphs (networks) are means of representing entities and their interactions, while abstracting much of the data about them. These representations are needed to develop tools to investigate the organisation of complex systems. Everyday examples of network representations of real systems include maps of public transport networks, which connect various stations, and the World Wide Web. In biology, many systems can naturally be modeled by networks.

In this book chapter, we introduce elements of graph and network theory. The target audience is students enrolled in bioinformatics modules.

P-2

T. Gaudelet, N. Malod-Dognin, and N. Pržulj, Higher-order molecular organization as a source of biological function, Bioinformatics, Volume 34, Issue 17, September 2018, Pages i944–i953.

Abstract

Molecular interactions have widely been modelled as networks. The local wiring patterns around molecules in molecular networks are linked with their biological functions. However, networks model only pairwise interactions between molecules and cannot explicitly and directly capture the higher-order molecular organization, such as protein complexes and pathways. Hence, we ask if hypergraphs (hypernetworks), that directly capture entire complexes and pathways along with protein–protein interactions (PPIs), carry additional functional information beyond what can be uncovered from networks of pairwise molecular interactions. The mathematical formalism of a hypergraph has long been known, but not often used in studying molecular networks due to the lack of sophisticated algorithms for mining the underlying biological information hidden in the wiring patterns of molecular systems modelled as hypernetworks.

We propose a new, multi-scale, protein interaction hypernetwork model that utilizes hypergraphs to capture different scales of protein organization, including PPIs, protein complexes and pathways. In analogy to graphlets, we introduce hypergraphlets, small, connected, non-isomorphic, induced sub-hypergraphs of a hypergraph, to quantify the local wiring patterns of these multi-scale molecular hypergraphs and to mine them for new biological information. We apply them to model the multi-scale protein networks of bakers yeast and human and show that the higher-order molecular organization captured by these hypergraphs is strongly related to the underlying biology. Importantly, we demonstrate that our new models and data mining tools reveal different, but complementary biological information compared with classical PPI networks. We apply our hypergraphlets to successfully predict biological functions of uncharacterized proteins.

P-1

H. Tsuchiya, K. Manabe, T. Gaudelet, T. Moriya, K. Suwabe, M. Tenjimbayashi, K. Kyong, F. Gillot, and S. Shiratori. Improvement of heat transfer by promoting dropwise condensation using electrospun polytetrafluoroethylene thin films, New Journal of Chemistry 41, no. 3 (2017): 982–991.

Abstract

Vapor condensation is a crucial part of a broad range of industrial applications including power generation, water harvesting, and air conditioning. Hydrophobic and superhydrophobic surfaces promote dropwise condensation in vapor-filled environments and increase their heat transfer coefficients more than filmwise condensation on hydrophilic surfaces. Although dropwise condensation can lead to energy-efficient transfer, it is hard to achieve stable dropwise condensation in high-temperature environments. To decide the best conditions for achieving higher heat transfer is also difficult because the heat transfer coefficient is influenced by not only surface wettability but also surface structures of thin films and substrates. Herein, we fabricated thin films with different wettabilities and surface structures using polytetrafluoroethylene (PTFE) which show high heat resistance to determine the best conditions for heat transfer. Several different films were prepared by electrospinning a mixed solution of PTFE and polyvinyl alcohol on aluminum (Al) and copper (Cu) tubes. After annealing them, the PTFE thin films enhanced heat transfer performance and showed stable dropwise condensation in high-temperature environments. The films fabricated by electrospinning a solution containing 66 wt% PTFE displayed the highest heat transfer coefficients, with heat transfer coefficients 64% and 61% greater than those of uncoated Al and Cu tubes, respectively. That is because homogeneous superhydrophobic surfaces that showed the highest departure frequency of condensed water droplets were fabricated using 66 wt% PTFE. The results suggest that these electrospun PTFE thin films would demonstrate excellent potential for use on the surface of heat exchangers in various industries.

C

Workshop, Conference Talks & Posters

  • ISMB/ECCB 2019
    Poster

    T. Gaudelet, N. Malod-Dognin, J. Sànchez-Valle, V. Pancaldi, A. Valencia, and N. Pržulj,Unveiling new disease, pathway, and gene associations via multi-scale neural networks, ISMB-ECCB, Basel, Switzerland, July 21-25, 2019.
  • ECCB 2018
    Talk

    T. Gaudelet, N. Malod-Dognin, and N. Pržulj, Higher order molecular organisation as a source of biological function, ECCB, Athens, Greece, September 8-12, 2018.
  • NetSci 2018
    Poster

    N. Malod-Dognin, T. Gaudelet, S.F.L. Windels, and N. Pržulj, Higher-order modelling and integration of biological data for precision medicine, International School and Conference on Network Science, Satellite Symposium on Network Medicine, Paris, France, June 11-15, 2018.
  • ECTB 2018
    Talk

    N. Malod-Dognin, T. Gaudelet, S.F.L. Windels, and N. Pržulj, Towards higher-order modelling and integration of biological data for precision medicine, European Conference on Translational Bioinformatics, Barcelona, Spain, April 16-17, 2018.
  • Complex Networks 2017
    Talk

    T. Gaudelet, N. Malod-Dognin, J. Lugo-Martinez, P. Radivojac, and N. Pržulj, Hypergraphlets Give Insight into Multi-Scale Organisation of Molecular Networks, International Conference on Complex Networks and Their Applications, Lyon, France, November 29 - December 01, 2017.
  • ISMB/ECCB 2017
    Poster

    T. Gaudelet, N. Malod-Dognin, J. Lugo-Martinez, P. Radivojac, and N. Pržulj, Hypergraphlets Give Insight into Multi-Scale Organisation of Molecular Networks, ISMB-ECCB, Prague, Czech Republic, July 21-25, 2017.
  • IX UCM Modelling week 2015
    Workshop

    E.H. Serrano, S.I. Moreno, R.P. Ochoa, M.V. Domínguez, and T. Gaudelet, Generating a caplet volatility surface, IX UCM Modelling week, Madrid, Spain, June, 2015.
  • JSAP 2014
    Poster

    T. Gaudelet, K. Suwabe, K. Kyung, F. Gillot, S. Shiratori, Investigation of dropwise condensation on PTFE-based coated surfaces, JSAP, Tokyo, Japan, March 18, 2014.

Resume

Work experience


  • 2020-
    Present

    Associate Director Machine Learning
    Relation Therapeutics

    Developping state-of-the-art machine learning models to address unmet clinical needs.

  • 2016-
    2019

    Teaching Assistant
    University College London, U.K.

    • Advanced Deep Learning and Reinforcement Learning, Prof. Thore Graepel and DeepMind researchers
    • Introduction to Machine Learning, Dr. Iasonas Kokkinos
    • Robotics Programming, Prof. Ghita Kouadri Mostefaoui

  • 2016

    R&D Intern
    Fuel3D Technologies

    Investigated and implemented algorithms, using Matlab and C++, to improve on the quality of the reconstruction of three dimensional surfaces using Vision and Photometric Stereo.

  • 2014

    Development Intern
    PathControl

    Developed the user interface of the software for Passive Magnetic Ranging (PMR) created by PathControl. It involved mainly Matlab and the associated GUI Toolbox.

  • 2013-
    2014

    Research Intern
    Keio University, Tokyo

    Investigated the condensation behaviour of water on PTFE-based coating. The objective was to enhance heat exchanges through the surface. We presented the results at the JSAP conference in Tokyo.

  • 2013

    Research Intern
    EDF

    Examined the interest of nanofilms/nanoparticles for energy production and nuclear central safety.

Education


  • 2016-
    2020

    Ph.D Candidate
    University College London, U.K.

    Department of Computer Science.
    Supervised by Nataša Pržulj
    Integration of multi-scale protein interactions for biomedical data analysis.

  • 2019

    Research Student
    Stanford University, California, U.S.A.

    Department of Computer Science.
    Supervised by Prof. Jure Leskovec and Prof. Marinka Zitnik

  • 2014-
    2015

    Master of Science
    University of Oxford, U.K.

    Mathematical Modelling and Scientific Computing
    Dissertation: investigated and implemented pattern recognition using multi-layer neural networks, highlighting their robustness.

  • 2011-
    2016

    BEng/MEng General Engineering
    Ecole Centrale Lyon, France

    Developed a software designed to read multimedia files during the first year project. During the second year, studied a solution to optimise the shape of vehicle components based on the level-set method.

  • 2008-
    2011

    Intensive Preparatory Courses
    Lycée Pierre de Fermat, Toulouse, France

    Grandes Ecoles entry exam preparatory courses: Mathematics, Physics, Computer Science.

Past projects

Fuel3D Internship
2016

Three-source Photometic Stereo.
(Confidential report )

Abstract

We investigate different leads to improve photometric stereo for the reconstruction of three-dimensional shape based on a set of input pictures captured with Fuel3D Technologies Limited scanner SCANIFY®.

MSc Dissertation
2015

Short-range Impact of Damage on Object Recognition in a trained Neuronal Network.

Abstract

We investigate the impact of damage on a neuronal network trained to recognise objects. A neuronal network is formed by a set of nodes that represent neurons, and edges that represent the connections between the neurons. To set the stage, we review some existing models that describe the dynamics of individual neurons. We then focus on Integrate-and-Fire (IF) models that we use for this project. We then discuss the numerical method that we use for our model of interacting IF neurons. We then construct our network using a multilayer-network formalism. We consider a simple multilayer network architecture that represents the interactions between the retina and the primal visual cortex (V1) in the brain. We train the system to recognise objects and differentiate between them using the "continuous transformation learning rule", which is based on relative spike times of pre-synaptic and post-synaptic neurons. To highlight the robustness and stability of the trained neuronal network, we simulate damage affecting the connections in the network and measure the performance of the deteriorated systems. A network has good performances if the neurons in the V1 representation differentiate successfully between different objects. In this situation, a subset of the neurons responds strongly to the stimuli corresponding to one object and weakly to the other. Our study provides preliminary insights on the impact of damage on connections between two neuronal subsystems.

Pathcontrol Internship
2014

GUI Development for Passive Magnetic Ranging.
(Confidential report)

Abstract

The Passive Magnetic Ranging, or PMR, is a detection method based on the magnetic field produced by an object one tries to locate. The idea is to drive a magnetic object into the ground and to monitor its magnetic field. Then using the magnetic interferences produced by the target one can calculate its position relatively to the measurement devices. In our case, this allows the user to control the borehole's trajectory in order to achieve, for instance, the interception of the second well. The work entails developping the graphical user interface for PathControl's PMR software. It also involve detailing the method used as well as the software layout and how to use it.

Internship Research
2014

Investigation of Condensation Mode on PTFE-based Coating.

Abstract

The enhancement of heat transfers during condensation is at the core of numerous researches as it will lead to increased efficiencies and productivities for various fields especially for power generation. One of the main solutions explored at the current time is to engineer a surface coating in order to promote dropwise condensation, in opposition of the filmwise condensation generally observed in actual condensing systems. With this goal in mind, we investigate the use of polytetrafluoroethylene-coated (PTFE) surfaces to improve heat transfers.

EDF Internship
2013

Nanotechnologies in Energy Production.
(Confidential report)

Abstract

We present the interest of superhydrophobic and superhydrophilic surfaces for the enhancement of heat transfers. For this purpose, we review the literature on the subject and investigate the impact of these surfaces for two applications: improvement of energy production and improvement of cooling methods. Furthermore, we evaluate different hydrophobic products proposed on the market and try to make out which ones would suit EDF best. Finally, we conduct and expose the results of a study on the effect of a magnetite deposit on a surface's wettability.

Research Project
2013

Shape Optimisation of Vehicle Components using the Level-Set Method.
(Report in french)

Abstract

We investigate shape and topologic optimisation based on the Level-Set method developed by the mathematicians J.A. Sethian adn S. Osher. We start by detailing the theory and the optimisation algorithm. We then illustrate how the method works on a simple 2D case. The algorithm is implemented combining Matlab and Code Aster.

Group project
2012

Creation of a Smart Multimedia Files Manager.
(Report in french)

Abstract

This study project has for goal the development of a multimedia software, that we baptised Tag'n'Link. The project was proposed by the laboratory LIRIS of Ecole Centrale Lyon. LIRIS stands for Computer Science Laboratory for Image Processing and Information Systems (Laboratoire d'Informatique en Image et Systèmes d'Information). LIRIS was developing algorithms to detect concepts and emotions in pictures or music files. An emotion being defined by a degree of positivity (joy/sadness) and a level of activity. A concept is for instance a word or an object, for instance a car in a picture. The idea of the projects was to use these algorithms as a black box and to develop a software around them in order to exploit them. The main noveltie was to use both the algorithm and user tags associated to the media files in order to easily generate playlists, slideshows, and combinations of the two according to a certain degree of similarity.

Get in touch