Back to schedule


10 South Pl, London EC2M 7EB


9:00 AM - 5:00 PM



Data Science Festival – MainStage Day

CodeNode @
9:00 am-
5:00 pm
April 13th , 2019, Saturday

This free day will feature four-stream rooms with over 500 people attending. A day of lectures with over 40 top speakers. It will also feature our partners in our exhibitor section of the event. An entire day to learn, mingle and be inspired. For the people by the people!

Please register for a ballot ticket here: GET TICKETS

Please review how the tickets work here:

Speakers to date, please check for up to date speaker information.

Data Science Festival Mainstage (Ballot ticket only)

Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Mainstage day on Saturday 13th April 2019, the ballot will be drawn on the 5th April 2019. Those randomly selected will then be e-mailed tickets for the event, with the joining details.

Please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. Tickets are non-transferable. The Data Science Festival is the first of its kind as the only community-led, free to attend Data Science Festival in the UK.


Akmal B. Chaudhri
GridGain Systems
Talk abstract: Machine and Deep Learning with In-Memory Computing. Apache Ignite is an open source memory-centric distributed database, caching, and processing platform used for transactional, analytical and streaming workloads -- delivering in-memory speeds at petabyte scale. Using demos, this presentation will provide an overview of the Machine Learning and Deep Learning capabilities…
Alex Combessie
Talk Abstract: The Making Of a Time Series Forecasting automated pipeline. The story of my 20%-side-project at Dataiku: how I designed and created a visual plugin for business users and data scientist to forecast time series automatically without code. A journey from understanding the literature of time series forecasting, packaging the latest open source…
Alex Dean
Snowplow Analytics
Talk abstract: Why high quality data is crucial for your machine learning models. It’s cliche, but garbage in truly means garbage out. In his talk Alex Dean, Co-Founder and CEO at Snowplow Analytics, will talk about how your data quality will make or break your machine learning models. He’ll dive…
Chris Samiullah
Train In Data - Udemy
Talk Abstract: Building and Deploying Reproducible Machine Learning Pipelines Deployment of machine learning (ML) models, or simply, putting ML models into production, is fundamentally about bridging the gap between the research environment and live systems. Successful deployments make our models available so they can be easily accessed by both internal…
Daniel Scott
Talk Abstract: Transforming the management of Natural and Built Assets through Analytics. Across the globe, £trillions are being spent each year on infrastructure to keep our lights on, our roads open and the water running from our taps.  Despite this, it is estimated that the current level of investment worldwide…
David Abelman
Talk Abstract: Product Analytics: The secret sauce! Product Analytics is a key part to Facebook’s success as a data driven company. We’ll explore how Data Scientists in the Product Analytics team helped drive the success of Workplace, Facebook’s enterprise offering built out of London. Bio: David Abelman is a Data Science Manager…
David Loughlan
Data Idols
Bio: David is the founder of Data Idols and the Data Science Festival.  He has spent years as a contract database administrator, delivering projects for some of the UK's leading companies.  During his time as a contractor, much of his work was sourced through recruitment agencies, it was a painful…
Dr Jo Judge
National Biodiversity Network
Talk abstract: How data can save the planet We all see the messages that the natural world and biodiversity are under threat, some native species are declining and climate change is affecting the wildlife we see and when and where we see it. But how do we know? This talk will…
Dr Merve Alanyali
Talk Abstract: A picture is worth a thousand words. But what does it say? An unprecedented amount of data is being generated on a daily basis. Automatic processing and analyses of these data sets therefore offer numerous benefits to decision makers in governmental and commercial arena. Due to the diverse…
Ed Klinger and Courtenay Mansel
Talk Abstract: Real-time risk analysis at scale: insuring the world’s largest drone fleets with Flock. Flock has built the world’s first geospatial risk analysis tool for the drone industry, using real-time data (such as weather conditions and proximity to high risk areas) to quantify and insure drone flights on an…
Fabrice Durier
Royal Mail
Talk Abstract: Efficient Route Optimisation, the Future of Parcel Delivery. A recognised brand in the logistic sector, Royal Mail is the market leader in delivering parcels to British homes. However, in a context of increasing parcel traffic and growing expectations, Royal Mail is always working to adjust its operations to…
Gabriel Straub
Talk Abstract: Gabriel will be talking about how to use a focus product in order to create capability in an organisation. The BBC has been a technology company since 1922. This means that there are a lot of different data sources. The BBC also has a long editorial tradition and as…
Gatis Seja
Talk Abstract: Creating Data Pipelines: Build Framework not Pipelines. Data pipelines are necessary for the flow of information from its source to its consumers, typically data scientists, analysts and software developers. Managing data flow from many sources is a complex task where the maintenance cost limits scale of being able…
Gianluca Campanella
Talk Abstract: MLOps: are we there yet? Situated at the intersection of R&D and IT operations, MLOps is crucial to realising the potential of Data Science: after all, what good is a model that's superbly accurate but never used? Why capture exquisitely complex relationships automatically yet require constant human supervision…
Jakub Langr
Talk Abstract: Progressing with GANs: Progressive growing for increasing quality, stability and variation. Generative Adversarial Networks (GANs) have recently reached few tremendous milestones: generating full-HD synthetic faces, to image compression better than the state of the art to cryptography. In this talk we will start with the basics of generative models,…
Jan Teichmann
Talk Abstract: Solving the real challenge of Data Science -- Productionisation -- with proven solutions straight from the front lines. Making data science a success is really hard with up to 85% of projects and initiatives around big data and data science failing according to Gartner. The reasons are complex…
Jessica Van Der Kroef
Talk Abstract: AB testing in mobile games Farm Heroes Saga is one of the few games that has made over a billion dollars since its launch, and it is still going strong after 5 years. It’s important to keep the game feeling fresh and interesting to our millions of players…
Kasia Kulma
Mango Solutions
Talk Abstract: Integrating empathy in the Data Science process. Despite the fast growth of talent in analytics and more sophisticated technical skill-sets, the success rate of data science projects remains low. It may be because people-related factors are top challenges in such projects, e.g. lack of clear question, company politics…
Kostas Perifanos
Magda Piatkowska & Clara Higuera
Talk Abstract: Slow burning, ground breaking and blockbuster data applications at the BBC. Optimising demand and resources for data science, data engineering and analytics products in a large organisation. Data solutions is a growing team within BBC. We are using machine learning to solve strategic problems and serve our audience better. In…
Marios Michailidis
Talk Abstract: An audience with a Kaggle Grandmaster. Over the last 6 years, Marios has been competing in Kaggle competitions and has achieved the number 1 ranking.  Marios will be sharing his experiences as a data scientist on his journey to the top of the kaggle rankings, the lessons he…
Mark Pinkerton
Oasis LMF
Talk Abstract: Catastrophe modelling: applying science, data science and open source software to managing risk. Catastrophe modelling is a data driven discipline for understanding and managing the risk of natural catastrophes, developed by the global insurance industry over the last 30 years. It has growing applicability outside of insurance for…
Sandeep Karkhanis
Talk Abstract: Lessons learnt from applying Data Science for Social Good. AI, Data Science, Machine Learning - it would be nigh impossible to not have come across these words in the newspapers, social media or television often in varying contexts... We live in exciting times where data is omni-present and…
Satya Singh
Talk Abstract: Ethics and Impact, the Humanity in Data science How do we ensure that we don’t forget about humanity when it comes to data science? How do we make sure that data science has a positive social and emotional impact? Well, we need to understand that data is not…
Sid Shekhar
Token Analyst
Talk Abstract: Learning about the future of money from analyzing blockchain data. A whirlwind tour of the exciting possibilities offered by analyzing blockchain data - including appraising the economic potential opened up by smart contract operations, assessing the risk involved with storing and trading crypto assets on exchanges and uncovering…
Simon Greenman
Best Practise AI
Talk Abstract: Who’s Going to Make Money in AI and Machine Learning? We’re currently experiencing an AI gold rush. Billions are being invested. AI startups abound. Google, Amazon, and Microsoft are duking it out for AI supremacy. Corporations are scrambling to ensure they adopt AI ahead of their competitors while looking…
Soledad Galli
Train In Data
Talk Abstract: Building and Deploying Reproducible Machine Learning Pipelines Deployment of machine learning (ML) models, or simply, putting ML models into production, is fundamentally about bridging the gap between the research environment and live systems. Successful deployments make our models available so they can be easily accessed by both internal…
Thomas Bartley
Talk Abstract: The risk of unintended information disclosure in data publishing. Sensitive information about individuals can be recovered from different types of data releases. This presentation will explore the privacy risks in publishing data in different formats and introduce privacy techniques to defend against them. From low-dimensional microdata files and…
Tom Ewing
Mango Solutions
Bio: Tom is a Senior Data Scientist for Mango Solutions and was previously Principal Data Scientist at the Department for Transport and has worked most of his career surrounded by data. Talk Abstract: A lot of talks often gloss over the bad things or mad panics that happen on projects,…
Tom Richardson
Talk Abstract: Modelling memory with busuu’s Vocab Trainer. Ever wondered how language learning apps like busuu decide which words are burned into your memory and which need to be reviewed? In this talk we describe the spaced repetition model behind busuu’s Vocab Trainer which predicts the rate at which your…