Talk Abstract: Gold standard data: lessons from the trenches
The first stage in a data science project is often to collect training data. However, getting a good data set is surprisingly tricky and takes longer than one expects. This talk describes our experiences in labelling gold-standard data and the lessons we learnt the hard way. We will present three case studies from natural language processing and discuss the challenges we encountered.
Bio: I am the CTO and cofounder at Teebly, a communication platform for high-trust business. I have previously worked as a data scientist and software engineer at several early-stage companies and have taught natural language processing at the University of Sussex