NLP and tagging: text analytics and predictions on natural language
The year started on a techy note with the first hands-on event on Natural Language Processing! The event was built around CityPolarna.se’s event-tagging algorithm and our quest to produce a better one. Johan’s CityPolarna is an event website where everyone can host an event and attend others’ in order to do fun things in their area and meet new friends - not unlike Barrel.ai but on a larger scale. The events that are created are suggested with a set of tags so that searchability increases for likeminded people. The thought was here to use Natural Language Processing, NLP, in order to create a predictive model on what tags each event should be assigned with.
The event started out with a review of the basics of NLP by Oscar to bring everyone up to speed on the techniques and glossary, some real cases from previous consulting assignments as well as the basics of building NLP pipelines with preprocessing and model creation. Johan then took over and gave us the back story of CityPolarna and a deeper look into the data and the previously built algorithm for tagging of events.
Some pizza slizes later and we were down to business: the laboration was of course set out as a competition where every team go to choose different pre-processing methods on the text data and then an algorithm of their choice.
Winner was team 42 (bob) who ended up on a score of 0.0307, where the original model performed at 0.0404 on the hamming_loss. Congratulations! All teams had a good run and the lab sparked some great discussions on hamming loss vs. accuracy and the importance of how the preprocessing is done as it lays the fundation for prediction accuracy. NLP is a vast area to cover and it is definitely not the last time we will be coding on this subject.
See you next time!
This event was sponsered by NordAxon, a company focusing on locally produced AI and Machine Learning in the Øresund region. www.nordaxon.com
- NLP, workshop, tagging