Motivation

At ResearchCue, we develop algorithms that answer business questions using web-aggregated knowledge. Many of these algorithms require labeled or annotated data. Anyone who has created labeled datasets by hand knows how painful and mundane the process is. This exercise is our attempt to make it a bit more convenient. I looked for existing solutions, but none of them came close to what I needed.

Our requirements

We had the following requirements:

It should be a responsive web app that works in any device's browser, so that I can keep building labeled datasets while waiting for an Uber or even lying in bed (the main motivation :)).

The interface should show one text or image at a time, and the user selects the correct label from the available options.

The chosen label, along with the displayed item (text or image), should be stored in the database, and the interface should then show the next entry without any additional input from the user. For example, my document is “@ArlaIngredients launches new UK #foodservice division, targeting £100m growth”. I want to label it “Products”, so I click the “Products” button on the interface; the label is stored in the database and the next entry is shown.

The user should be able to change the available labels at any point.

The app should make sure that an item that has already been evaluated is not shown again.

The user should be able to go back and relabel the last shown entry (to account for accidental clicks).

The user should be able to see how many data points have been marked with each label.

It should accept seed inputs to generate specific data points.
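To make the labeling requirements concrete, here is a minimal sketch of the logic behind such an interface. It is not the actual implementation: the class name `LabelingSession`, its method names, and the in-memory lists standing in for the database are all assumptions made for illustration, and seed-based data generation is left out.

```python
from collections import Counter


class LabelingSession:
    """Hypothetical in-memory stand-in for the labeling backend."""

    def __init__(self, items, labels):
        self.pending = list(items)   # items not yet labeled, in display order
        self.labels = list(labels)   # labels currently offered as buttons
        self.history = []            # (item, label) pairs, oldest first

    def current_item(self):
        """Return the item to display, or None when everything is labeled."""
        return self.pending[0] if self.pending else None

    def set_labels(self, labels):
        """Replace the available labels; earlier choices are left untouched."""
        self.labels = list(labels)

    def choose(self, label):
        """Store the label for the current item and return the next item."""
        if label not in self.labels:
            raise ValueError(f"unknown label: {label!r}")
        if not self.pending:
            raise IndexError("no items left to label")
        item = self.pending.pop(0)   # a labeled item is never shown again
        self.history.append((item, label))
        return self.current_item()

    def undo_last(self):
        """Bring the most recently labeled item back for relabeling."""
        if not self.history:
            return None
        item, _ = self.history.pop()
        self.pending.insert(0, item)
        return item

    def counts(self):
        """Number of labeled items per label."""
        return Counter(label for _, label in self.history)


# Usage with placeholder items (assumed data, not from the real dataset).
session = LabelingSession(["tweet A", "tweet B"], ["Products", "Other"])
session.choose("Products")   # stores ("tweet A", "Products"), shows "tweet B"
session.undo_last()          # accidental click: "tweet A" is shown again
session.choose("Other")      # relabel it
```

In a real deployment the `history` list would be replaced by database writes, and `choose` would be called from the button handler so that the next entry appears without any further input.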

Web Application

Here is how it looks: