What do Sentimenti tools do? Dr Jan Kocoń is a natural language engineer – he is responsible for the machine learning closed in SentiTool, our tool for analyzing emotions in the text. He coordinates the work of the linguistics team, integrates individual elements of the tool, and works closely with the IT team.

When are you going to tell someone about Sentimenti and our tools, what do you say first?

Sentimenti is a project in which we analyze emotions in the text. Unlike competitive solutions that recognize only the overtones of the text (positive, neutral, negative), our tools are able to understand the text, assign specific meanings to the words in the text and the emotions people feel about them. These emotions, in turn, provide the knowledge base for a machine learning mechanism that automatically recognizes emotions at the level of sentences and the whole text.

What does it mean that we analyse emotions in the text?

In the research carried out in the project we adapted the Plutchik model, which includes eight basic emotions: joy, sadness, trust, repulsion, expectation, fear, surprise and anger. We are able to estimate to what extent these emotions are expressed in the text.

What else makes the emotion analysis tools in the text work?

Two things come to our aid. The first one is our giant database of opinions, with associated overtones, which come from different fields: travel, medicine, products and many others. We have more than 10 million such texts, which is an excellent source of information about the general feeling of the author. However, in order to find out what emotions a given text evokes in the reader, we also conduct our own research, analogous to research on single meanings.

This time the subject of these studies is the texts. The respondents attribute basic emotions to them, exactly the same as the meanings of words. The second pillar of our tool is a combination of many machine learning methods. Experts in natural language processing provide us with tools for text analysis at the syntactic and semantic level, additionally they create rules for the analysis of meanings in context such as: negation, conjecture, weakening or strengthening of the overtones, etc. This is an additional help for automatic methods, such as deep neural networks, which are used to make the right conclusions about the emotions in the text.