Sentiment analysis is to study an opinion or emotion and attitude of respondents. Often, polarity is used in the sentiment analysis and it refers to the binary subject impression of the respondents. For instance, it can identify identify whether the reviews of the respondents are positive or negative.
Sentiment analysis still requires many of the text analytics steps such as tokenization, part of speech (POS) tagging, and word filtering. This study has used the default setting for both rule-based model and statistical model:
Under the corpus, rename the project name as “Hotel_Reviews” etc. Following, imported the reviews for the “positive” and “negative” directory respectively
It is a form of supervised learning typically performed at the document level. It generated the basic rules that can be further modified or expanded with custom rules. It is effective in predicting the overall sentiment of a document.
It is a form of unsupervised learning typically performed at the sentence level. It extract sentiment at the product & feature levels. Feature levels can reflect the sentiment on what excatly the customer like or dislike. Under the rule-based model sentiment analysis, it has two approaches, 1) Lexicon-based and 2) Syntactic-based.
Lexicon-based in relation to attributes mentioned in setence. For example, the hotel quality is "outstanding". Syntactic-based invlolved defining part-of-speech tags (nouns, verbs, adverbs etc), and conjunction words (AND, NOR, OR, NOT, BUT etc.)
There are 6 types of rule found in SAS Sentiment Analysis Studio: