Sarcasm Detection on Reviews on Promotional Videos of Luxury Brands: A Deep Learning Approach
Anil Kumar TV
LEAD College of Management
Palakkad, Kerala, India
anilthekkat@lead.ac.in
Haripriya V
School of CS & IT, Jain (Deemed-to-be University)
Bangalore, Karnataka, India
haripriyav07@gmail.com
Abstract - Effective customer relationship and brand strategy require systematic analysis of customer reviews but sarcasm in reviews often make it difficult. Sarcasm is a form of experession where the intended meaning is the opposite of what is actually said in order to mock or insult someone. The presence of sarcasm in a sentence can completely change the meaning of that sentence. Sarcasm is considered to be an important aspect, especially where there is no face-to-face conversation. Customers show different attitudes towards luxury brands as they have unique characteristics. Detection of sarcasm is the main aim of this work in customer reviews related to luxury brand products. Recurrent Neural Network (RNN), a deep learning algorithm along with Long Short Term Memory(LSTM) units is used in the proposed work to classify reviews into sarcastic and non-sarcastic. This study proposes a method with the help of RNN-LSTM algorithm, which could be handy for doing sarcasm detection in industry level. The result of the study shows that the rate of sarcastic comments is high inthe context of luxury brands. The outcome of the study proposes recommendations to brand managers who practice customer review analysis for making branding strategies.
Keywords - Luxury, Sarcasm, Branding, Customer Reviews, Deep Learning, Recurrent Neural Network
Introduction
In the advancement of technologies, businesses are mostly focused on persisting competitive in the market. Internet has a major effect on any business operations by affecting the value activities by themselves or allows to acquire competitive advantages by utilizing changes in competitive scope. A specific amount of creativity and ingenuity needed for any businesses to convert these prospects to customers. Due to the prevalence of social media, businesses use this technology for promoting their products at a high rate in the recent past. The other benefit of using social media is to get to know the opinion of customers from their reviews written on social media websites. Consumer opinions plays a vital role in building brand reputation and making better brand strategy. Promotional contents in social network sites like YouTube allow their followers to express their sentiments about the products by writing comments, and these sentiments can be analyzed to identify the opinion about the products. Sentimental analysis results on customer reviews are beneficial for understanding the approach of customer towards the brand. However, this kind of review analysis is facing a big problem with sarcasm in customer reviews. The intended meaning and the actual meaning of the sentence gives a gap in sentiment analysis thereby the process of sarcasm detection becomes the most challenging task.
Companies have to be competitive enough to be sustainable in the market. One way of sketching the growth and weakness of their competitors are getting connected with the social media sites to collect customer's reviews and make better branding strategy using opinions. Therefore, numerous research works have been done on sentiment analysis, and the comments were classified as positive, negative and neutral. The negative sentiments of the consumers affect brand reputation. Customer reviews about a company or a product reveal the perception of customers in terms of quality of the product, service, value and performance. Among these results, the presence of sarcasm in the comment is a serious issue which has not been considered in most of the research, especially in the context of luxury.
Luxury brands have many unique characteristics which lead to different customer attitude towards these brands. People use luxury products to show prestige and status in the society. Intangible and situational utility are large concerns in terms of luxury products(Wiedmann et al., 2007; Nuenoand Quelch, 1998). Therefore, customers' opinion plays a major role in decision making, especially when it comes to luxury context. Since sarcasm is very prevalent in customers' reviews related to luxury products, sentimental analysis alone is not sufficient for decision making. However, sentimental analysis is a well-researched area, but sarcasm detection is little researched especially for luxury products. Therefore, separate studies in luxury context are required for customer review-based studies. It is essential to understand the level of sarcasm in the reviews along with performing sentimental analysis. To understand this phenomenon, the proposed study selected promotional videos of three popular luxury brands in different seasons for sarcasm detection. Generally, huge data sets will be considered for this kind of analysis for better accuracy. Doing analysis manually would be a hectic and time-consuming job for analysts. Therefore this multi-disciplinary research proposes a methodology with the support of computer application.
Literature Review
Companies can learn customers' opinion through social media sites by understanding the conversation, thereby generate new ideas to improve market strategy (Cvijikj and Michahelles, 2011; Palmer and Koenig, 2009). The information can be used to analyze the sentiments of customers; thereby, it helps the companies to gain an advantage against their competitors(Deyet al., 2011; Hu et al., 2017) suggested that when the customer gives reviews about the brand, the sentiments expressed by them reveals the actual opinion towards the brands. Sentiments of the customers about the brand pave the way for brand managers to realize their faults and implement new trends or services. Sentiment analysis is the process of identifying the opinion or the sentiments of customers. This technique plays a vital role in social media research. Analyzing a large amount of data getting generated in the e-commerce sites and social media sites is a difficult task to classify sentiments on reviews.
Various studies have been done on different fields such as cosmetics, healthcare, electronics and movies to find out the sentiments of customers. A study(Mostafa, 2013)has made a comparison to identify the brand sentiments of the brands such as Nokia, IBM, Pfizer and DHL considering tweets as the datasets. Cosmetics and healthcare industries have been carried out (Isah et al., 2014) to analyze the consumer opinions on social media for three brands in cosmetics and drugs such as 'Oral B', 'Dove' and 'Avon'. The main aim of the study was to predict the product counterfeiting, adverse effect and consumers sentiments for brands by analyzing data from social media. Industry-specific lexicon and general English based lexicon were used to get the sentiment score of consumer reviews of these brands. Various machine learning approaches namely Support Vector Machine, Naïve Bayes and Maximum Entropy were implemented to obtain the sentiments for the test dataset. The result shows that all three considered brands were positively skewed. Brand managers can extract knowledge or potential values from the data getting generated on social media. Fecthing the answer for questions related to brand management is a tedious task. As a solution a framework has been introduced that acquire latent brand topics andusing text mining and sentiment analysisit classifies the texts as brand sentiments (Liu et al., 2017).
The deep learning approach is widely used by many researchers for sarcasm detection(Kumar and Garg, 2019).To extract sentiment, emotions and personality characteristics for detecting sarcasm, a model has developed by authors(Gers et al., 1999) based on a pre-trained convolutional neural network. Many researchers addressed the most ignored generalizability issues of data classification, which has not been solved by the models at the learning phase. sAtt-BLSTM convNet model has been proposed in a study(Kumar et al.,2019)for constructing semantic word embeddings based on a hybrid model using Soft attention-based bidirectional long short-term memory and CNN by applying GLoVe(global vectors for word representation. To compare the accuracy of the proposed deep neural model with the Convolutional Network, LSTM and Bidirectional LSTM, an experimental study has been performed with the training and test dataset. The result shows that the proposed model outperforms other models with an accuracy of 97.87 per cent and 93.71 per cent for twitter dataset and random-tweet dataset respectively. A model has been built in a study (Ghosh and Veale, 2016) by merging convolutional network with RNN (Recurrent Neural Network and Long short term memory) along with the deep neural network. The output represents that the proposed approach gives more improvement for deep learning framework when compared to recursive Support Vector machine. Another study(Zhang et al., 2016) has been conducted to capture the semantic and syntactic information from the tweets by using bidirectional LSTM and in order to extract contextual features pooling neural network has been used from past tweets. The result represents an improvement when compared the neural features with discrete manual features.
Though many organizations are investing a huge amount of money and time on social media data to find customer sentiments for developing new brand strategies and promotional studies, there are very less research has been done on sarcasm detection on luxury brand data. Understanding the importance of sarcasm detection, especially for luxury product reviews, is not prevalent among researchers. This research study fills this gap by analyzing the customer reviews on selected promotional videos of three brands, namely Zara, Armani and Gucci. The promotional videos of various seasons have been taken for analyzing sentiments of customers and examined the presence of sarcasm in positive comments. The outcome of the study aids brand managers to understand the attitude towards brands' promotional videos, thereby improve promotional strategies for effective branding. Though many researches have been done on sarcasm detection on social media data, detection of sarcasm has not been considered on reviews on luxury brand. To fill this gap, the proposed work has taken reviews of luxury product promotional videos to understand the level of sarcasm.
Methodology
As this is interdisciplinary research, the researchers introduce a methodology which could be handy to the marketers. The main objective of the proposed methodology is to classify the sentiments of the customers into sarcastic and non-sarcastic comments. In this study, the reviews have been collected from YouTube. These reviews are subjected to deep learning approach to find out the amount of sarcasm.
Data Collection
Three luxury brands have been considered for the data collection as the availability of data is more in Armani, Zara and Gucci. Promotional videos of various seasons for the years 2017 to 2018 have been considered for the data collection. Various seasons data have been taken for the classification. Spring-Summer campaign, Falls winter and Spring summer and Autumn winter reviews of Armani, Gucci and Zara respectively. YouTube comment scrapper was used to extract the data and the resultant data is in the .csv format. Lexicon based approach has been used to perform sentiment analysis bythe authors(Haripriya andPatil, 2018)and the results is represented in Table1.Number of positive, negative and neutral comments wereclassified.The positive comments from the resultant dataset were used in this proposed work for sarcasm detection.
Table 1: Report of Sentiment Analysis: 2017-2018
Source: Haripriya et al. (2018)
Data Collection
The collected data is divided into two sets namely Training and Testing Dataset.The dataset has been imported using train_test_split from sklearn.Model_selection and the variables X and Y are used for assigning the data and the target labels. To use the assigned variables X and Y, it needs to flatten the array of target labels as input for the train_test_split() function.
Algorithms Used
Deep learning approach known as Recurrent Neural Network algorithm along with LSTM (Long Short Term Memory) units are used to perform sarcasm detection and Python with various packages has been used for implementation.
RNN-LSTM Model for Implementation
The LSTM Sequential model that consists of 4 layers. The first layer is the Embedded layer where each word is mapped into a real-valued vector of size 32. Number of words varies in each tweets, so each tweet will be constrained to be 1000 words, compression of long tweetsand pad shorter tweets with zero values. LSTM layer with 196 smart neurons (memory units) are the next layer. The dense output layer is used in the work since it is a classification problem. Softmax is used as an activation function, the reason being that the network is using categorical cross-entropy, and softmax is the right activation function.
Word embedding technique has been used to map each tweets into a real vector domain. Words are encoded as real-valued vectors where the words with similar meaning interpret to closeness within the vector area. Embedding layer present in Keras paves the way for converting positive integerdepiction of words into word embedding.
Results and Discussion
Results of Armani
The model was trained with the dataset and the accuracy obtained was 0.93 with 12 epochs as shown in Figure.1, and a graph of loss vs epoch was plotted. It was found that as the number of epochs increased, the accuracy of the model increased and the loss of the model decreased, which is depicted in Figure 2 and Figure 3.
Fig. 1. Training the model
Fig. 2. Accuracy vs epoch graph
After training, the model was validated with an unfamiliar dataset and gave an accuracy of 1.00 as shown in Figure 4.
Fig. 3. Loss vs epoch graph
Fig. 4. Validation accuracy
After validation of the model, the sarcasm and non-sarcasm accuracy that was obtained were 100 per cent and 70 per cent respectively. Figure.5 is the final result of this proposed work.
Fig. 5. Classification accuracy
Results of Zara
The model was trained with the dataset, and the accuracy obtained was 0.89, with 12 epochs, as shown in Figure.6 and a graph of accuracy vs epoch were plotted. It was found that as the number of epochs increased, the accuracy of the model increased as well as shown in Figure 7.
Fig. 6. Training the model
Fig. 7. Accuracy vs epoch graph
A graph of loss vs epoch was plotted. It was found that as the number of epochs increased, the loss of the model decreased as well as shown in Figure.8 and After training the model was validated with an unfamiliar dataset and gave an accuracy of 0.67, as shown in Figure.9.
Fig. 8. Loss vs epoch graph
Fig. 9. Validation accuracy
After validation of the model, the sarcasm and non-sarcasm accuracy that was obtained was 55.5 per cent and 83.3 per cent respectively. Figure 10 depicts the final result of this proposed work.
Fig. 10. Classification accuracy
Results of Gucci
The model was trained with the dataset, and the accuracy obtained was 0.87, with 12 epochs, as shown in Figure.11 and a graph of accuracy vs epoch were plotted. It was found that as the number of epochs increased, the accuracy of the model increased as well as shown in Figure 12.
Fig. 11. Training the model
Fig. 12. Accuracy vs epoch graph
A graph of loss vs epoch was plotted. It was found that as the number of epochs increased, the loss of the model decreased as well as shown in Figure.13 and After training the model was validated with an unfamiliar dataset and gave an accuracy of 0.68 as shown in Figure.14.
Fig. 13. Loss vs epoch graph
Fig. 14. Loss vs epoch graph
After validation of the model, the sarcasm and non-sarcasm accuracy that was obtained was 84 per cent and 0 per cent respectively. Figure 15 represents the final result of this proposed work.
Fig. 15. Classification accuracy
Implications
This multi-disciplinary study contributes to methodological and managerial implications by suggesting a mechanism for identifying and analyzing customer opinion and showing the level of sarcasm in positive reviews. Analyzing the sentiment, specifically sarcastic comments of consumers, helps to find out the satisfaction level and customers behaviour towards the brands and the products. This helps marketers to create appropriate marketing strategies to establish a healthy relationship with the customers. The result, level of sarcasm in the luxury context, is alerting brand managers that promotional strategies have to be built by considering the fact that many of the audience of the promotional videos are not actually happy with promotional videos. This kind of reviews from the group could create a negative outcome. As the targeted audience of social media is uncontrollable, this could be a challenging task for brand managers and marketers.
The result of a high level of sarcasm in this context is giving a hint to managers to consider this while going for sentimental analysis. This study further gives a mechanism to solve this issue with the help of Recurrent Neural Network algorithm along with LSTM. The results obtained from sarcasm detection will be helpful in areas such as sentiment analysis, natural language processing, opinion mining, business-related decision making or analytics etc.
Conclusion
A brand is a key element that ensures the uniqueness of a business as far as identity is concerned. Customer's opinions are the major factor which affects the sales of luxury brand products. This study has identified the sarcasm in the comments by passing the opinion of customers into deep learning algorithm and the results obtained are satisfactory. By understanding the trend of customer opinion regarding luxury brands' promotional videos will be useful for developing a better branding strategy. Future researchers may consider demographic variables and personality characteristics for obtaining a more accurate result. Moreover, future work can be done on various domain dataset as well as Multi-language social media data. Emojis also can be included along with text data for sarcasm detection.
References
Cvijikj IP, Michahelles F. (2011, Sep 28-30) Understanding social media marketing: a case study on topics, categories and sentiment on a Facebook brand page. In Proceedings of the 15th International Academic MindTrek Conference: Envisioning Future Media Environments, Tampere, Finland.
Dey L, Haque SM, Khurdiya A, Shroff G. (2011, Sep 17) Acquiring competitive intelligence from social media.In Proceedings of the 2011 joint workshop on multilingual OCR and analytics for noisy unstructured text data, Beijing, China.
Gers FA, Schmidhuber J, Cummins F. (1999, Aug ) Learning to forget: Continual prediction with LSTM. ICANN'99-9th International Conference on Artificial Neural Networks. Edinburgh, UK.
Ghosh A, Veale T. (2016 June 16) Fracking sarcasm using neural network.7th workshop on computational approaches to subjectivity, sentiment and social media analysis. San Diego, California.
Haripriya V, Patil PG. (2018 Dec 21-22) Detection of Sarcasm from Consumer Sentiments on Social Media About Luxury Brands. In International Conference on Recent Trends in Image Processing and Pattern Recognition, Solapur, India.
Hu G, Bhargava P, Fuhrmann S, Ellinger S, Spasojevic N. (2017, Nov 18-21) Analyzing users' sentiment towards popular consumer industries and brands on Twitter. IEEE International Conference on Data Mining Workshops (ICDMW), Louisiana, United.
Isah H, Trundle P, Neagu D. (2014, Sep 8-10) Social media analysis for product safety using text mining and sentiment analysis. 14th UK workshop on computational intelligence (UKCI), Bradford, UK.
Kumar A, Garg G. (2019) Empirical study of shallow and deep learning models for sarcasm detection using context in benchmark datasets. Journal of Ambient Intelligence and Humanized Computing. pp. (1-6).doi: 10.1007/s12652-019-01419-7
Kumar, A., Sangwan, S. R., Arora, A., Nayyar, A., & Abdel-Basset, M. (2019).Sarcasm detection using soft attention-based bidirectional long short-term memory model with convolution network. IEEE Access, 7, 23319-23328.
Liu X, Burns AC, Hou Y. (2017) An investigation of brand-related user-generated content on Twitter. Journal of Advertising, 46(2), pp. (236-47).
Mostafa MM. (2013) More than words: Social networks' text mining for consumer brand sentiments. Expert Systems with Applications, 40(10), pp. (4241-51).
Nueno JL, Quelch JA. (1998). The mass marketing of luxury. Business Horizons, 41(6), pp. (61-65).
Palmer A, Koenig Lewis N. (2009) An experiential, social network based approach to direct marketing. Direct Marketing: An International Journal. 3(3), pp. (162-176).
Wiedmann, K. P., Hennigs, N., &Siebels, A. (2007).Measuring consumers' luxury value perception: a cross-cultural framework. Academy of Marketing Science Review, 1, pp. (1-21).
Zhang M, Zhang Y, Fu G. (2016 Dec 11-16) Tweet sarcasm detection using deep neural network.26th International Conference on Computational Linguistics, Osaka, JapanWiedmann, K. P., Hennigs, N., &Siebels, A. (2007).Measuring consumers' luxury value perception: a cross-cultural framework.Academy of Marketing Science Review, 1, pp. (1-21).