Review Paper of Identifying Features in Opinion Mining using Bootstrap Methodology and Naive Bayes Classification

Abstract
Authors
Keywords
Conclusion
References

Extracting opinion characteristics are based only on models of a single review corpus, ignoring nontrivial disparities in the characteristics of word distribution of the characteristics of opinion across different Corpus. In this article we propose a new method to identify the characteristics of the opinions of online journals by exploiting the difference in opinion statistics through two corpus, a specific corpus of the domain and an independent corpus of the domain the contrasted corpus. We grasp this disparity via a measure called domain relevance (DR), which characterizes the relevance of a term to a collection of texts. First we extract a list of the characteristics of the candidate's opinion of the domain review corpus by defining a set of syntactic dependency rules. For each extracted candidate feature, we then estimate the intrinsic domain (IDR) and extrinsic domain relevance (EDR) scores in the domain-dependent and independent corpus domain, respectively. Candidate characteristics that are less generic (EDR score below a threshold) and more specific for the domain (IDR score higher than another threshold) are confirmed as characteristics of the opinion.

Published In : IJCSN Journal Volume 6, Issue 4

Date of Publication : August 2017

Pages : 509-511

Figures :--

Tables : --

Vishakha I. Sardar : Dept. of Computer Science and Engineering, MGM’s Jawaharlal Nehru Engineering College Aurangabad, India.

Saroj Date : Dept. of Computer Science and Engineering, MGM’s Jawaharlal Nehru Engineering College Aurangabad, India.

Candidate Feature Extraction, Bootstrap Method, IDR Score, EDR Score, IEDR Score.

In this article, we propose an efficient criterion for the technique of intrinsic and extrinsic domain relevance for the extraction of characteristics. We have used additional dependencies in English grammar to extract features. The weight equation is given to work with real-life reviews. Experimental results showed that the proposed approach yields a much better result than the traditional approach. It is essential that a good independent corpus of the domain is selected. Since this technique relies heavily on disparities in the characteristics of distribution characteristics of opinion, two best thresholds should be selected according to the corpus to improve performance.

[1] Zhen Hai, Kuiyu Chang, Jung-Jae Kim, and Christopher C. YangV,”Identifying Features in Opinion Mining via Intrinsic and Extrinsic Domain Relevance,” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING , vol. 26, no. 3,pp.623-634,March 2014. [2] G. Qiu, C. Wang, J. Bu, K. Liu, and C. Chen, “Incorporate the Syntactic Knowledge in Opinion Mining in User-Generated Content,”Proc. WWW 2008 Workshop NLP Challenges in the Information Explosion Era, 2008. [3] D.M. Blei, A.Y. Ng, and M.I. Jordan, “Latent Dirichlet Allocation,”International Journal of Machine Learning Research,Vol. 3 ,pp. 993- 1022,March 2003. [12] Jose M. Chenlo, David E. Losada, “An empirical study of sentence features for subjectivity and polarity classification”, Information Sciences. 280, 275-288, 2014. [13] A.J. Viera and J.M. Garrett, “Understanding Interobserver Agreement: The Kappa Statistic”, Family Medicine, vol. 37, no. 5, pp. 360- 363, 2005. [14] W.X. Zhao, J. Jiang, H. Yan, and X. Li, “Jointly Modeling Aspects and Opinions with a Maxent- Lda Hybrid”, Proc. Conf. Empirical Methods in Natural Language Processing, pp. 56-65, 2010. [15] V. Hatzivassiloglou and J.M. Wiebe, “Effects of Adjective Orientation and Gradability on Sentence Subjectivity”, Proc. 18th Conf. Computational Linguistics, pp. 299-305, 2000. [16] L. Qu, G. Ifrim, and G. Weikum, “The Bagof- Opinions Method for Review Rating Prediction from Sparse Text Patterns,” Proc. 23rd Int’l Conf. Computational Linguistics, pp. 913-921, 2010. [17] D. Bollegala, D. Weir, and J. Carroll, “Cross- Domain Sentiment Classification Using a Sentiment Sensitive Thesaurus,” IEEE Trans. Knowledge and Data Eng., vol. 25, no. 8, pp. 1719-1731, Aug. 2013. [18] P.D. Turney, “Thumbs Up or Thumbs Down?: Semantic Orientation Applied to Unsupervised Classification of Reviews,” Proc.40th Ann. Meeting on Assoc. for Computational Linguistics, pp. 417- 424, 2002. [19] C. Zhang, D. Zeng, J. Li, F.-Y. Wang, and W. Zuo, “Sentiment Analysis of Chinese Documents: From Sentence to Document Level,” J. Am. Soc. Information Science and Technology, vol. 60, no. 12, pp. 2474-2487, Dec. 2009. [20] A.L. Maas, R.E. Daly, P.T. Pham, D. Huang, A.Y. Ng, and C. Potts, “Learning Word Vectors for Sentiment Analysis,” Proc. 49th Ann. Meeting of the Assoc. for Computational Linguistics: Human Language Technologies, pp. 142-150, 2011.