Extracting opinion characteristics are based only on models of a single review corpus, ignoring nontrivial disparities in
the characteristics of word distribution of the characteristics of opinion across different Corpus. In this article we propose a new
method to identify the characteristics of the opinions of online journals by exploiting the difference in opinion statistics through two
corpus, a specific corpus of the domain and an independent corpus of the domain the contrasted corpus. We grasp this disparity via
a measure called domain relevance (DR), which characterizes the relevance of a term to a collection of texts. First we extract a list
of the characteristics of the candidate's opinion of the domain review corpus by defining a set of syntactic dependency rules. For
each extracted candidate feature, we then estimate the intrinsic domain (IDR) and extrinsic domain relevance (EDR) scores in the
domain-dependent and independent corpus domain, respectively. Candidate characteristics that are less generic (EDR score below a
threshold) and more specific for the domain (IDR score higher than another threshold) are confirmed as characteristics of the
opinion.
Published In:IJCSN Journal Volume 6, Issue 4
Date of Publication : August 2017
Pages : 509-511
Figures :--
Tables : --
Vishakha I. Sardar : Dept. of Computer Science and Engineering,
MGM’s Jawaharlal Nehru Engineering College
Aurangabad, India.
Saroj Date : Dept. of Computer Science and Engineering,
MGM’s Jawaharlal Nehru Engineering College
Aurangabad, India.
In this article, we propose an efficient criterion for
the technique of intrinsic and extrinsic domain
relevance for the extraction of characteristics. We
have used additional dependencies in English
grammar to extract features. The weight equation is
given to work with real-life reviews. Experimental
results showed that the proposed approach yields a
much better result than the traditional approach. It is
essential that a good independent corpus of the
domain is selected. Since this technique relies
heavily on disparities in the characteristics of
distribution characteristics of opinion, two best
thresholds should be selected according to the corpus
to improve performance.
[1] Zhen Hai, Kuiyu Chang, Jung-Jae Kim, and
Christopher C. YangV,”Identifying Features in
Opinion Mining via Intrinsic and Extrinsic
Domain Relevance,” IEEE TRANSACTIONS ON
KNOWLEDGE AND DATA ENGINEERING ,
vol. 26, no. 3,pp.623-634,March 2014.
[2] G. Qiu, C. Wang, J. Bu, K. Liu, and C. Chen,
“Incorporate the Syntactic Knowledge in Opinion
Mining in User-Generated Content,”Proc. WWW
2008 Workshop NLP Challenges in the
Information Explosion Era, 2008.
[3] D.M. Blei, A.Y. Ng, and M.I. Jordan, “Latent
Dirichlet Allocation,”International Journal of
Machine Learning Research,Vol. 3 ,pp. 993-
1022,March 2003.
[12] Jose M. Chenlo, David E. Losada, “An empirical
study of sentence features for subjectivity and
polarity classification”, Information Sciences. 280,
275-288, 2014.
[13] A.J. Viera and J.M. Garrett, “Understanding
Interobserver Agreement: The Kappa Statistic”,
Family Medicine, vol. 37, no. 5, pp. 360- 363,
2005.
[14] W.X. Zhao, J. Jiang, H. Yan, and X. Li, “Jointly
Modeling Aspects and Opinions with a Maxent-
Lda Hybrid”, Proc. Conf. Empirical Methods in
Natural Language Processing, pp. 56-65, 2010.
[15] V. Hatzivassiloglou and J.M. Wiebe, “Effects of
Adjective Orientation and Gradability on Sentence
Subjectivity”, Proc. 18th Conf. Computational
Linguistics, pp. 299-305, 2000.
[16] L. Qu, G. Ifrim, and G. Weikum, “The Bagof-
Opinions Method for Review Rating Prediction
from Sparse Text Patterns,” Proc. 23rd Int’l Conf.
Computational Linguistics, pp. 913-921, 2010.
[17] D. Bollegala, D. Weir, and J. Carroll, “Cross-
Domain Sentiment Classification Using a
Sentiment Sensitive Thesaurus,” IEEE Trans.
Knowledge and Data Eng., vol. 25, no. 8, pp.
1719-1731, Aug. 2013.
[18] P.D. Turney, “Thumbs Up or Thumbs Down?:
Semantic Orientation Applied to Unsupervised
Classification of Reviews,” Proc.40th Ann.
Meeting on Assoc. for Computational Linguistics,
pp. 417- 424, 2002.
[19] C. Zhang, D. Zeng, J. Li, F.-Y. Wang, and W.
Zuo, “Sentiment Analysis of Chinese Documents:
From Sentence to Document Level,” J. Am. Soc.
Information Science and Technology, vol. 60, no.
12, pp. 2474-2487, Dec. 2009.
[20] A.L. Maas, R.E. Daly, P.T. Pham, D. Huang, A.Y.
Ng, and C. Potts, “Learning Word Vectors for
Sentiment Analysis,” Proc. 49th Ann. Meeting of
the Assoc. for Computational Linguistics: Human
Language Technologies, pp. 142-150, 2011.