Online Reviews Based on the Word Alignment Model

Authors

  • Sayali kothekar  Department of Computer & Science, GNIET, Nagpur, Maharashtra, India
  • Kalpana Malape  Department of Computer & Science, GNIET, Nagpur, Maharashtra, India
  • Vijaya Kamble  Department of Computer & Science, GNIET, Nagpur, Maharashtra, India

Keywords:

Opinion Mining, Opinion Targets Extraction, Opinion Words Extraction

Abstract

Mining opinion targets and opinion words from online reviews are important tasks for fine-grained opinion mining, the key component of which involves detecting opinion relations among words. To this end, this paper proposes a novel approach based on the partially-supervised alignment model, which regards identifying opinion relations as an alignment process. Then, a graph-based co-ranking algorithm is exploited to estimate the confidence of each candidate. Finally, candidates with higher confidence are extracted as opinion targets or opinion words. Compared to previous methods based on the nearest-neighbor rules, our model captures opinion relations more precisely, especially for long-span relations. Compared to syntax-based methods, our word alignment model effectively alleviates the negative effects of parsing errors when dealing with informal online texts. In particular, compared to the traditional unsupervised alignment model, the proposed model obtains better precision because of the usage of partial supervision. In addition, when estimating candidate confidence, we penalize higher-degree vertices in our graph-based co-ranking algorithm to decrease the probability of error generation. Our experimental results on three corpora with different sizes and languages show that our approach effectively outperforms state-of-the-art methods.

References

  1. M. Hu and B. Liu, “Mining and summarizing customer reviews,” in Proc. 10th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, Seattle, WA, USA, 2004, pp. 168–177.
  2. F. Li, S. J. Pan, O. Jin, Q. Yang, and X. Zhu, “Cross-domain coextraction of sentiment and topic lexicons,” in Proc. 50th Annu. Meeting Assoc. Comput. Linguistics, Jeju, Korea, 2012, pp. 410–419.
  3. L. Zhang, B. Liu, S. H. Lim, and E. O’Brien-Strain, “Extracting and ranking product features in opinion documents,” in Proc. 23th Int. Conf. Comput. Linguistics, Beijing, China, 2010, pp. 1462–1470.
  4. target extraction using wordbased translation model,” in Proc. Joint Conf. Empirical Methods Natural Lang. Process. Comput. Natural Lang. Learn., Jeju, Korea, Jul. 2012, pp. 1346–1356.
  5. M. Hu and B. Liu, “Mining opinion features in customer reviews,” in Proc. 19th Nat. Conf. Artif. Intell., San Jose, CA, USA, 2004, pp. 755–760.
  6. A.-M. Popescu and O. Etzioni, “Extracting product features and opinions from reviews,” in Proc. Conf. Human Lang. Technol. Empirical Methods Natural Lang. Process., Vancouver, BC, Canada, 2005, pp. 339–346.
  7. G. Qiu, L. Bing, J. Bu, and C. Chen, “Opinion word expansion and target extraction through double propagation,” Comput. Linguistics, vol. 37, no. 1, pp. 9–27, 2011.
  8. B. Wang and H. Wang, “Bootstrapping both product features and opinion words from chinese customer reviews with crossinducing,” in Proc. 3rd Int. Joint Conf. Natural Lang. Process., Hyderabad, India, 2008, pp. 289–295.
  9. B. Liu, Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, series Data-Centric Systems and Applications. New York, NY, USA: Springer, 2007.
  10. G. Qiu, B. Liu, J. Bu, and C. Che, “Expanding domain sentiment lexicon through double propagation,” in Proc. 21st Int. Jont Conf. Artif. Intell., Pasadena, CA, USA, 2009, pp. 1199–1204.
  11. R. C. Moore, “A discriminative framework for bilingual word alignment,” in Proc. Conf. Human Lang. Technol. Empirical Methods Natural Lang. Process., Vancouver, BC, Canada, 2005, pp. 81–88.
  12. X. Ding, B. Liu, and P. S. Yu, “A holistic lexicon-based approach to opinion mining,” in Proc. Conf. Web Search Web Data Mining, 2008 , pp. 231–240.
  13. F. Li, C. Han, M. Huang, X. Zhu, Y. Xia, S. Zhang, and H. Yu, “Structure-aware review mining and summarization.” in Proc. 23 th Int. Conf. Comput. Linguistics, Beijing, China, 2010, pp. 653–661.
  14. Y. Wu, Q. Zhang, X. Huang, and L. Wu, “Phrase dependency parsing for opinion mining,” in Proc. Conf. Empirical Methods Natural Lang. Process., Singapore, 2009, pp. 1533–1541.
  15. T. Ma and X. Wan, “Opinion target extraction in chinese news comments.” in Proc. 23th Int. Conf. Comput. Linguistics, Beijing, China, 2010, pp. 782–790.
  16. Q. Zhang, Y. Wu, T. Li, M. Ogihara, J. Johnson, and X. Huang, “Mining product reviews based on shallow dependency parsing,” in Proc. 32nd Int. ACM SIGIR Conf. Res. Develop. Inf. Retrieval, Boston, MA, USA, 2009, pp. 726–727.
  17. W. Jin and H. H. Huang, “A novel lexicalized HMM-based learning framework for web opinion mining,” in Proc. Int. Conf. Mach. Learn., Montreal, QC, Canada, 2009, pp. 465–472.
  18. J. M. Kleinberg, “Authoritative sources in a hyperlinked environment,” J. ACM, vol. 46, no. 5, pp. 604–632, Sep. 1999.
  19. Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai, “Topic sentiment mixture: modeling facets and opinions in weblogs,” in Proc. 16 th Int. Conf. World Wide Web, 2007, pp. 171–180.
  20. I. Titov and R. McDonald, “A joint model of text and aspect ratings for sentiment summarization,” in Proc. 46th Annu. Meeting Assoc. Comput. Linguistics, Columbus, OH, USA, 2008, pp. 308–316.
  21. W. X. Zhao, J. Jiang, H. Yan, and X. Li, “Jointly modeling aspects and opinions with a MaxEnt-LDA hybrid,” in Proc. Conf. Empirical Methods Natural Lang. Process., Cambridge, MA, USA, 2010 , pp. 56–65.
  22. A. Mukherjee and B. Liu, “Modeling review comments,” in Proc. 50th Annu. Meeting Assoc. Comput. Linguistics, Jeju, Korea, Jul. 2012, pp. 320–329.
  23. P. F. Brown, V. J. D. Pietra, S. A. D. Pietra, and R. L. Mercer, “The mathematics of statistical machine translation: Parameter estimation,” Comput. Linguist., vol. 19, no. 2, pp. 263–311, Jun. 1993.
  24. Z. Liu, H. Wang, H. Wu, and S. Li, “Collocation extraction using monolingual word alignment method,” in Proc. Conf. Empirical Methods Natural Lang. Process., Singapore, 2009, pp. 487–495.
  25. Z. Liu, X. Chen, and M. Sun, “A simple word trigger method for social tag suggestion,” in Proc. Conf. Empirical Methods Natural Lang. Process., Edinburgh, U.K., 2011, pp. 1577–1588.
  26. Q. Gao, N. Bach, and S. Vogel, “A semi-supervised word alignment algorithm with partial manual alignments,” in Proc. Joint Fifth Workshop Statist. Mach. Translation MetricsMATR, Uppsala, Sweden, Jul. 2010, pp. 1–10.
  27. S. Baluja, R. Seth, D. Sivakumar, Y. Jing, J. Yagnik, S. Kumar, D. Ravichandran, and M. Aly, “Video suggestion and discovery for youtube: taking random walks through the view graph,” in Proc. 17th Int. Conf. World Wide Web, Beijing, China, 2008, pp. 895–904.
  28. P. P. Talukdar, J. Reisinger, M. Pasca, D. Ravichandran, R. Bhagat, and F. Pereira, “Weakly-supervised acquisition of labeled class instances using graph random walks,” in Proc. Conf. Empirical Methods Natural Lang. Process., Honolulu, Hawaii, 2008, pp. 582–590.
  29. K. Liu, H. L. Xu, Y. Liu, and J. Zhao, “Opinion target extraction using partially-supervised word alignment model,” in Proc. 23 rd Int. Joint Conf. Artif. Intell., Beijing, China, 2013, pp. 2134–2140.
  30. K. W. Gan and P. W. Wong, “Annotating information structures in chinese texts using hownet,” in Proc. 2nd Workshop Chin. Lang. Process.: Held Conjunction 38th Annu. Meeting Assoc. Comput. Linguistics, Hong Kong, 2000, pp. 85–92.
  31. Z. Hai, K. Chang, J.-J. Kim, and C. C. Yang, “Identifying features in opinion mining via intrinsic and extrinsic domain relevance,” IEEE Trans. Knowledge Data Eng., vol. 26, no. 3, p. 623–634, 2014.
  32. Z.-H. Zhou and M. Li, “Semi-supervised regression with cotraining,” in Proc. 15th Int. Joint Conf. Artif. Intell., Edinburgh, Scotland, U.K.
  33. J. Zhu, H. Wang, B. K. Tsou, and M. Zhu, “Multi-aspect opinion polling from textual reviews,” in Proc. 18th ACM Conf. Inf. Knowl. Manage., Hong Kong, 2009, pp. 1799–1802.

Downloads

Published

2017-02-28

Issue

Section

Research Articles

How to Cite

[1]
Sayali kothekar, Kalpana Malape, Vijaya Kamble, " Online Reviews Based on the Word Alignment Model, International Journal of Scientific Research in Science and Technology(IJSRST), Online ISSN : 2395-602X, Print ISSN : 2395-6011, Volume 3, Issue 2, pp.222-230, January-February-2017.