Tweet Segmentation using Correlation and Association
Keywords:
Tweet dataset, Tweet segmentation, Microsoft N-gram, Correlation and AssociationAbstract
Twitter is an online social network used by millions people. It used to provide a way to collect and understand user’s opinion about much private and public organization. Twitter has become one of the most important communication channels with it's achieve to providing the most up-to-date information to the user. In this paper we present to find the correlation of two words using the association rule. There must be an application to establish the mutual relationship between two words or sentences or segment. In the first step we collecting tweets are editable group of tweets hand selected by twitter user. These collected tweets are pre-processing in which stop words removed and then tweet segmentation. The form of generalized association rules, from messages posted by twitter users. The analysis of twitter post is focused on two different but related features: their textual content and their submission content. Due to it’s in valuable business value of timely information from these tweets, it is imperative to understand tweets language for a large body of downstream application, such as true named entity.
References
- D. Downey, M. Brodhead, and O. Etzioni. Locating complex named entities in web text. In Proc. of IJCAI, 2007.
- T. Finin, W. Murnane, A. Karandikar, N. Keller, J. Martineau, and M. Dredze. Annotating named entities in twitter data with crowd sourcing. In Proc. of the Workshop on Creating Speech and Language Data With Mechanical Turkat NAACL-HLT, 2010.
- K. Gimpel, N. Schneider, B. O’Connor, D. Das, D. Mills, J. Eisenstein, M. Heilman, D. Yogatama, J. Flannigan, and N. A. Smith. Part-of-speech tagging for twitter: Annotation, features, and experiments. In Proc. of ACL, 2011.
- B. Han and T. Baldwin. Lexical normalization of short text messages: Maknsens a #twitter. In Proc. of ACL, 2011.
- X. Liu, S. Zhang, F. Wei, and M. Zhou. Recognizing name identities in tweets. In Proc. of ACL, 2011.
- A. Ritter, S. Clark, Mausam, and O. Etzioni. Named entity recognition in tweets: An experimental study. In Proc. Of EMNLP, 2011.
- R. Agrawal, T. Imielinski, and A.N. Swami. Mining association rules between sets of items in large databases. In SIGMOD Conference, pages 207–216, 1993.
- W.A.V.B.D. Caragea and W.H. Hsu. Ontology-aware classification and association rule mining for interest and link prediction in social networks, 2009.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRST

This work is licensed under a Creative Commons Attribution 4.0 International License.