Tweet Segmentation using Correlation and Association

Mr. Umesh A. Patil; Miss. Madhuri M. Pisotre; Miss. Snehal D. Gouraje; Miss. Ashwini P. Patil

doi:10.32628/IJSRST1733138

Authors

Mr. Umesh A. Patil HOD, Department of Computer Science & Engineering, D.Y.Patil Technical Campus, Talsande, Kolhapur, India.
Miss. Madhuri M. Pisotre Student, BE (CSE), D.Y.Patil Technical Campus, Talsande, Kolhapur, IndiaFaculty of Environmental Studies, Universiti Putra Malaysia, UPM Serdang, Selangor Darul Ehsan
Miss. Snehal D. Gouraje Student, BE (CSE), D.Y.Patil Technical Campus, Talsande, Kolhapur, IndiaFaculty of Environmental Studies, Universiti Putra Malaysia, UPM Serdang, Selangor Darul Ehsan
Miss. Ashwini P. Patil Student, BE (CSE), D.Y.Patil Technical Campus, Talsande, Kolhapur, IndiaFaculty of Environmental Studies, Universiti Putra Malaysia, UPM Serdang, Selangor Darul Ehsan

Keywords:

Tweet dataset, Tweet segmentation, Microsoft N-gram, Correlation and Association

Abstract

Twitter is an online social network used by millions people. It used to provide a way to collect and understand userâ€™s opinion about much private and public organization. Twitter has become one of the most important communication channels with it's achieve to providing the most up-to-date information to the user. In this paper we present to find the correlation of two words using the association rule. There must be an application to establish the mutual relationship between two words or sentences or segment. In the first step we collecting tweets are editable group of tweets hand selected by twitter user. These collected tweets are pre-processing in which stop words removed and then tweet segmentation. The form of generalized association rules, from messages posted by twitter users. The analysis of twitter post is focused on two different but related features: their textual content and their submission content. Due to itâ€™s in valuable business value of timely information from these tweets, it is imperative to understand tweets language for a large body of downstream application, such as true named entity.

References

D. Downey, M. Brodhead, and O. Etzioni. Locating complex named entities in web text. In Proc. of IJCAI, 2007.
T. Finin, W. Murnane, A. Karandikar, N. Keller, J. Martineau, and M. Dredze. Annotating named entities in twitter data with crowd sourcing. In Proc. of the Workshop on Creating Speech and Language Data With Mechanical Turkat NAACL-HLT, 2010.
K. Gimpel, N. Schneider, B. O’Connor, D. Das, D. Mills, J. Eisenstein, M. Heilman, D. Yogatama, J. Flannigan, and N. A. Smith. Part-of-speech tagging for twitter: Annotation, features, and experiments. In Proc. of ACL, 2011.
B. Han and T. Baldwin. Lexical normalization of short text messages: Maknsens a #twitter. In Proc. of ACL, 2011.
X. Liu, S. Zhang, F. Wei, and M. Zhou. Recognizing name identities in tweets. In Proc. of ACL, 2011.
A. Ritter, S. Clark, Mausam, and O. Etzioni. Named entity recognition in tweets: An experimental study. In Proc. Of EMNLP, 2011.
R. Agrawal, T. Imielinski, and A.N. Swami. Mining association rules between sets of items in large databases. In SIGMOD Conference, pages 207–216, 1993.
W.A.V.B.D. Caragea and W.H. Hsu. Ontology-aware classification and association rule mining for interest and link prediction in social networks, 2009.

Tweet Segmentation using Correlation and Association

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite