CFP last date
15 May 2024
Reseach Article

A Proposal of Weight based Similarities Hybrid Algorithm on Social Media Posts through Crowdsourcing to Achieve High Performance Recommendation

by Fayza Amreen, Md. Golam Muktadir, Tonmoy Hossain, Nazmus Sakib
International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 12 - Number 25
Year of Publication: 2019
Authors: Fayza Amreen, Md. Golam Muktadir, Tonmoy Hossain, Nazmus Sakib
10.5120/ijais2019451833

Fayza Amreen, Md. Golam Muktadir, Tonmoy Hossain, Nazmus Sakib . A Proposal of Weight based Similarities Hybrid Algorithm on Social Media Posts through Crowdsourcing to Achieve High Performance Recommendation. International Journal of Applied Information Systems. 12, 25 ( November 2019), 1-5. DOI=10.5120/ijais2019451833

@article{ 10.5120/ijais2019451833,
author = { Fayza Amreen, Md. Golam Muktadir, Tonmoy Hossain, Nazmus Sakib },
title = { A Proposal of Weight based Similarities Hybrid Algorithm on Social Media Posts through Crowdsourcing to Achieve High Performance Recommendation },
journal = { International Journal of Applied Information Systems },
issue_date = { November 2019 },
volume = { 12 },
number = { 25 },
month = { November },
year = { 2019 },
issn = { 2249-0868 },
pages = { 1-5 },
numpages = {9},
url = { https://www.ijais.org/archives/volume12/number25/1068-2019451833/ },
doi = { 10.5120/ijais2019451833 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2023-07-05T19:10:03.235132+05:30
%A Fayza Amreen
%A Md. Golam Muktadir
%A Tonmoy Hossain
%A Nazmus Sakib
%T A Proposal of Weight based Similarities Hybrid Algorithm on Social Media Posts through Crowdsourcing to Achieve High Performance Recommendation
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 12
%N 25
%P 1-5
%D 2019
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Recommendation based system on social media posts through crowd-sourcing is an ambitious task. This paper has formulated a hybrid algorithm acquaint with a new approach which is based on weight-based similarity to classify the social media posts as a positive or negative directional. In this paper, it has been proposed a scheme to use social media platforms taking crowd-source reactions and gathering information from the comments and posts by a user. The initial base post is generated by using the Raindrop algorithm where the credibility of the user account is factored as weight. The reaction from this base post can be used to determine whether the community is accepting the information of the base post or rejecting it with a negative impression. To find positively relevant comments and posts regarding the base post, the MinHash algorithm is used. Firstly, using substantial steps of Natural Language Processing (NLP) for pre-processing the data. Then the generalized MinHash algorithm is used to extract the relevant data from all the comments and the posts with the signature. Finally, Longest Common Subsequence (LCS) Algorithm is implemented to categorize the supporting most similar data, thus the post that triggered by the user will get the directional data from the relatively matched comments from the shingles.

References
  1. https://www.internetlivestats.com/twitter-statistics/. Accessed on 8th October, 2019
  2. Li, Guoliang & Wang, Jianan & Zheng, Yudian & Franklin, Michael. (2016). “Crowdsourced Data Management: A Survey”. IEEE Transactions on Knowledge and Data Engineering. 28. 1-1. 10.1109/TKDE.2016.2535242.
  3. Z. Wei, “A Raindrop Algorithm for Searching the Global Optimal Solution in Non-linear Programming”, Cornell University, 2013.
  4. J. Howe, “The Rise of Crowdsourcing,” Wired Magazine, vol. 14, n4. 6, pp. 14, 2006. [4] Brabham, Daren (2008), “Crowdsourcing as a Model for Problem S5lving: An Introduction and Cases”, Convergence: The International Journal of Research into New Media Technologies, 14 (1): 7590
  5. Antonio Ghezzi, Donata Gabelloni, Antonella Martini, Angelo N6talicchio, “Crowdsourcing: A Review and Suggestions for Future Research”, International Journal of Management Reviews, Vol. 00, 121 (2017).
  6. Kantardzic, Mehmed (2003). “Data Mining: Concepts, Models, M7thods, and Algorithms”. John Wiley & Sons. ISBN 978-0-471-22852-3.
  7. Zafarani, Reza; Abbasi, Mohammad Ali; Liu, Huan (2014). “Social Media Mining: An Introduction”.
  8. A Gelbukh, “Natural Language Processing and its Applications”, Research in Computing Science, 2010
  9. D. Hindle, M. Rooth, “Structural Ambiguity and Lexical Relations”, Computational Linguistics, 1993.
  10. Chowdhury, Gobinda G. “Natural language processing.” Annual review of information science and technology 37.1 (2003): 51-89.
  11. Tomas Mikolov, “Distributed Representations of Words and Phrases and their Compositionality”, NIPS 2013.
  12. J. Howe, “The Rise of Crowdsourcing,” Wired Magazine, vol. 14, no. 6, pp. 14, 2006.
  13. Kosub, Sven; “A note on the triangle inequality for the Jaccard distance”
  14. Ioffe, Sergey. “Improved consistent sampling, weighted minhash and l1 sketching.” 2010 IEEE International Conference on Data Mining. IEEE, 2010.
  15. Sofia Visa, Brian Ramsay, Anca Ralescu, Esther van der Knaap, “Confusion Matrix-based Feature Selection”, 22nd Midwest Artificial Intelligence and Cognitive Science Conference, Ohio, USA, 2011
  16. Ronnie Merin George and Dr. Jose Alex Mathew, “Emotion Classification Using Machine Learning and Data Preprocessing Approach on Tulu Speech Data”, IJCSMC, Vol. 5, Issue. 6, June 2016
  17. B. P. Salmon, W. Kleynhans, C. P. Schwegmann and J. C. Olivier, “Proper comparison among methods using a confusion matrix”, Geoscience and Remote Sensing (IGARSS), IEEE International Symposium, 2015
  18. L. Bergroth and H. Hakonen and T. Raita (2000). “A Survey of Longest Common Subsequence Algorithms”. SPIRE. IEEE Computer Society. 00: 3948.
  19. Wagner, Robert; Fischer, Michael (January 1974). “The string-to-string correction problem”. Journal of the ACM. 21 (1)
Index Terms

Computer Science
Information Sciences

Keywords

Crowdsourcing Data Mining Natural Language Processing (NLP) MinHash Confusion Matrix Raindrop Longest Common Subsequence (LCS) Algorithm