Machine Learning Techniques, Features, Datasets, and Algorithm Performance Parameters for Sentiment Analysis: a Systematic Review

dc.contributor.author: Ondara, Bernard
dc.contributor.author: Waithaka, Stephen
dc.contributor.author: Kandiri, John
dc.date.accessioned: 2023-05-23T06:47:50Z
dc.date.available: 2023-05-23T06:47:50Z
dc.date.issued: 2022
dc.description: Article [en_US]
dc.description.abstract: The purpose of this paper is to review various studies on current machine learning techniques used in sentiment analysis with the primary focus on finding the most suitable combinations of the techniques, datasets, data features, and algorithm performance parameters used in most applications. To accomplish this, we performed a systematic review of 24 articles published between 2013 and 2020 covering machine learning techniques for sentiment analysis. The review shows that Support Vector Machine as well as Naïve Bayes techniques are the most popular machine learning techniques; word stem and n-grams are the most extensively applied features, and the Twitter dataset is the most predominant. This review further revealed that machine learning algorithms' performance depends on many factors, including the dataset, extracted features, and size of data used. Accuracy is the most commonly used algorithm performance metric. These findings offer important information for researchers and businesses to use when selecting suitable techniques, features, and datasets for sentiment analysis for various business applications such as brand reputation monitoring. [en_US]
dc.identifier.citation: Ondara, B., Waithaka, S., Kandiri, J., & Muchemi, L. (2022). Machine Learning Techniques, Features, Datasets, and Algorithm Performance Parameters for Sentiment Analysis: A Systematic Review. Open Journal for Information Technology, 5(1), 1. [en_US]
dc.identifier.uri: https://doi.org/10.32591/coas.ojit.0501.01001o
dc.identifier.uri: http://ir-library.ku.ac.ke/handle/123456789/25389
dc.language.iso: en [en_US]
dc.publisher: COAS [en_US]
dc.subject: sentiment analysis [en_US]
dc.subject: machine learning technique [en_US]
dc.subject: machine learning algorithm [en_US]
dc.subject: sentiment classification technique [en_US]
dc.subject: sentiment classification algorithm [en_US]
dc.title: Machine Learning Techniques, Features, Datasets, and Algorithm Performance Parameters for Sentiment Analysis: a Systematic Review [en_US]
dc.type: Article [en_US]
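
Illustrative note: the abstract names word n-grams as a widely applied feature and accuracy as the most commonly used performance metric. The sketch below is only a minimal illustration of those two ideas, not code from the article. The function names `word_ngrams` and `accuracy`, the example sentence, and the label lists are all hypothetical, and whitespace tokenization is an assumed simplification.

```python
from collections import Counter


def word_ngrams(text, n):
    """Return the word n-grams (as tuples) of a lowercased, whitespace-split text."""
    tokens = text.lower().split()
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def accuracy(y_true, y_pred):
    """Return the fraction of predicted labels that equal the true labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical example data, not taken from the reviewed studies.
features = Counter(word_ngrams("the service was not good", 2))
gold = ["neg", "pos", "neg", "pos"]
predicted = ["neg", "pos", "pos", "pos"]
score = accuracy(gold, predicted)
```

Bigrams such as `("not", "good")` keep negation next to the word it modifies, which single-word features would lose. Accuracy alone can be misleading when the sentiment classes are imbalanced.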
Files
Original bundle
Name: Machine Learning Techniques, Features, Datasets, and Algorithm Performance Parameters for Sentiment Analysis, a Systematic Review.pdf
Size: 234.39 KB
Format: Adobe Portable Document Format
Description: Full text Article