International Journal of Emerging Research in Science, Engineering, and Management
Vol. 2, Issue 2, pp. 01-06, February 2026.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Guggila Yasaswini
Abburi Thriveni
Karamala Praveena
Satyam Kumar Sharma
Valkuru Suman
*D. Sudhakara
Department of CSE, Siddartha Institute of Science and Technology, Puttur, India.
*Department of CSE, Siddartha Institute of Science and Technology, Puttur, India.
Abstract: The current paper presents a web-based program that will evaluate the comments posted on YouTube on the basis of an analysis procedure that utilizes sentiment classification methods supported by contextual information gained through video transcripts. The system will first of all extract the comment, then, semantic search is performed to retrieve the most relevant transcript parts, and then the particular aspect is identified to which the user is agreeing, disagreeing, or assuming a neutral situation. A retrieval-augmented plan is implemented together with a language model, which adds the sentiment predictions with a human-readable explanation. This will increase the credibility of interpreting sophisticated remarks, especially those that may mention a particular spoken material in the video. The system aims at backing content creators, moderators, and viewers by providing a more subtle and dependable sentiment analysis.
Keywords: Sentiment Analysis, Natural Language Processing, YouTube Data API, Retrieval Augmented Generation, Naive Bayes.
References:
- F. Lovera and Y. Cardinale, “An analytics approach to extracting location and sentiment insights from classified social media data,” Decision Analytics Journal, vol. 16, p. 100624, Aug. 2025, doi: 10.1016/j.dajour.2025.100624.
- N. Shivsharan, V. Kambli, S. Dabholkar, A. Dalvi, and T. Sukali, “Evaluating machine learning models for sentiment analysis of YouTube comments and creating an accessible web application for comment analysis,” Procedia Computer Science, vol. 258, pp. 3165–3174, Jan. 2025, doi: 10.1016/j.procs.2025.04.574.
- P. Giganti, G. Errichiello, and P. M. Falcone, “Exploring public discourse on green hydrogen via YouTube comments: A comparative sentiment analysis using VADER and ChatGPT,” Economic Analysis and Policy, vol. 88, pp. 2012–2030, Nov. 2025, doi: 10.1016/j.eap.2025.11.014.
- J. Ma, N. Sakib, F. I. Anik, and S. I. Ahamed, “Time-synchronized sentiment labeling via autonomous online comments data mining: A multimodal information fusion on large-scale multimedia data,” Big Data Research, vol. 41, p. 100552, Jun. 2025, doi: 10.1016/j.bdr.2025.100552.
- Md. S. Islam, M. Ferdusi, and T. T. Aurpa, “Words of War: A hybrid BERT-CNN approach for topic-wise sentiment analysis on The Russia-Ukraine War,” Expert Systems With Applications, vol. 284, p. 127759, Apr. 2025, doi: 10.1016/j.eswa.2025.127759.
- E. Supriyadi and P. N. Makatita, “Sentiment analysis of TikTok user comments on QRIS adoption in Indonesia using IndoBERT,” Procedia Computer Science, vol. 269, pp. 121–130, Jan. 2025, doi: 10.1016/j.procs.2025.08.265.
- L. M. Gandy, L. V. Ivanitskaya, L. L. Bacon, and R. Bizri-Baryak, “Public health discussions on social media: Evaluating Automated sentiment Analysis methods,” JMIR Formative Research, vol. 9, p. e57395, Sep. 2024, doi: 10.2196/57395.
- P. Gamette and S. A. Asongu, “From screens to grids: A web scrapping analysis of electricity access in sub-Saharan Africa,” The Electricity Journal, vol. 38, no. 2, p. 107472, May 2025, doi: 10.1016/j.tej.2025.107472.
- J. Oh and J. Kim, “YouTube as a social listening tool: Mining housing discourse on YouTube,” Cities, vol. 166, p. 106279, Jul. 2025, doi: 10.1016/j.cities.2025.106279.
- S. In and S. Chanchamnan, “Comparison of VADER and TextBlob labeling for sentiment analysis using machine learning and deep learning models: A study on generative AI user experience,” Acta Psychologica, vol. 263, p. 106268, Jan. 2026, doi: 10.1016/j.actpsy.2026.106268.
- M. Asghari, J. Nielsen, M. Gentili, N. Koizumi, and A. Elmaghraby, “Classifying comments on social media related to living kidney donation: Machine Learning training and validation study,” JMIR Medical Informatics, vol. 10, no. 11, p. e37884, Sep. 2022, doi: 10.2196/37884.
- M. Ahmad, I. Batyrshin, and G. Sidorov, “Sentiment Analysis using a Large Language Model–Based approach to detect opioids mixed with other substances via social media: Method development and validation,” JMIR Infodemiology, vol. 5, p. e70525, Jun. 2025, doi: 10.2196/70525.
