以深度學習對包含長文之資料集進行情感分析

No Thumbnail Available

Date

2022

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

隨著網際網路的蓬勃發展,越來越多的訊息在社群網站、線上購物網站、論壇等各種網路平台間傳遞,而這一些訊息可能都表達了人們的看法或是評價。但是只依靠人力來觀察如此龐大的資訊量是非常沒有效率的,因此如何讓電腦得以代替人類完成這一項工作量龐大的任務是必要的。自然語言處理(Natural Language Processing,NLP)是一種讓電腦可以理解人類語言的技術,而情感分析(Sentiment Analysis)則是NLP其中的一項常見應用。它能夠了解字句間所表達的情緒,比如分析網路上對於某些產品、名人、事件等事物的評論立場為何,像是有好感還是持有負面態度。本實驗使用含有長文的IMDB資料集進行情感分析,該資料集將評論分為正面和負面,並且建立深度學習模型讓它藉由評論內容判斷評論表達的情緒是正面或負面,除了基本的LSTM和BERT模型以外,本實驗還有嘗詴讓BERT合併BERT或LSTM模型,希望藉由增加模型獲得的特徵來提高準確度,並且對各種模型的實驗結果進行比較。
With the vigorous development of the Internet, more and more information is transmitted among various network platforms such as social networking sites, online shopping sites, forums, etc., and these messages may express people's opinions or evaluations. However, it is very inefficient to observe such a huge amount of information only by people. Therefore, it is necessary to allow computers to replace people to complete this huge task. Natural Language Processing (NLP) is a technology that enables computers to understand human language, and sentiment analysis is a common application in the NLP domain. Sentiment analysis aims to understand the emotions expressed between words, such as analyzing the stance of comments on certain products, celebrities, events, etc. on the Internet, and then identifying positive or negative polarity.This experiment uses the IMDB dataset containing long texts for sentiment analysis, which divides comments into positive and negative polarity. This thesis proposes a deep learning model to judge whether the sentiment expressed by the comment is positive or negative based on the content of the comment. In addition to the LSTM and BERT model, this experiment also tried to combine the BERT or LSTM model with BERT, hoping to improve the accuracy by increasing the features obtained by the model, and to compare the experimental results of different models.

Description

Keywords

自然語言處理, 情感分析, 深度學習, BERT, Natural Language Processing, Sentiment Analysis, Deep Learning, BERT

Citation

Collections

Endorsement

Review

Supplemented By

Referenced By