Hot topic identification from micro-blog based on improved Single-pass algorithm

Jian Feng; Yuanyuan Ding; Xiangyu Luo

国产bbaaaaa片,成年美女黄网站色视频免费,成年黄大片,а天堂中文最新一区二区三区,成人精品视频一区二区三区尤物

首頁> 外文期刊>Journal of Computational Methods in Sciences and Engineering >Hot topic identification from micro-blog based on improved Single-pass algorithm

【24h】

Hot topic identification from micro-blog based on improved Single-pass algorithm

機(jī)譯：基于改進(jìn)的單遍算法的微博熱點(diǎn)話題識(shí)別

獲取原文

獲取原文并翻譯 | 示例

開具論文收錄證明 >>

頁面導(dǎo)航

摘要
著錄項(xiàng)
引文網(wǎng)絡(luò)
相似文獻(xiàn)
相關(guān)主題

摘要

Hot topic identification from micro-blog is very important for detection and control of the public opinion. When using Single-pass algorithm to cluster hot topics for Chinese micro-blog, Chinese word segmentation technology is a necessary preprocessing, but it will introduce inevitable segment errors. This kind of errors will make topic identification has low clustering precision. To solve this problem, this paper proposed an improved algorithm based on Single-pass which combines CS (Cosine Similarity) and LCS (Longest Common Subsequences) to calculate the similarity between Chinese words. Experiments on three different micro-blog data sets for hot topic identification are made, and the results show that the improved algorithm has both higher recall rate and precision rate than the original ones. The proposed algorithm is feasible and effective.

機(jī)譯：微博中的熱門話題識(shí)別對(duì)于檢測(cè)和控制輿論非常重要。當(dāng)使用單次通過算法對(duì)中文微博客的熱門話題進(jìn)行聚類時(shí)，中文分詞技術(shù)是必不可少的預(yù)處理程序，但是它會(huì)不可避免地引入分段錯(cuò)誤。這種錯(cuò)誤會(huì)使主題識(shí)別的聚類精度降低。為了解決這個(gè)問題，本文提出了一種基于單遍的改進(jìn)算法，該算法結(jié)合了余弦相似度和最長(zhǎng)公共子序列，計(jì)算了漢字之間的相似度。對(duì)三種不同的微博數(shù)據(jù)進(jìn)行熱點(diǎn)識(shí)別實(shí)驗(yàn)，結(jié)果表明，改進(jìn)算法比原始算法具有更高的查全率和查準(zhǔn)率。該算法是可行和有效的。

著錄項(xiàng)

來源
《Journal of Computational Methods in Sciences and Engineering》 |2017年第4期|791-798|共8頁
作者
Jian Feng; Yuanyuan Ding; Xiangyu Luo;
展開▼
作者單位

College of Computer Science & Technology Xi'an University of Science and Technology Xi' an 710054 Shaanxi China;

Beijing Beibian MicroGrid Technology Co. Ltd. Beijing 100193 China;

展開▼
收錄信息
原文格式 PDF
正文語種 eng
中圖分類
關(guān)鍵詞
Hot topic identification; clustering; Single-pass; word segmentation;

機(jī)譯：熱門話題識(shí)別;集群?jiǎn)纬谭衷~;

相似文獻(xiàn)

外文文獻(xiàn)
中文文獻(xiàn)
專利

1. Extracting and tracking hot topics of micro-blogs based on improved Latent Dirichlet Allocation [J] . Du YaJun, Yi YongTao, Li XianYong, Engineering Applications of Artificial Intelligence . 2020,第Jana期

機(jī)譯：基于改進(jìn)的潛在狄利克雷分配法提取和跟蹤微博熱點(diǎn)話題
2. Micro-Blog Topic Detection Method Based on BTM Topic Model and K-Means Clustering Algorithm [J] . Weijiang Li, Yanming Feng, Dongjun Li, Automatic Control and Computer Sciences . 2016,第4期

機(jī)譯：基于BTM主題模型和K-Means聚類算法的微博主題檢測(cè)方法
3. Topology-based Algorithm for Users' Influence on Specific Topics in Micro-blog [J] . Jinfeng Yuan, Li Li, Le Luo, Journal of information and computational science . 2013,第8期

機(jī)譯：基于拓?fù)涞挠脩魧?duì)微博中特定主題影響的算法
4. An Improved Single-Pass Algorithm for Chinese Microblog Topic Detection and Tracking [C] . Danfeng Yan, Enzheng Hua, Bo Hu IEEE International Congress on Big Data . 2016

機(jī)譯：一種改進(jìn)的中文微博主題檢測(cè)與跟蹤單次通過算法
5. Algorithmes numeriques en temps reel appliques a l'identification de cristaux et a la mesure de l'estampe du temps pour scanner TEP/TDM tout-numerique a base de photodiodes a avalanche. [D] . Semmaoui, Hicham. 2009

機(jī)譯：實(shí)時(shí)數(shù)值算法應(yīng)用于基于雪崩光電二極管的全數(shù)字PET / CT掃描儀的晶體識(shí)別和時(shí)間戳測(cè)量。
6. Parameters Identification for Photovoltaic Module Based on an Improved Artificial Fish Swarm Algorithm [O] . Wei Han, Hong-Hua Wang, Ling Chen -1

機(jī)譯：基于改進(jìn)人工魚群算法的光伏組件參數(shù)識(shí)別
7. Emotional Tendency Identification for Micro-blog Topics Based on Multiple Characteristics [O] . Liu Quanchao, Feng Chong, Huang Heyan 2012

機(jī)譯：基于多特征的微博話題情感傾向識(shí)別

獲取原文

客服郵箱：kefu@zhangqiaokeyan.com

京公網(wǎng)安備：11010802029741號(hào) ICP備案號(hào)：京ICP備15016152號(hào)-6 六維聯(lián)合信息科技 (北京) 有限公司?版權(quán)所有

客服微信
服務(wù)號(hào)

国产bbaaaaa片,成年美女黄网站色视频免费,成年黄大片,а天堂中文最新一区二区三区,成人精品视频一区二区三区尤物

Hot topic identification from micro-blog based on improved Single-pass algorithm

摘要

著錄項(xiàng)

引文網(wǎng)絡(luò)

相似文獻(xiàn)

相關(guān)主題

期刊訂閱