
National High Technology Research and Development Program of China (2012AA01A402)

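Works: 2 · Citations: 18 · H-index: 1
Funding sources: National High Technology Research and Development Program; National Natural Science Foundation of China; National Basic Research Program of China
Related fields: Automation and Computer Technology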

Document type

  • 2 journal articles
  • 2 conference papers

Field

  • 4 papers: Automation and Computer...

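Topic

  • 1 paper: Dirichlet
  • 1 paper: user
  • 1 paper: user interest
  • 1 paper: database
  • 1 paper: interface
  • 1 paper: relational database
  • 1 paper: massive
  • 1 paper: massive data
  • 1 paper: access interface
  • 1 paper: NOSQL
  • 1 paper: PAID
  • 1 paper: SINA
  • 1 paper: GIBBS sampl...
  • 1 paper: HANDS
  • 1 paper: HBASE
  • 1 paper: LDA
  • 1 paper: MAPRED...
  • 1 paper: ORGANI...
  • 1 paper: USER
  • 1 paper: PROMOT...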

Institution

  • 2 papers: National Defense Science and Technology...

Author

  • 2 papers: Yang Shuqiang (杨树强)
  • 1 paper: Xu Xishan (徐锡山)
  • 1 paper: Xiao Ying (肖英)
  • 1 paper: Yu Cheng (喻承)

Venue

  • 2 papers: China ...
  • 1 paper: The 9th China ... (第九届中国通...)

Year

  • 1 paper: 2015
  • 1 paper: 2014
  • 2 papers: 2012
2 records; showing 1-4
Mining User Interest in Microblogs with a User-Topic Model (Citations: 17)
2014
Microblogs have become an important platform for people to publish and transform information and to acquire knowledge. This paper focuses on the problem of discovering user interest in microblogs. We propose a topic mining model based on Latent Dirichlet Allocation (LDA), named the user-topic model. For each user, interests are divided into two parts according to how the microblogs are generated: original interest and retweet interest. We present a Gibbs sampling implementation for inferring the parameters of our model, and discover not only users' original interest but also their retweet interest. We then combine original interest and retweet interest to compute interest words for users. Experiments on a dataset of Sina microblogs demonstrate that our model discovers user interest effectively and outperforms existing topic models in this task. We also find that original interest and retweet interest are similar and that the topics of interest contain user labels. The interest words discovered by our model reflect user labels, but their range is much broader.
HE Li, JIA Yan, HAN Weihong, DING Zhaoyun
Keywords: user interest; Gibbs sampling; Dirichlet; LDA
Research on Benchmarks for Massive-Data Non-Relational Databases
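Although non-relational databases for massive data got a late start, they offer advantages and features that traditional relational databases cannot match, and so they have developed very rapidly. In the current era of booming cloud computing, ever-larger data volumes, and increasingly frequent data access and processing, non-relational databases for massive data are playing an increasingly important role. However, ...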
Yu Cheng (喻承), Yang Shuqiang (杨树强), Xiao Ying (肖英)
Keywords: NOSQL; database
Finding the Hidden Hands: A Case Study of Detecting Organized Posters and Promoters in SINA Weibo (Citations: 1)
2015
With the development of online social networks, a special group of online users named organized posters (also called the Internet water army or Internet paid posters in some literature) have flooded social network communities. They are organized in groups to post with specific purposes and sometimes even confuse or mislead normal users. In this paper, we study the individual and group characteristics of organized posters. A classifier is constructed based on these individual and group characteristics to detect them. Extensive experimental results on three real datasets demonstrate that our method, which applies an SVM model to individual and group characteristics (IGCSVM), is effective in detecting organized posters and performs better than existing methods. We also take a first look at finding promoters based on the organized posters detected by IGCSVM. Our experiments show that it is effective in detecting promoters.
WANG Xiang, ZHANG Zhilin, YU Xiang, JIA Yan, ZHOU Bin, LI Shasha
Keywords: ORGANIZED POSTERS; PAID; PROMOTER
Optimization of the MapReduce Access Interface for HBase
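The MapReduce access interface that HBase currently provides suffers from slow data reads. To address this problem, this paper proposes an improved method that, instead of using the original logical storage unit, the Region, as the basic unit of task assignment, uses HBase's physical storage unit, the Block, as the ...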
Tian Shengli (田胜利), Xu Xishan (徐锡山), Yang Shuqiang (杨树强), Hua Zhongjie (华中杰)
Keywords: HBASE; MAPREDUCE