首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于矩阵降维的典型用户文件发现方法
引用本文:陆建江,徐宝文,黄刚石,张亚非.基于矩阵降维的典型用户文件发现方法[J].东南大学学报,2003,19(3):231-235.
作者姓名:陆建江  徐宝文  黄刚石  张亚非
作者单位:[1]东南大学计算机科学与工程系,南京210096 [2]国防科学技术大学计算机学院,长沙410073 [3]解放军理工大学理学院,南京210007
基金项目:TheNationalNaturalScienceFoundationofChina(60 0 73 0 12 ) ,NationalGrandFundamentalResearch 973ProgramofChina (2 0 0 2CB3 12 0 0 0 ),NationalResearchFoundationfortheDoctoralProgramofHigherEducationofChinaandOpeningFoundationofJiangsuKeyLaboratoryofComp
摘    要:应用聚类技术能够自动地发现典型用户件,但是由于会话向量通常是高维的稀疏向量,因此很难在会话向量之间设计有效的相似度度量.本提出2种基于矩阵降维的典型用户件发现方法.这些方法应用非负矩阵分解技术降低会话-URL矩阵的维数,并通过球形的后.均值算法对用户会话向量的投影向量聚类,由此得到典型用户件.实验结果表明,这些算法能够有效地从用户会话中发现典型的用户件.

关 键 词:数据挖掘  Web挖掘  非负矩阵分解  球形k-均值算法  矩阵降维  典型用户文件  发现方法  知识发现

Matrix dimensionality reduction for mining typical user profiles
Lu Jianjiang,Xu Baowen,Huang Gangshi,ZHANG Yafei.Matrix dimensionality reduction for mining typical user profiles[J].Journal of Southeast University(English Edition),2003,19(3):231-235.
Authors:Lu Jianjiang  Xu Baowen  Huang Gangshi  ZHANG Yafei
Abstract:Recently clustering techniques have been used to automatically discover typical user profiles. In general, it is a challenging problem to design effective similarity measure between the session vectors which are usually high-dimensional and sparse. Two approaches for mining typical user profiles, based on matrix dimensionality reduction, are presented. In these approaches, non-negative matrix factorization is applied to reduce dimensionality of the session-URL matrix, and the projecting vectors of the user-session vectors are clustered into typical user-session profiles using the spherical k -means algorithm. The results show that two algorithms are successful in mining many typical user profiles in the user sessions.
Keywords:Web usage mining  non-negative matrix factorization  spherical k-means algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号