Technological Correlation in ICT Based on Text Mining Technique
XU Zhe, CUI Hai-yun, CAO Zan-hua, AI Bian-kai
(Patent Examination Cooperation Center of the Patent Office, SIPO, Beijing 100190, China)
Abstract: We analysed China's invention patents in the ICT field from 2000 to 2007 using text mining, extracting more than 1,000 technical keywords and recording the number of occurrences of each. Taking the keyword occurrence counts as feature vectors, we computed the technological correlation matrix between subfields using the cosine of the angle between vectors. Social network analysis was then used to analyse the degree of technological correlation within the ICT field. The results show that the basic electronic circuits and semiconductor devices subfields are highly correlated with the other subfields and are core subfields of ICT. R&D strategic planning should therefore give greater weight to R&D investment in these core subfields and intensify R&D effort there.
Key words: patent; text mining; information and communication technology; technological correlation; social network analysis
CLC number: F 062.3 Document code: A
Received: 2011-11-07
About the author: XU Zhe (born 1982), male, master's degree.
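The correlation step summarised in the abstract (keyword occurrence counts as feature vectors, cosine of the angle between vectors as the correlation between subfields) can be illustrated with a minimal sketch. The subfield names, keyword counts, and function names below are hypothetical and are not taken from the paper's data. The mean off-diagonal correlation is only a simple stand-in for the social network analysis measures, which the abstract does not specify.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two keyword-count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # a subfield with no keywords is treated as uncorrelated
    return dot / (norm_u * norm_v)


def correlation_matrix(vectors):
    """Pairwise cosine similarity between all subfield vectors."""
    return [[cosine_similarity(u, v) for v in vectors] for u in vectors]


def mean_correlation(matrix):
    """Average correlation of each subfield with all other subfields.

    Assumption: used here as a rough indicator of how central a subfield is;
    the paper's actual network measures may differ.
    """
    n = len(matrix)
    return [(sum(row) - row[i]) / (n - 1) for i, row in enumerate(matrix)]


# Hypothetical toy data: rows are subfields, columns are keyword counts.
subfields = ["basic electronic circuits", "semiconductor devices", "telecommunications"]
counts = [
    [4, 2, 0, 1],
    [3, 3, 1, 0],
    [0, 1, 5, 2],
]

corr = correlation_matrix(counts)
scores = mean_correlation(corr)
for name, score in zip(subfields, scores):
    print(f"{name}: {score:.3f}")
```

A subfield with a higher mean correlation shares more of its keyword profile with the others, which is the sense in which the abstract describes certain subfields as core.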