PDF全文下载:20160100108
石永革, 张毫
(南昌大学 信息工程学院,江西 南昌 330031)
摘要: BPMBM算法结合位并行和过滤技术,是当前近似字符串匹配算法中效率最高的算法之一。算法中过滤机制容易导致位并行计算连续性中断,使位并行计算回溯导致性能大幅降低。针对此问题提出了基于过滤优化的BPMBM算法。实验结果表明:优化算法在大字符集环境下继承了BPMBM算法的运行高效性,在非大字符集环境下较BPMBM算法提升显著,且随着编辑距离的增长,其时间开销增长的稳定性大幅优于BPMBM算法。
关键词: 近似字符串匹配;BPMBM算法;位并行;过滤
中图分类号: TP 391 文献标志码: A
A Filter Optimized Bitparallel Approximate String Matching Algorithm
SHI Yongge, ZHANG Hao
(Information Engineering School, Nanchang University, Nanchang 330031, China)
Abstract: BPMBM, combined with filter and bitparallel mechanism, has become one of the most effective solutions for approximate string matching problem. Bitparallelism's continuity contradicts with filter as its continuity is always interrupted by filter easily, which brings serious backtracking for the bit parallel calculation and result in a significant decrease in performance. To solve this problem, an optimized BPMBM algorithm is proposes based on filter optimization. The experimental results show that the optimized BPMBM algorithm inherits BPMBM′s high performance when dealing with large character sets, and its performance is significantly enhanced when dealing with non large character sets. Besides, its time cost growth is more stable with the edit distance growth comparing with BPMBM.
Key words: approximate string match; BPMBM algorithm; bitparallel; filter
收稿日期: 20150328
基金项目: 国家自然科学基金项目(61163005).
作者简介: 石永革(1953—),男,教授.
文章编号: 16726987(2016)01010805; DOI: 10.16351/j.16726987.2016.01.022