FP-Tree-Based Approach for Frequent Trajectory Pattern Mining
-
摘要: 通过对经典频繁模式数据结构FP-tree的扩展与改进,提出了一种适用于处理轨迹数据的灵活高效的FP-tree轨迹挖掘方法(NFTM)。首先运用二维筛选和GPS格式过滤的方法对轨迹进行预处理,然后将有效数据经一次扫描后,生成按照真实轨迹顺序排列且具备时空属性的改进型FP-tree,使用动态数组存储模式挖掘过程中得到的候选集,根据用户的输入针对性输出相应时间和频率范围的频繁轨迹。最后通过与GSP算法、Prefixspan算法的对比测试表明,该算法具有更短执行时间和更优性能。
-
[1] AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules[C]//Very large data bases(VLDB). San Francisco, CA, USA: Morgan Kaufmann Publishers, 1994: 487-499. [2] HAN Jia-wei, PEI Jian, YIN Yi-wen, et al. Mining frequent patterns without candidate generation: a frequent pattern tree approach[J]. Data Mining and Knowledge Discovery, 2004, 8(1): 53-87. [3] SRIKANT R, AGRAWAL R. Advances in database technology—EDBT'96[M]. Berlin, Heidelberg: Springer, 1996: 1554-1558. [4] PEI Jian, HAN Jia-wei, MORTAZAVI-ASL B, et al. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth[C]//ICDE'01 Proceedings of the 17th International Conference on Data Engineering. Washington D C, USA: IEEE Computer Society, 2001: 215-224. [5] ZAKI M J. SPADE: an efficient algorithm for mining frequent sequences[J]. Machine Learning, 2001, 11(5): 31-60. [6] KANG Juyoung, YONG Hwan-Seung Y. Mining trajectory patterns by incorporating temporal properties[C]//Proceedings of the 1st International Conference on Emerging Database. Busan, Korea: Hwan-Seung, 2009: 63-68. [7] GIANNOTTI F, NANNI M, PINELLI F, et al. Trajectory pattern mining[C]//Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. California, USA : Scan Jose, 2007: 330-339. [8] PEI Jian, HAN Jia-wei, MAO run-ying. CLOSET: an efficient algorithm for mining frequent closed itemsets[C]//Proceedings of the 5th ACM-SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery. Dallas, USA: ACM, 2000: 11-20. [9] MILIARAKI I, BERBERICH K, GEMULLA R, et al. Mind the gap large-scale frequent sequence mining[C]//Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. New York, USA: ACM 2013: 797-808. [10] DEAN J, GHEMAWAT S. Mapreduce: Implied data processing on large clusters[C]//6th Symposium on Operating Systems Design and Implementation. [S.l.]: USENIX Association, 2004: 137-149. [11] GRUENWALD L, CHOK H, ABOUKHAMIS M. Using data mining to estimate missing sensor data[C]//IEEE International Conference on Data Mining Workshops. Omaha, NE, USA: IEEE, 2007: 207-212. [12] GRUENWALD L, YANG Han-qing, SADIK M S, et al. Using data mining to handle missing data in multi-hop sensor network applications[C]//9th ACM International Workshop on Data Engineering for Wireless and Mobile Access. New York, USA: ACM, 2010: 9-16. [13] GENG Xiao-liang, ARIMURA H, UNO T. Pattern mining from trajectory GPS data[C]//2012 IIAI International Conference on Advanced Applied Informatics. Fukuoka, USA: IEEE, 2012: 60-65. [14] LIANG Wan-ga, HUA Kunyuan, TAO Ku, et al. Mining frequent trajectory pattern based on vague space partition[J]. Knowledge-Based Systems, 2013, 50: 100-111.
点击查看大图
计量
- 文章访问数: 4059
- HTML全文浏览量: 146
- PDF下载量: 318
- 被引次数: 0