- CluChunk: Clustering Large Scale User-generated Content Incorporating Chunklet Information
Yu Cheng, Yusheng Xie, Kunpeng Zhang, Ankit Agrawal and Alok Choudhary.
- Parallel Rough Set based Knowledge Acquisition using MapReduce from Big Data
Junbo Zhang, Tianrui Li and Yi Pan.
- Delta-SimRank Computing on MapReduce
Liangliang Cao, Hyun Duk Kim, Min-Hsuan Tsai, Brian Cho, Zhen Li, Indrani Gupta, Chengxiang Zhai and Thomas Huang.
- Incrementally Optimized Decision Tree for Noisy Big Data
Hang Yang and Simon Fong.
- Compression-Aware I/O Performance Analysis for Big Data Clustering
Zhenghua Xue, Jianhui Li, Yang Zhang, Geng Shen, Qian Xu and Jing Shao.
- Space-Efficient Sampling from Social Activity Streams
Nesreen Ahmed, Jennifer Neville and Ramana Kompella.
- A parallel graph partitioning algorithm to speed up the large-scale distributed graph mining
Zengfeng Zeng, Bin Wu and Haoyu Wang.
- A Density-Based Clustering Structure Mining Algorithm for Data Streams
Huan Wang, Yanwei Yu, Qin Wang and Yadong Wan.
- Subscriber classification within telecom networks utilizing big data technologies and machine learning
Tor Kvernvik and Jonathan Magnusson.
- Accelerating Minor Allele Frequency Computation with Graphics Processors
Mian Lu, Jiuxin Zhao, Qiong Luo and Bingqiang Wang.
- Online Feature Selection for Mining Big Data
Steven C.H. Hoi, Jialei Wang, Peilin Zhao and Rong Jin.
- Accelerating Bayesian Network Parameter Learning Using Hadoop and MapReduce
Aniruddha Basak, Irina Brinster, Xianheng Ma and Ole J. Mengshoel.
- Stream-Dashboard: A Framework for Mining, Tracking and Validating Clusters in a Data Stream
Basheer Hawwash and Olfa Nasraoui.
- A Kernel Fused Perceptron for the Online Classification of Large-Scale Data
Huijun He, Mingmin Chi and Wenqiang Zhang.