XIFENG YAN

home | research | publications | tutorials | software


[dblp][category]

Journal Papers
Conference Papers
Book Chapters
Workshop Papers, Demos, and Technical Reports

Journal Papers

  1. Frequent Pattern Mining: Current Status and Future Directions,
    by J. Han, H. Cheng, D. Xin and X. Yan,
    DMKD'07 (Data Mining and Knowledge Discovery, 10th Anniversary Issue), 2007 (Invited submission, to appear) [pdf]
  2. On compressing frequent patterns,
    by D. Xin, J. Han, X. Yan, H. Chen, 
    DKE'07 (Data Knowledge Engineering), 60(1): 5-29, 2007 [pdf]
  3. Integrative Array Analyzer: A Software Package for Analysis of Cross-platform and Cross-species Microarray Data,
    by F. Pan, K Kamath, K. Zhang, S. Pulapura, A. Achar, J. Nunez-Iglesias, Y. Huang, X. Yan, J. Han, H. Hu, M. Xu, J. Hu, and X. Jasmine Zhou,
    Bioinformatics'06
    , Vol.22 no.13: 1665–1667, 2006. [pdf]
  4. Feature-based Substructure Similarity Search, 
    by X. Yan, F. Zhu, P. S. Yu, and J. Han,
    ACM-TODS'06 (ACM Transactions on Database Systems), Dec. 2006. [pdf]
  5. Statistical Debugging: A Hypothesis Testing-based Approach,
    by  C. Liu, L. Fei, X. Yan, J. Han and S. Midkiff,
    IEEE-TSE'06 (IEEE Transaction on Software Engineering), 32(10):831-848, 2006. [pdf]
  6. Graph Indexing Based on Discriminative Frequent Structure Analysis, 
    by X. Yan, P. S. Yu, and J. Han,
    ACM-TODS'06 (ACM Transactions on Database Systems), Dec. 2005. [pdf]
  7. TSP: Mining Top-K Closed Sequential Patterns,  
    by P. Tzvetkov, X. Yan, and J. Han,
    KAIS'05 (Knowledge and Information Systems: An International Journal), 7:438-457, 2005. [pdf]
  8. From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach, 
    by J. Han, J. Pei, and X. Yan,
    JCST'04 (Journal of Computer Science and Technology), 19(3): 257 – 279, 2004. [pdf]

Conference Papers

  1. Mining Significant Graph Patterns by Scalable Leap Search,
    X. Yan, H. Cheng, J. Han, and P. S. Yu,
    SIGMOD'08 (Proc. 2008 ACM SIGMOD Int. Conf. on Management of Data), Jun. 2008 [pdf]
  2. Direct Discriminative Pattern Mining for Effective Classification,
    H. Cheng, X. Yan, J. Han, and P. S. Yu,
    ICDE'08 (Proc. of 2008 Int. Conf. on Data Engineering), Apr. 2008. [pdf]
  3. gApprox: Mining Frequent Approximate Patterns from a Massive Network,
    by C. Chen, X. Yan, F. Zhu, and J. Han.
    ICDM'07a (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  4. Efficient Discovery of Frequent Approximate Sequential Patterns,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    ICDM'07b (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  5. Towards Graph Containment Search and Indexing,
    by C. Chen, X. Yan, P. S. Yu, J. Han, D.-Q. Zhang and X. Gu.
    VLDB'07a (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  6. Entity Search: Search Directly and Holistically,
    by T. Cheng, X. Yan and K. Chang.
    VLDB'07b (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  7. A Graph-Based Approach to Systematically Reconstruct Human Transcriptional Regulatory Modules,
    by X. Yan, M. Mehan, Y. Huang, M. S. Waterman, P. S. Yu, and X. Zhou.
    ISMB'07a (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  8. Systematic Discovery of Functional Modules and Context-Specific Functional Annotation of Human Genome,
    by Y. Huang, H. Li, H. Hu, X. Yan, M. S. Waterman, H. Huang, and X. Zhou.
    ISMB'07b (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  9. gPrune: A Constraint Pushing Framework for Graph Pattern Mining,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    PAKDD'07 (Proc. of 2007 Pacific-Asia Conference on Knowledge Discovery and Data Mining), May 2007. Best Student Paper. [pdf]
  10. Mining Colossal Frequent Patterns by Core Pattern Fusion,
    by F. Zhu, X. Yan, J. Han, P. S. Yu, and H. Cheng.
    ICDE'07a (Proc. of 2006 Int. Conf. on Data Engineering), Apr. 2007. Best Student Paper [pdf]
  11. Discriminative Frequent Pattern Analysis for Effective Classification,
    by H. Cheng, X. Yan, J. Han, and C. Hsu.
    ICDE'07b (Proc. of 2006 Int. Conf. on Data Engineering), Apr. 2007. [pdf]
  12. Extracting Redundancy-aware Top-k Patterns,
    by D. Xin, H. Cheng, X. Yan, J. Han, 
    SIGKDD'06 (Proc. of 2006 Int. Conf. on Knowledge Discovery and Data Mining). [pdf]
  13. Mining Control Flow Abnormality for Logic Error Isolation,

    by C. Liu, X. Yan, and J. Han,

    SDM'06 (Proc. of 2006 SIAM Int. Conf. on Data Mining), 2006. [pdf] (acceptance rate, 16%)

  14. Searching Substructures with Superimposed Distance, 
    by X. Yan, F. Zhu, J. Han, and P. S. Yu,
    ICDE'06 (Proc. of 2006 Int. Conf. on Data Engineering), 2006. [pdf] [ppt_slides] (acceptance rate, 20%)
  15. Community Mining from Multi-Relational Networks, 
    by D. Cai, Z. Shao, X. He, X. Yan, J. Han,
    PKDD'05 (Proc. of 2005 European Conf. on Principles and Practice of Knowledge Discovery in Databases), 2005. [pdf] (acceptance rate, 28%)
  16. SOBER: Statistical Model-based Bug Localization, 
    by C. Liu, X. Yan, L. Fei, J. Han, and S. Midkiff,
    FSE'05 (Proc. of 2005 13th ACM SIGSOFT Symp. on the Foundations of Software Engineering), 2005.   [pdf] [website] (acceptance rate, 16%)
  17. Mining Compressed Frequent-Pattern Sets, 
    by D. Xin, J. Han, X. Yan and H. Cheng,
    VLDB'05 (Proc. of 2005 Int. Conf. on Very Large Data Bases), 2005. [pdf] (acceptance rate, 16.5%)
  18. Summarizing Itemset Patterns: A Profile-Based Approach, 
    by X. Yan, H. Cheng, J. Han, and D. Xin,
    SIGKDD'05a
    (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005, Best Student Paper RunnerUp. [pdf] (acceptance rate, 9%)
  19. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Jasmine Zhou, and J. Han,
    SIGKDD'05b (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005. [pdf] (acceptance rate, 9%)
  20. Mining Coherent Dense Subgraphs Across Massive Biological Networks for Functional Discovery, 
    by H. Hu, X. Yan, Y. Huang, J. Han, X. Jasmine Zhou,
    ISMB'05 (also Bioinformatics). [pdf] [website(acceptance rate, 13%)
  21. Substructure Similarity Search in Graph Databases, 
    by X. Yan, P. S. Yu, and J. Han,

    SIGMOD'05 (Proc. of 2005 Int. Conf. on Management of Data), 2005. [pdf] (acceptance rate, 15%)
    Among top-ranked papers in SIGMOD'05, Invited to  ACM Transactions on Database Systems (TODS).
  22. Mining Behavior Graphs for `Backtrace' of Noncrashing Bugs, 
    by C. Liu, X. Yan, H. Yu, J. Han, and P. S. Yu,

    SDM'05a (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005. [pdf] (acceptance rate, 18%)
  23. SeqIndex: Indexing Sequences by Sequential Pattern Analysis, 
    by H. Cheng, X. Yan, and J. Han,

    SDM'05b (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005 (short paper). [pdf] (acceptance rate, 36%)
  24. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Zhou, J. Han,
    ICDE'05 (Proc. of 2005 Int. Conf. on Data Engineering) (short paper). [pdf]
  25. Graph Indexing: A Frequent Structure-based Approach, 
    by X. Yan, P. S. Yu, and J. Han,
    SIGMOD'04 (Proc. of 2004 Int. Conf. on Management of Data), 2004. [pdf][dataset] (acceptance rate, 16%)
    Among top-ranked papers in SIGMOD'04, Invited to  ACM Transactions on Database Systems (TODS).
  26. IncSpan: Incremental Mining of Sequential Patterns in Large Database, 
    by H. Cheng, X. Yan, and J. Han,

    SIGKDD'04 (Proc. 2004 of the Int. Conf. on Knowledge Discovery and Data Mining), 2004. [pdf]
  27.   (acceptance rate 25%)
  28. CloseGraph: Mining Closed Frequent Graph Patterns, 
    by X. Yan and J. Han,

    SIGKDD'03 (Proc. of 2003 Int. Conf. Knowledge Discovery and Data Mining), 2003. [pdf] (acceptance rate, 13%)

    Google Scholar ranks CloseGraph as #1 for "graph pattern mining", with 140 citations. (as of Nov 25, 2007)
  29. CloSpan: Mining Closed Sequential Patterns in Large Datasets,
    by X. Yan, J. Han, and R. Afshar,

    SDM'03 (Proc. of 2003 SIAM Int. Conf. Data Mining), 2003.  [pdf] (acceptance rate, 20%)
  30. TSP: Mining Top-K Closed Sequential Patterns,
    by P. Tzvetkov, X. Yan, and J. Han,
    ICDM'03 (Proc. of 2003 Int. Conf. on Data Mining), 2003. [pdf] (acceptance rate, 12%)
  31. gSpan: Graph-Based Substructure Pattern Mining,
    by X. Yan and J. Han,
    ICDM'02 (Proc. of 2002 Int. Conf. on Data Mining) (short paper), 2002.  [pdf]  (acceptance rate, 31%)
    Expanded Version, UIUC Technical Report, UIUCDCS-R-2002-2296. [pdf]
    Google Scholar ranks gSpan as #3 for "graph pattern mining", with 276 citations. (as of Nov 25, 2007)
  32. Accelerating Volume Rendering with L-Buffer,
    by X. Yan, W. Cai and J. Shi,
    CAD&Graphics'97
    , Wuhan, China, 1997.

Book Chapters

  1. Discovery of Frequent Substructures
    by X. Yan and J. Han,
    Mining Graph Data, D. Cook and L. Holder, John Wiley & Sons Inc, 2007.
  2. Discovering evolutionary classifier over high speed non-static stream,  
    by J. Yang, X. Yan, J. Han, and W. Wang,
    Advanced Methods for Knowledge Discovery from Complex Data, S. Bandyopadhyay, U. Maulik, L. Holder, D. Cook (Eds.), Springer, 2005.
  3. Mining Frequent Patterns in Data Streams at Multiple Time Granularities,
    by C. Giannella, J. Han, J. Pei, X. Yan, and P. S. Yu,
    Next Generation Data Mining, H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.),  AAAI/MIT, 2004.
  4. Sequential Pattern Mining by Pattern-Growth: Principles and Extensions,
    by J. Han, J. Pei, and X. Yan,
    Recent Advances in Data Mining and Granular Computing (Mathematical Aspects of Knowledge Discovery), W. Chu and T. Lin (eds.), Springer Verlag, 2004.

Workshop Papers, Demos, and Technical Reports

  1. BioArrayMine: A Software Package for Integrative Analysis of Cross-platform and Cross-species Microarray Data,  
    by F. Pan, K. Kamath, H. Hu, Y. Huang, K. Zhang, M. Xu, X. Yan, J. Han, and X. Jasmine Zhou,
    Proc. of 2005 Int. Conf. on Intelligent Systems for Molecular Biology (ISMB'05), Detroit, MI, 2005 (system demo).
  2. GraphMiner: A Structural Pattern Mining System for Large Disk-based Graph Databases and Its Applications,  
    by W. Wang, C. Wang, Y. Zhu, B. Shi, J. Pei, X. Yan, and J. Han,
    Proc. of 2005 Int. Conf. on Management of Data (SIGMOD'05), 879 – 881, Baltimore, MD, 2005 (system demo).
  3. Mining Hidden Community in Heterogeneous Social Networks,  
    by D. Cai, Z. Shao, X. He, X. Yan, and J. Han,
    Technical Report UIUCDCS-R-2005-2538, Department of Computer Science, University of Illinois at Urbana-Champaign, 2005.
  4. Using Data Mining for Discovering Patterns in Autonomic Storage Systems,  
    by Z. Li, S. Srinivasan, Z. Chen, Y. Zhou, P. Tzvetkov, X. Yan, and J. Han,
    ACM Workshop on Algorithms and Architectures for Self-Managing Systems, Proc. of 2003 Federated Computing Research Conference (FCRC'03), 2003.
  5. A Framework for Continuous Quantile Computation over Sensor Networks,  
    by X. Yan, J. Yang, J. Han, and W. Wang,
    Technical Report UIUCDCS-R-2003-2382, Department of Computer Science, University of Illinois at Urbana-Champaign, 2003.
  6. gSpan: Graph-Based Substructure Pattern Mining,  
    by X. Yan and J. Han,
    Technical Report UIUCDCS-R-2002-2296, Department of Computer Science, University of Illinois at Urbana-Champaign, 2002.