CIS5930: Advanced Data Mining (Spring 2021)

Instructor: Peixiang Zhao

| Syllabus | Announcement | Schedule | Assignment | Resources |


Assignment Information





Project Information

  1. Tree-based ensemble learning
    • XGBoost: a Scalable Tree Boosting System. KDD'16
    • LightGBM: a Highly Efficient Gradient Boosting Decision Tree. NeurIPS'17
    • CatBoost: Unbiased Boosting with Categorical Features. NeurIPS'18
  2. Frequent graph pattern mining
    • gSpan: Graph-based Substructure Pattern Mining. ICDM'03
    • A Quickstart in Frequent Structure Mining Can Make a Difference. KDD'04
    • Flexible and Feasible Support Measures for Mining Frequent Patterns in Large Labeled Graphs. SIGMOD'17
  3. Generative Adversarial Nets (GAN)
    • Generative Adversarial Nets. NeurIPS'14
    • Wasserstein GAN. Arxiv'17
    • Are GANs Created Equal? NeurIPS'18
  4. Graph Embedding
    • DeepWalk - Online Learning of Social Representations (KDD'14)
    • LINE - Large-scale Information Network Embedding (WWW'15)
    • Node2vec - Scalable Feature Learning for Networks (KDD'16)
  5. Data Sketching for Data Streams
    • Mergeable Summaries (TODS'13)
    • Efficient Frequent Directions Algorithm for Sparse Matrices (KDD'16)
    • A high-performance algorithm for identifying frequent items in data streams (IMC'17)