Chenhao Ma

Chenhao Ma

Assistant Professor
School of Data Science
The Chinese University of Hong Kong, Shenzhen
Longgang, Shenzhen, Guangdong, China
machenhao (at) cuhk.edu.cn

Google Scholar ~ DBLP

Bio

Dr. Chenhao Ma is an Assistant Professor at the Chinese University of Hong Kong, Shenzhen. Prior to that, he was a Postdoctoral Fellow at the University of Hong Kong. He received his PhD degree from Department of Computer Science in the University of Hong Kong (HKU) in 2021. He once was a visiting student in the University of New South Wales (UNSW) in 2019. Till now, he has published more than 30 papers in the areas of database and data mining, including one of four Best of SIGMOD2020 (a world flagship conference in database areas, 4/458), and most of them were published in top-tier conferences and journals (e.g., SIGMOD, PVLDB, and TODS). He was awarded the ACM SIGMOD Research Highlight Award 2021. He has served as PC members and reviewers for several top conferences and journals (e.g., VLDB, KDD, WWW, CIKM, TKDE, and VLDBJ).

News

  • I am looking for highly self-motivated PhD/MPhil students, PostDocs, and research assistants. If you are interested in working with me, please feel free to drop me an email. Please check the position info.

  • Applied Sciences Call for paper: Big Data Applications in Transportation

Interests

His research interests mainly focus on large-scale data management and data mining:

  • Graph data management: dense subgraph search, motif analysis, convex optimization, GNN.

  • Traffic data mining: trajectory outlier detection, traffic network weight completion.

  • AI4DB: text2sql, dataset search.

Publications

* indicates Chenhao is a corresponding author

In the Year of 2024
  1. Effective Job-market Mobility Prediction with Attentive Heterogeneous Knowledge Learning and Synergy
    Sida Lin, Zhouyi Zhang, Yankai Chen, Chenhao Ma*, Yixiang Fang, Shan Dai and Guangli Lu
    In The Conference on Information and Knowledge Management (CIKM), Short, 2024.
    Paper

  2. Towards Effective Top-N Hamming Search via Bipartite Graph Contrastive Hashing
    Yankai Chen, Yixiang Fang, Yifei Zhang, Chenhao Ma, Yang Hong, Irwin King.
    In IEEE Transactions on Knowledge and Data Engineering (TKDE), 2024.
    Paper

  3. Efficient Maximal Motif-Clique Enumeration over Large Heterogeneous Information Networks
    Yingli Zhou, Yixiang Fang, Chenhao Ma, Tianci Hou, Xin Huang
    In Proceedings of the VLDB Endowment (PVLDB), 2024.
    Paper

  4. Scalable Algorithm for Finding Balanced Subgraphs with Tolerance in Signed Networks
    Jingbang Chen, Qiuyang Mang, Hangrui Zhou, Richard Peng, Yu Gao, Chenhao Ma*
    In SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024.
    Paper

  5. Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
    Ge Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma, Reynold Cheng
    In Annual Meeting of the Association for Computational Linguistics (ACL), Findings, 2024.
    Paper

  6. FDM: Effective and Efficient Incident Detection on Sparse Trajectory Data
    Xiaolin Han, Tobias Grubenmann, Chenhao Ma*, Xiaodong Li, Wenya Sun, Sze Chun Wong, Xuequn Shang, Reynold Cheng
    In Information Systems, 2024.
    Paper

  7. TRoute: Dynamic Time-dependent Route Recommendation on Road Networks
    Xiaolin Han, Xiurui Hu, Chenhao Ma, Xuequn Shang.
    In 21th International Conference on Web Information Systems and Applications (WISA), 2024.

  8. Efficient and Effective Algorithms for Densest Subgraph Discovery and Maintenance.
    Yichen Xu, Chenhao Ma*, Yixiang Fang, Zhifeng Bao
    In The VLDB Journal, 2024.
    Paper

  9. Distributed Shortest Distance Labeling on Large-Scale Graphs.
    Yuanyuan Zeng, Chenhao Ma*, Yixiang Fang
    In Proceedings of the VLDB Endowment (PVLDB), 17, 2024.
    Paper

  10. Efficient Core Decomposition over Large Heterogeneous Information Networks
    Yucan Guo, Chenhao Ma*, Yixiang Fang
    In IEEE International Conference on Data Engineering (ICDE), 2024.
    Paper

  11. A Counting-based Approach for Efficient 𝑘-Clique Densest Subgraph Discovery
    Yingli Zhou, Qingshuo Guo, Yixiang Fang, Chenhao Ma
    In Proceedings of the 2024 ACM SIGMOD International Conference on Management of Data (SIGMOD), 2024.
    Paper

  12. On Efficient Large Sparse Matrix Chain Multiplication
    Chunxu Lin, Wensheng Luo, Yixiang Fang, Chenhao Ma, Xilin Liu, Yuchi Ma
    In Proceedings of the 2024 ACM SIGMOD International Conference on Management of Data (SIGMOD), 2024.
    Paper

  13. A Similarity-based Approach for Efficient Large Quasi-clique Detection
    Jiayang Pang, Chenhao Ma*, Yixiang Fang
    In ACM TheWebConf 2024 Conference (WWW), 2024.
    Paper

  14. Efficient Distributed Hop-Constrained Path Enumeration on Large-Scale Graphs
    Yuanyuan Zeng, Yixiang Fang, Chenhao Ma*, Xu Zhou, Kenli Li
    In Proceedings of the 2024 ACM SIGMOD International Conference on Management of Data (SIGMOD), 2024.
    Paper

  15. Influential Exemplar Replay for Incremental Learning in Recommender Systems
    Xinni Zhang, Yankai Chen, Chenhao Ma, Yixiang Fang, Irwin King
    In AAAI Conference on Artificial Intelligence (AAAI), 2024.
    Paper

  16. Accelerating Directed Densest Subgraph Queries with Software and Hardware Approaches
    Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks. V.S. Lakshmanan, Xiaolin Han, Xiaodong Li
    In The VLDB Journal (VLDBJ), 33(1): 207-230, 2024.
    Paper

In the Year of 2023
  1. MOSER: Scalable Network Motif Discovery using Serial Test
    Mohammad Matin Najafi, Chenhao Ma, Xiaodong Li, Laks V.S. Lakshmanan, Reynold Cheng
    In Proceedings of the VLDB Endowment (PVLDB), 17, 2023.
    Paper

  2. Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
    Jinyang Li, Binyuan Hui, GE QU, Binhua Li, Jiaxi Yang, Bowen Li, Bailin Wang, Bowen Qin, Ruiying Geng, Nan Huo, Xuanhe Zhou, Chenhao Ma, Guoliang Li, Kevin Chang, Fei Huang, Reynold Cheng, Yongbin Li
    In NeurIPS, Datasets and Benchmarks Track, Spotlight, 2023.
    Paper | Slides

  3. Efficient and Effective Algorithms for Generalized Densest Subgraph Discovery
    Yichen Xu, Chenhao Ma*, Yixiang Fang, Zhifeng Bao
    In Proceedings of the 2023 ACM SIGMOD International Conference on Management of Data (SIGMOD), 2023.
    Paper

  4. On Querying Connected Components in Large Temporal Graphs.
    Haoxuan Xie, Yixiang Fang, Yuyang Xia, Wensheng Luo, Chenhao Ma
    In Proceedings of the 2023 ACM SIGMOD International Conference on Management of Data (SIGMOD), 2023.
    Paper

  5. Graphix-T5: Mixing Pre-Trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing
    Jinyang Li, Binyuan Hui, Reynold Cheng, Bowen Qin, Chenhao Ma, Nan Huo, Fei Huang, Wenyu Du, Luo Si, Yongbin Li
    In AAAI Conference on Artificial Intelligence (AAAI), 2023.
    Paper

  6. Scalable Algorithms for Densest Subgraph Discovery
    Wensheng Luo, Zhuo Tang, Yixiang Fang, Chenhao Ma, Xu Zhou.
    In IEEE International Conference on Data Engineering (ICDE), 2023.
    Paper

In the Year of 2022
  1. Densest Subgraph Discovery on Large Graphs: Applications, Challenges, and Techniques
    Yixiang Fang, Wensheng Luo, Chenhao Ma.
    In Proceedings of the VLDB Endowment (PVLDB), 15, 2022.
    Paper

  2. Finding Locally Densest Subgraphs: A Convex Programming Approach
    Chenhao Ma, Reynold Cheng, Laks V.S. Lakshmanan, Xiaolin Han
    In Proceedings of the VLDB Endowment (PVLDB), 15, 2022.
    Paper

  3. Effective Community Search over Large Star-Schema Heterogeneous Information Networks
    Yangqin Jiang, Yixiang Fang, Chenhao Ma, Xin Cao, Chunshan Li
    In Proceedings of the VLDB Endowment (PVLDB), 15, 2022.
    Paper

  4. DeepTEA: Effective and Efficient Online Time-dependent Trajectory Outlier Detection
    Xiaolin Han, Reynold Cheng, Chenhao Ma* and Tobias Grubenmann
    In Proceedings of the VLDB Endowment (PVLDB), 15(7): 1493-1505, 2022.
    Paper

  5. A Convex-Programming Approach for Efficient Directed Densest Subgraph Discovery
    Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks V.S. Lakshmanan, Xiaolin Han
    In Proceedings of the 2022 ACM SIGMOD International Conference on Management of Data (SIGMOD), pages 845-859, 2022 .
    Paper

  6. Leveraging Contextual Graphs for Stochastic Weight Completion in Sparse Road Networks
    Xiaolin Han, Reynold Cheng, Tobias Grubenmann, Silviu Maniu, Chenhao Ma*, Xiaodong Li
    In SIAM International Conference on Data Mining (SDM), pages 64-72, 2022.
    Paper

  7. The Social Technology and Research (STAR) Lab in the University of Hong Kong
    Reynold Cheng, Chenhao Ma, Xiaodong Li, Yixiang Fang, Ye Liu, Victor Y.L. Wong, Esther Lee, Tai Hing Lam, Sai Yin Ho, Man Ping Wang, Weijie Gong, Wentao Ning, Ben Kao
    In ACM SIGMOD Record, 51, 2022.

In the Year of 2021
  1. On Directed Densest Subgraph Discovery
    Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks V.S. Lakshmanan, Wenjie Zhang, Xuemin Lin
    In ACM Transactions on Database Systems (TODS), 46(4):1-45, 2021.
    Invited as one of four Best of SIGMOD 2020.
    Paper

  2. Efficient Directed Densest Subgraph Discovery
    Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks V.S. Lakshmanan, Wenjie Zhang, Xuemin Lin
    In SIGMOD Record, 50(1):33-40, 2021, Special Issue on the 2021 ACM SIGMOD Research Highlight Award.
    Paper

  3. On Analyzing Graphs with Motif-Paths
    Xiaodong Li, Reynold Cheng, Kevin Chen Chuan Chang, Caihua Shan, Chenhao Ma, Hongtai Cao
    In Proceedings of the VLDB Endowment (PVLDB), 14(6): 1111-1123, 2021.
    Paper

In the Year of 2020
  1. Efficient Algorithms for Densest Subgraph Discovery on Large Directed Graphs
    Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks V.S. Lakshmanan, Wenjie Zhang, Xuemin Lin
    In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data (SIGMOD), pages 1051-1066, 2020.
    One of four Best of SIGMOD 2020, rate: 4/458.
    Paper

In the Year of 2019
  1. Motif Paths: A New Approach for Analysing Higher-order Semantics between Graph Nodes
    Xiaodong Li, Tse Nam Chan, Reynold Cheng, Caihua Shan, Chenhao Ma, Kevin Chang
    In HKU Technical Reports, 2019.

  2. LINC: A Motif Counting Algorithm for Uncertain Graphs
    Chenhao Ma, Reynold Cheng, Laks V.S. Lakshmanan, Tobias Grubenmann, Yixiang Fang, Xiaodong Li
    In Proceedings of the VLDB Endowment (PVLDB), 13(2): 155-168, 2019.
    Paper

Professional Services

  • Conference PC Members (or Reviewers):
    VLDB: 2025
    KDD: 2024, 2025
    ICDE: 2025
    The Web Conference (WWW): 2024, 2025
    IEEE ICDE TKDE poster track: 2023
    SIGIR Resource paper track: 2023, 2024
    NeurIPS Dataset & Benchmark track: 2023, 2024
    ACM CIKM: 2022, 2023, 2024 (SPC),
    IEEE Big Data: 2024
    VLDB Phd Workshop: 2022
    VLDB Tutorials: 2023, 2024
    ACM SIGMOD Reproducibility, The Availability Committee: 2022, 2023
    ACM WSDM Demo Track: 2023
    ACM DASFAA Demo Track: 2023, 2024
    APWeb-WAIM: 2024
    International Conference on Data Storage and Data Engineering: 2023

  • Conference organizer:
    Session chair, IEEE International Conference on Data Engineering (ICDE): 2023
    Session chair, International Conference on Very Large Data Bases (VLDB): 2023

  • Reviewers for Journals:
    IEEE Transactions on Knowledge and Data Engineering (TKDE)
    The International Journal on Very Large Data Bases (VLDBJ)
    ACM Transactions on the Web (TWEB)
    Pattern Recognition (PR)
    Knowledge and Information Systems (KAIS)
    Information Sciences
    Information Processing and Management (IPM)
    Expert Systems With Applications (ESWA)
    ACM Transactions Management Information Systems (TMIS)
    IEEE Transactions on Computational Social Systems (TCSS)
    International Journal of Intelligent Systems
    Journal of Graph Theory
    BMC Medical Informatics and Decision Making
    Applied Sciences
    Sustainability
    Big Data and Cognitive Computing

Selected Honors and Awards

  • 2021 ACM SIGMOD Research Highlight Award, Jun 2021

  • Hong Kong and China Gas Company Limited Postgradaute Scholarship, 2019-2020

  • One of four Best of SIGMOD 2020

  • Reaching Out Award, Hong Kong SAR Government, 2019

  • HKU Postgraduate Scholarship, 2017-2021

  • Gold Medal, ACM-ICPC Asia Regional Contest, Changchun, Oct 2015

  • National Scholarship, Nov 2014, Nov 2015

Working and Education Experience

  • 2022.8-now: Assistant Professor, The Chinese University of Hong Kong, Shenzhen, China

  • 2021.9-2022.8: Postdoctoral Fellow, The University of Hong Kong, Hong Kong SAR (China)

  • 2017.9-2021.8: Ph.D., The University of Hong Kong, Hong Kong SAR (China)

  • 2013.9-2017.7: B.Eng., Shandong University, China

Teaching Experience

  • CSC3170: Database System, Instructor

    • 2024 Fall

  • CSC2003: Introduction to Java Programming, Instructor

    • 2024 Spring

  • CSC4008: Techniques for Data Mining, Instructor

    • 2023 Spring, 2023 Fall

  • CSC1003: Introduction to Computer Science and Java Programming, Instructor

    • 2022 Fall

  • The Age of Big Data, 2018 Spring, Teaching Assistant

  • Introduction to Database Management Systems, 2018 Fall, Teaching Assistant

  • Advanced Database Systems, 2019 Fall, Teaching Assistant

  • Big Data Management, 2022 Spring, Teaching Assistant