Chao Zhang
Table of Contents
Quick Links
![]() |
Assistant Professor School of Computational Science and Engineering College of Computing Georgia Institute of Technology Office: CODA 1309 Address: 756 W Peachtree St NW, Atlanta, GA 30308 Email: chaozhang@gatech.edu |
Research
My research aims to design machine learning and data-driven models to solve practical and challenging problems in science and engineering. I am particularly interested in the following topics:
- Learning from limited supervision: weak supervision, self supervision, unsupervised learning
- Uncertainty in ML: uncertainty quantification, active learning, decision making under uncertainty
- Spatiotemporal modeling: spatial graphs, time series, dynamics, 3D molecular data
- Generation and inverse design: learning for optimization, generative models, network design
- Knowledge extraction and NLP: information extraction, multi-modal extraction, LM pre-training & fine-tuning
On the data side, I work a lot with spatiotemporal data, text data, and graph data. On the application side, I am passionate about interdisciplinary research and enjoy developing data-driven solutions to accelerate scientific discovery through close collaboration with domain experts. The techniques I develop are motivated by applications in material science, biomedical science, transportation, and public health.
Acknowledgment: My work has been generously supported by research funding/gift from NSF (IIS CAREER-2144338, IIS-2106961, IIS-2008334), ONR MURI , Kolon, HomeDepot, and Adobe. My work has also been recognized by an NSF CAREER Award, a Facebook Faculty Award, an Amazon AWS Machine Learning Research Award, a Google Faculty Research Award, a Kolon Faculty Fellowship, an ACM SIGKDD Dissertation Runner-up Award, and several paper awards from IMWUT (UbiComp), ECML/PKDD, and ML4H.
Projects
Below are the main research projects at my group and some recent representative works:
- Learning with limited supervision: The lack of training data is a major bottleneck for training deep learning models. Our recent works approach this challenge by:
- Fine-tuning pre-trained models with weak supervision:
- Interactive weakly-supervised learning to close the gap between weak & full supervision:
- Uncertainty quantification & exploitation: Uncertainty-aware ML models are key elements to trustworthy AI systems. Unfortunately, many deep learning models produce uncertainty-agnostic point estimates or miscalibrated distributions. We develop techniques to quantify uncertainty for deep learning and exploit uncertainty for downstream decision making:
- Uncertainty quantification:
- When in Doubt: Neural Non-Parametric Uncertainty Quantification for Epidemic Forecasting, NeurIPS 2021
- SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates, ICML 2020
- Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data, EMNLP 2020
- RDeepSense: Reliable Deep Mobile Computing Models with Uncertainty Estimations, UbiComp 2018
- Uncertainty exploitation:
- End-to-end Stochastic Programming with Energy-based Model, NeurIPS 2022
- Uncertainty quantification:
- Spatiotemporal dynamics modeling: Space and time are two most important dimensions in many science and engineering applications. We develop deep learning models that can model and predict the dynamics of spatial systems, at both macro-level (e.g., transportation, epidemiology) and micro-level (e.g., molecular dynamics).
- Spatiotemporal graphs:
- Time series modeling:
- Generation and inverse design: We study how to guide deep generative models to design samples that satisfy pre-determined properties.
- End-to-end Stochastic Programming with Energy-based Model, NeurIPS 2022
- Transformer-Based Neural Text Generation with Syntactic Guidance, preprint
- A Gradual, Semi-Discrete Approach to Generative Network Training via Explicit Wasserstein Minimization, ICML 2019
- Knowledge extraction and NLP: We design ML models that can automatically extract information about entities, relations, taxonomies, and events from massive unstructured data (e.g., scientific literature). We focus on tackling the challenges of low-resource information extraction and multi-modal information extraction.
- Sparse Conditional Hidden Markov Model for Weakly Supervised Named Entity Recognition, KDD 2022
- BERTifying Hidden Markov Models for Multi-Source Weakly Supervised Named Entity Recognition, ACL 2021
- BOND: Bert-Assisted Open-Domain Named Entity Recognition with Distant Supervision, KDD 2020
- STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths, KDD 2020
- TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering, KDD 2018
Publications
(* denotes equal contribution)
2022
- End-to-end Stochastic Optimization with Energy-based Model
Lingkai Kong, Jiaming Cui, Yuchen Zhuang, Rui Feng, B. Aditya Prakash, Chao Zhang
Annual Conference on Neural Information Processing Systems (NeurIPS), 2022
(Selected as Oral) - UnfoldML: Cost-Aware and Uncertainty-Based Dynamic 2D Prediction for Multi-Stage Classification
Yanbo Xu, Alind Khare, Glenn Matlin, Monish Ramadoss, Rishikesan Kamaleswaran, Chao Zhang, Alexey Tumanov
Annual Conference on Neural Information Processing Systems (NeurIPS), 2022 - Shift-Robust Node Classification via Graph Clustering Co-training
Qi Zhu, Chao Zhang, Chanyoung Park, Carl Yang, Jiawei Han
NeurIPS GLFrontiers Workshop, 2022 - Sparse Conditional Hidden Markov Model for Weakly Supervised Named Entity Recognition
Yinghao Li, Le Song, Chao Zhang
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2022 - Adaptive Multi-view Rule Discovery for Weakly-Supervised Compatible Products Prediction
Rongzhi Zhang, Rebecca West, Xiquan Cui, Chao Zhang
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2022 - CAMUL: Calibrated and Accurate Multi-view Time-Series Forecasting
Harshavardhan Kamarthi, Lingkai Kong, Alexander Rodríguez, Chao Zhang and B. Aditya Prakash
The Web Conference (WWW), 2022 - Precise Clinical Predictions via Counterfactual and Factual Reasoning over Hypergraphs of Electronic Health Records
Ran Xu, Yue Yu, Chao Zhang, Mohammed K Ali, Joyce Ho, Carl Yang
Machine Learning for Health (ML4H), 2022
(Outstanding Paper Award) - Rule-Enhanced Active Learning for Semi-Automated Weak Supervision
David Kartchner, Davi Nakajima An, Wendi Ren, Chao Zhang, Cassie S. Mitchell
AI 3(1), 211-228, 2022 - PRBoost: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning
Rongzhi Zhang, Yue Yu, Shetty Pranav, Le Song and Chao Zhang
Annual Meeting of the Association for Computational Linguistics (ACL), 2022. - ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select
Yuchen Zhuang, Yinghao Li, Junyang Zhang, Yue Yu, Yingjun Mou, Xiang Chen, Le Song and Chao Zhang
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 - COCO-DR: Combating the Distribution Shift in Zero-Shot Dense Retrieval with Contrastive and Distributional Robust Learning
Yue Yu, Chenyan Xiong, Si Sun, Chao Zhang and Arnold Overwijk
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 - CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data
Rui Feng, Chen Luo, Qingyu Yin, Bing Yin, Tuo Zhao, Chao Zhang
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022 - AcTune: Uncertainty-Aware Active Self-Training for Active Fine-Tuning of Pretrained Language Models
Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022 - Self-Training with Differentiable Teacher
Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha
Findings of Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-Findings), 2022
2021
- When in Doubt: Neural Non-Parametric Uncertainty Quantification for Epidemic Forecasting
Harshavardhan Kamarthi, Lingkai Kong, Alexander Rodríguez, Chao Zhang, B. Aditya Prakash
Annual Conference on Neural Information Processing Systems (NeurIPS), 2021 - Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization
Qi Zhu, Carl Yang, Yidan Xu, Haonan Wang, Chao Zhang, and Jiawei Han
Annual Conference on Neural Information Processing Systems (NeurIPS), 2021 - BERTifying Hidden Markov Models for Multi-Source Weakly Supervised Named Entity Recognition
Yinghao Li, Pranav Shetty, Lucas Liu, Chao Zhang, Le Song
Annual Meeting of the Association for Computational Linguistics (ACL), 2021 - Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
Yue Yu*, Simiao Zuo*, Haoming Jiang, Wendi Ren, Tuo Zhao, Chao Zhang
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021 - Learning from Language: Low-shot Named Entity Recognition via Decomposed Framework
Yaqing Wang, Haoda Chu, Chao Zhang, Jing Gao
Findings of Conference on Empirical Methods in Natural Language Processing (EMNLP-Findings), 2021 - Semantics-Aware Hidden Markov Model for Human Mobility
Hongzhi Shi, Yong Li, Hancheng Cao, Xiangxin Zhou, Chao Zhang, Vassilis Kostakos
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021. - Supervised Machine Learning-based Wind Prediction to Enable Real-Time Flight Path Planning
Jung-Hyun Kim, Chao Zhang, Simon I. Briceno and Dimitri N. Mavris
AIAA Scitech Forum, 2021 - SumGNN: Multi-typed Drug Interaction Prediction via Efficient Knowledge Graph Summarization
Yue Yu*, Kexin Huang*, Chao Zhang, Lucas M. Glass, Jimeng Sun, Cao Xiao
Bioinformatics, 2021
2020
- T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction
Ling Zhao, Yujiao Song, Chao Zhang, Yu Liu, Pu Wang, Tao Lin, Min Deng, Haifeng Li.
IEEE Transactions on Intelligent Transportation Systems (T-ITS), 21(9), 3848–3858, 2020 - A Linear Time Approach to Computing Time Series Similarity based on Deep Metric Learning
Di Yao, Gao Cong, Chao Zhang, Xuying Meng, Rongchang Duan, Jingping Bi
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020 - SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates
Lingkai Kong, Jimeng Sun, Chao Zhang.
International Conference on Machine Learning (ICML), 2020 - STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths
Yue Yu, Yinghao Li, Jiaming Shen, Hao Feng, Jimeng Sun and Chao Zhang.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2020 - BOND: Bert-Assisted Open-Domain Named Entity Recognition with Distant Supervision
Chen Liang*, Yue Yu*, Haoming Jiang, Siawpeng Er, Ruijia Wang, Tuo Zhao and Chao Zhang
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2020 - LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values
Kejing Yin, Ardavan Afshar, Joyce Ho, William Cheung, Chao Zhang and Jimeng Sun
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2020 - Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Yu Meng, Yunyi Zhang, Jiaxin Huang, Yu Zhang, Chao Zhang and Jiawei Han
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2020 - paper2repo: GitHub Repository Recommendation for Academic Papers
Huajie Shao, Dachun Sun, Jiahao Wu, Zecheng Zhang, Aston Zhang, Shuochao Yao, Shengzhong Liu, Tianshi Wang, Chao Zhang and Tarek Abdelzaher.
The Web Conference (WWW), 2020 - Discriminative Topic Mining via Category-Name Guided Text Embedding
Yu Meng, Jiaxin Huang, Guangyuan Wang, Zihan Wang, Chao Zhang, Yu Zhang and Jiawei Han.
The Web Conference (WWW), 2020 - ReGAL: Rule-Generative Active Learning for Model-in-the-Loop Weak Supervision
David Kartchner, Wendi Ren, Davi Nakajima An, Chao Zhang, Cassie Mitchell.
NeurIPS 2020 HAMLETS workshop on Human and Model in the Loop Evaluation and Training Strategies - Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong, Haoming Jiang, Yuchen Zhuang, Jie Lyu, Tuo Zhao and Chao Zhang.
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020 - Text Classification Using Label Names Only: A Language Model Self-Training Approach
Yu Meng, Yunyi Zhang, Jiaxin Huang, Chenyan Xiong, Heng Ji, Chao Zhang, Jiawei Han.
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020 - SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup
Rongzhi Zhang, Yue Yu and Chao Zhang.
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020 - Denoising Multi-Source Weak Supervision for Neural Text Classification
Wendi Ren, Yinghao Li, Hanting Su, David Kartchner, Cassie Mitchell, and Chao Zhang.
Findings of Conference on Empirical Methods in Natural Language Processing (EMNLP-Findings), 2020 - Joint Aspect-Sentiment Analysis with Minimal User Guidance
Honglei Zhuang, Fang Guo, Chao Zhang, Liyuan Liu and Jiawei Han.
ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020.
2019
- Multidimensional Mining of Massive Text Data
Chao Zhang, Jiawei Han.
Morgan & Claypool Publishers, 2019 - Spherical Text Embedding
Yu Meng, Jiaxin Huang, Guangyuan Wang, Chao Zhang, Honglei Zhuang, Lance Kaplan, Jiawei Han.
Annual Conference on Neural Information Processing Systems (NeurIPS), 2019 - State-Sharing Sparse Hidden Markov Models for Personalized Sequences
Hongzhi Shi, Chao Zhang, Mingquan Yao, Yong Li, Funing Sun, Depeng Jin.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2019 - TopicMine: User-Guided Topic Mining by Category-Oriented Embedding
Yu Meng, Jiaxin Huang, Zihan Wang, Chenyu Fan, Guangyuan Wang, Chao Zhang, Jingbo Shang, Lance Kaplan, Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2019
(Demo) - CubeNet: Multi-Facet Hierarchical Heterogeneous Network Construction, Analysis, and Mining
Carl Yang, Dai Teng, Siyang Liu, Sayantani Basu, Jieyu Zhang, Jiaming Shen, Chao Zhang, Jingbo Shang, Lance Kaplan, Timothy Harratty, and Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2019
(Demo) - A Gradual, Semi-Discrete Approach to Generative Network Training via Explicit Wasserstein Minimization
Yucheng Chen, Matus Telgarsky, Chao Zhang, Bolton Bailey, Daniel Hsu, Jian Peng.
International Conference on Machine Learning (ICML), 2019 - Weakly-Supervised Hierarchical Text Classification
Yu Meng, Jiaming Shen, Chao Zhang, Jiawei Han.
AAAI Conference on Artificial Intelligence (AAAI), 2019 - Computing Trajectory Similarity in Linear Time: A Generic Seed-Guided Neural Metric Learning Approach
Di Yao, Gao Cong, Chao Zhang, Jingping Bi.
IEEE International Conference on Data Engineering (ICDE), 2019 - DPLink: User Identity Linkage via Deep Neural Network From Heterogeneous Mobility Data
Jie Feng, Mingyang Zhang, Huandong Wang, Zeyu Yang, Chao Zhang, Yong Li, Depeng Jin.
The Web Conference (WWW), 2019 - GeoAttn: Localization of Social Media Messages Via Attentional Memory Network
Sha Li, Chao Zhang, Dongming Lei, Ji Li, Jiawei Han.
SIAM International Conference on Data Mining (SDM), 2019 - Semantics-Aware Hidden Markov Model for Human Mobility
Hongzhi Shi, Hancheng Cao, Xiangxin Zhou, Yong Li, Chao Zhang, Vassilis Kostakos, Funing Sun, Fanchao Meng.
SIAM International Conference on Data Mining (SDM), 2019
2018
- Multi-Dimensional Mining of Unstructured Data with Limited Supervision
Chao Zhang
Ph.D. Thesis
(ACM SIGKDD 2019 Dissertation Runner-up Award) - TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering
Chao Zhang, Fangbo Tao, Xiusi Chen, Jiaming Shen, Meng Jiang, Brian Sadler, Michelle Vanni, Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2018
(Code) (Data) - HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion
Jiaming Shen, Zeqiu Wu, Dongming Lei, Chao Zhang, Xiang Ren, Michelle T. Vanni, Brian M. Sadler, Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2018 - Easing Embedding Learning by Comprehensive Transcription of Heterogeneous Information Networks
Yu Shi, Qi Zhu, Fang Guo, Chao Zhang, Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2018 - Towards Multidimensional Analysis of Text Corpora
Jingbo Shang, Chao Zhang, Jiaming Shen, Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2018
(Tutorial) - DeepMove: Predicting Human Mobility with Attentional Recurrent Networks
Jie Feng, Yong Li, Chao Zhang, Funing Sun, Fanchao Meng, Ang Guo, Depeng Jin.
The International World Wide Web Conference (WWW), 2018
(Code & Data) - Weakly-Supervised Neural Text Classification
Yu Meng, Jiaming Shen, Chao Zhang, Jiawei Han.
ACM International Conference on Information and Knowledge Management (CIKM), 2018
(Code) - Open-Schema Event Profiling for Massive News Corpora
Quan Yuan, Xiang Ren, Wenqi He, Chao Zhang, Xinhe Geng, Lifu Huang, Heng Ji, Chin-Yew Lin, Jiawei Han.
ACM International Conference on Information and Knowledge Management (CIKM), 2018 - Spatiotemporal Activity Modeling Under Data Scarcity: A Graph-Regularized Cross-Modal Embedding Approach
Chao Zhang, Mengxiong Liu, Zhengchao Liu, Carl Yang, Luming Zhang, and Jiawei Han.
AAAI Conference on Artificial Intelligence (AAAI), 2018 - A Spherical Hidden Markov Model for Semantics-Rich Human Mobility Modeling
Wanzheng Zhu +, Chao Zhang +, Shuochao Yao, Xiaobin Gao, and Jiawei Han.
AAAI Conference on Artificial Intelligence (AAAI), 2018 - Doc2Cube: Allocating Documents to Text Cube without Labeled Data
Fangbo Tao +, Chao Zhang +, Xiusi Chen, Meng Jiang, Tim Hanratty, Lance Kaplan, Jiawei Han.
IEEE International Conference on Data Mining (ICDM), 2018
(Code) - RDeepSense: Reliable Deep Mobile Computing Models with Uncertainty Estimations
Shuochao Yao, Yiran Zhao, Huajie Shao, Aston Zhang, Chao Zhang, Shen Li, and Tarek Abdelzaher.
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 2018 - SenseGAN: Enabling Deep Learning for Internet of Things with a Semi-Supervised Framework
Shuochao Yao, Yiran Zhao, Huajie Shao, Chao Zhang, Aston Zhang, Shaohan Hu, Dongxin Liu, Shengzhong Liu, and Tarek Abdelzaher.
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 2018
(Distinguished Paper Award) - Deep Learning for the Internet of Things
Shuochao Yao, Yiran Zhao, Aston Zhang, Huajie Shao, Chao Zhang, Lu Su, Tarek Abdelzaher.
IEEE Computer, 2018 - GeoBurst+: Effective and Real-Time Local Event Detection in Geo-Tagged Tweet Streams
Chao Zhang, Dongming Lei, Quan Yuan, Honglei Zhuang, Lance Kaplan, Shaowen Wang, Jiawei Han.
ACM Transactions on Intelligent Systems and Technology (TIST), 2018 - Leveraging the Power of Informative Users for Local Event Detection
Hengtong Zhang, Fenglong Ma, Yaliang Li, Chao Zhang, Tianqi Wang, Yaqing Wang, Jing Gao, Lu Su.
IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2018 - Learning deep representation for trajectory clustering
Di Yao, Chao Zhang, Zhihua Zhu, Qin Hu,heng Wang, Jianhui Huang, Jingping Bi.
Expert Systems, 2018. - Did You Enjoy the Ride: Understanding Passenger Experience via Heterogeneous Network Embedding
Carl Yang, Chao Zhang, Jiawei Han, Xuewen Chen, and Jieping Ye.
IEEE International Conference on Data Engineering (ICDE), 2018 - ApDeepSense: Deep Learning Uncertainty Estimation without the Pain for IoT Applications
Shuochao Yao, Yiran Zhao, Huajie Shao, Chao Zhang, Aston Zhang, Dongxin Liu, Shengzhong Liu, Lu Su, Tarek Abdelzaher.
IEEE International Conference on Distributed Computing Systems (ICDCS), 2018 - A Constrained Maximum Likelihood Estimator for Unguided Social Sensing
Huajie Shao, Shuochao Yao, Yiran Zhao, Chao Zhang, Jinda Han, Lance Kaplan, Su Lu, and Tarek Abdelzaher.
IEEE International Conference on Computer Communications (InfoCom), 2018 - Towards Personalized Activity Level Prediction in Community Question Answering Websites
Zhenguang Liu, Yingjie Xia, Qi Liu, Qinming He, Yanxiang Chen, Chao Zhang, and Roger Zimmermann.
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2018
2017
- TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams
Chao Zhang, Liyuan Liu, Dongming Lei, Quan Yuan, Honglei Zhuang, Tim Hanratty, and Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2017
(Slides) (Code) (Video) (Featured by Illinois Innovator) - Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation
Carl Yang, Lanxiao Bai, Chao Zhang, Quan Yuan and Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2017.
(Code & Data) - ReAct: Online Multimodal Embedding for Recency-Aware Spatiotemporal Activity Modeling
Chao Zhang, Keyang Zhang, Quan Yuan, Fangbo Tao, Luming Zhang, Tim Hanratty, and Jiawei Han.
ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2017
(Slides) (Code) (Data) - Regions, Periods, Activities: Uncovering Urban Dynamics via Cross-Modal Representation Learning
Chao Zhang, Keyang Zhang, Quan Yuan, Haoruo Peng, Yu Zheng, Tim Hanratty, Shaowen Wang, and Jiawei Han.
International World Wide Web Conference (WWW), 2017
(Slides) (Code & Data) - Bringing Semantics to Spatiotemporal Data Mining: Challenges, Methods, and Applications
Chao Zhang, Quan Yuan, and Jiawei Han.
IEEE International Conference on Data Engineering (ICDE), 2017
(Tutorial) - PRED: Periodic Region Detection for Mobility Modeling of Social Media Users
Quan Yuan, Wei Zhang, Chao Zhang, Xinhe Geng, Gao Cong, and Jiawei Han.
ACM International Conference on Web Search and Data Mining (WSDM), 2017
(Code & Data) - Towards Space and Time Coupled Social Media Analysis
Chao Zhang, Quan Yuan, Shi Zhi, Sha Li, and Jiawei Han.
2017 ACM International Conference on Information and Knowledge Management (CIKM), 2017
(Tutorial) - Detecting Multiple Periods and Periodic Patterns in Event Time Sequences
Quan Yuan, Jingbo Shang, Xin Cao, Chao Zhang, Xinhe Geng, Jiawei Han.
ACM International Conference on Information and Knowledge Management (CIKM), 2017 - SERM: A Recurrent Model for Next Location Prediction in Semantic Trajectories
Di Yao, Chao Zhang, Jianhui Huang, and Jingping Bi
ACM International Conference on Information and Knowledge Management (CIKM), 2017
(Code & Data) - Urbanity: A System for Interactive Exploration of Urban Dynamics from Streaming Human Sensing Data
Mengxiong Liu, Zhengchao Liu, Chao Zhang, Keyang Zhang, Quan Yuan, Tim Hanratty, and Jiawei Han
ACM International Conference on Information and Knowledge Management (CIKM), 2017
(Demo) - ClaimVerif: A Real-time Claim Verification System Using the Web and Fact Databases
Shi Zhi, Yicheng Sun, Jiayi Liu, Chao Zhang, and Jiawei Han.
ACM International Conference on Information and Knowledge Management (CIKM), 2017 - Trajectory Clustering via Deep Representation Learning
Di Yao, Chao Zhang, Zhihua Zhu, Jianhui Huang, and Jingping Bi.
International Joint Conference on Neural Networks (IJCNN), 2017
(Code) - pg-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data
Julie Yixuan Zhu +, Chao Zhang +, Huichu Zhang, Shi Zhi, Victor O.K. Li, Jiawei Han, and Yu Zheng.
IEEE Transactions on Big Data (TBD), 2017 - Geographical Data Mining
Chao Zhang and Jiawei Han.
The International Encyclopedia of Geography: People, the Earth, Environment and Technology, 2017 - A Survey on Spatiotemporal and Semantic Data Mining
Quan Yuan, Chao Zhang, Jiawei Han.
Trends in Spatial Analysis and Modelling, Springer, 2017
Earlier
- GMove: Group-Level Mobility Modeling Using Geo-Tagged Social Media
Chao Zhang, Keyang Zhang, Quan Yuan, Luming Zhang, Tim Hanratty, and Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2016
(Slides) (Code & Data) - GeoBurst: Real-Time Local Event Detection in Geo-Tagged Tweet Streams
Chao Zhang, Guangyu Zhou, Quan Yuan, Honglei Zhuang, Yu Zheng, Lance Kaplan, Shaowen Wang, Jiawei Han.
ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2016
(Slides) (Code & Data) - Mining Contiguous Sequential Generators in Biological Sequences
Jingsong Zhang, Yinglin Wang, Chao Zhang, and Yongyong Shi
Transactions on Computational Biology and Bioinformatics (TCBB), 13(5): 855–867, 2016 - Assembler: Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data
Chao Zhang, Yu Zheng, Xiuli Ma, Jiawei Han.
ACM SIGKDD Conference on Knowledge Discovery and Pattern Mining (KDD), 2015 - Fast Inbound Top-K Query for Random Walk with Restart
Chao Zhang, Shan Jiang, Yucheng Chen, Yidan Sun, Jiawei Han.
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), 2015
(Best Student Paper Runner-up Award) - StreamCube: Hierarchical Spatio-temporal Hashtag Clustering for Event Exploration over the Twitter Stream
Wei Feng, Chao Zhang, Wei Zhang, Jiawei Han, Jianyong Wang, Charu Aggarwal, Jianbin Huang.
IEEE International Conference on Data Engineering (ICDE), 2015 - Splitter: Mining Fine-Grained Sequential Patterns in Semantic Trajectories
Chao Zhang, Jiawei Han, Lidan Shou, Jiajun Lu, Thomas La Porta.
International Conference on Very Large Data Bases (VLDB), 2014
(Slides) (Code & Data) - Trendspedia: An Internet Observatory for Analyzing and Visualizing the Evolving Web
Wei Kang, Anthony K. H. Tung, Wei Chen, Xinyu Li, Qiyue Song, Chao Zhang, Feng Zhao, Xiajuan Zhou.
IEEE International Conference on Data Engineering (ICDE), 2014 - Supporting Pattern-Preserving Anonymization for Time-Series Data
Lidan Shou, Xuan Shang, Ke Chen, Gang Chen, Chao Zhang.
IEEE Transactions on Knowledge and Data Engineering (TKDE), 25(4): 877-892, 2013 - Evaluating Geo-Social Influence in Location-Based Social Networks
Chao Zhang, Lidan Shou, Ke Chen, Gang Chen, Yijun Bei.
ACM International Conference on Information and Knowledge Management (CIKM), 2012 - See-To-Retrieve: Efficient Processing of Spatio-Visual Keyword Queries
Chao Zhang, Lidan Shou, Ke Chen, Gang Chen.
ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2012 - What-You-Retrieve-Is-What-You-See: A Preliminary Cyber-Physical Search Engine
Lidan Shou, Ke Chen, Gang Chen, Chao Zhang, Yi Ma, Xian Zhang.
ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2011
Awards
- 2022 ML4H Outstanding Paper Award
- 2022 NSF Career Award
- 2021 Facebook Faculty Research Award
- 2021 Kolon Faculty Fellowship
- 2020 Amazon AWS Machine Learning Research Award
- 2020 Google Faculty Research Award
- 2019 ACM SIGKDD Dissertation Award Runner-up
- 2018 ACM IMWUT Distinguished Paper Award
- 2015 ECML/PKDD Best Student Paper Runner-up Award
- 2013 Chiang Chen Overseas Graduate Fellowship
Software
- SDE-Net: Efficient uncertainty estimation for deep neural networks
- CHMM: BERT-conditional hidden Markov model for multi-source weakly-supervised learning
- COSINE: Language model fine-tuning with weak supervision
- BOND: Distantly-supervised named entity recognition
- STEAM: Automatic taxonomy expansion
- TaxoGen: Unsupervised topic taxonomy construction from text corpus
- WestClass: Weakly-supervised text classification
- GeoBurst: Unsupervised spatiotemporal event detection
Teaching
- 2023 Spring: CX4240: Introduction to Computational Data Analysis
- 2022 Spring: CX4240: Introduction to Computational Data Analysis
- 2021 Fall: CSE8803-DLT: Deep Learning for Text Data
- 2021 Spring: CX4240: Introduction to Computational Data Analysis
- 2020 Fall: CSE8803-DLT: Deep Learning for Text Data
- 2020 Spring: CX4240: Introduction to Computational Data Analysis
- 2019 Fall: CSE8803-DLT: Deep Learning for Text Data
- 2019 Spring: CX4240: Introduction to Computational Data Analysis
Students
Prospective students: I am always looking for strong and motivated students to join our group. If you are interested in working with me, you can either email me or fill out this form.
Current:
- Rui Feng: Ph.D. Student in CS
- Lingkai Kong: Ph.D. Student in CSE
- Yinghao Li: Ph.D. Student in ML
- Haorui Wang: Ph.D. Student in CSE
- Kuan Wang: Ph.D. Student in CSE
- Yue Yu: Ph.D. Student in CSE
- Rongzhi Zhang: Ph.D. Student in ML
- Yuchen Zhuang: Ph.D. Student in ML
- Binghong Chen: Ph.D. Student in CSE (co-advised with Prof. Le Song)
- Pranav Shetty: Ph.D. Student in ML (JP Morgan AI Ph.D. Fellowship, co-advised with Prof. Rampi Ramprasad)
- Yanbo Xu: Ph.D. Student in ML (co-advised with Prof. Alexey Tumanov)
- Vidit Jain: M.S. Student in CS
- Mukund Rungta: M.S. Student in CS
- Junyang Zhang: B.S. Student in CS
Alumni:
- Piyush Patil: M.S. Student in CS
- Mengyang Liu: M.S. Student in CSE
- Isaac Rehg: M.S. in CS
- Wendi Ren: M.S. in CSE
- Ruijia Wang: M.S. in CSE
- Yi Rong: Visiting Ph.D. Student