YaGuang Li (李亚光)
Biography
YaGuang Li is a Principal Research Scientist at Google DeepMind, working on Gemini DeepThink and the Gemini reasoning stack, with a focus on post-training and efficient reinforcement learning (RL) scaling.
He co-led the finetuning and post-training efforts of Gemini 1.5 and Gemini 1.0 for Gemini Advanced.
He also contributed to LaMDA and PaLM-2 across pretraining, instruction tuning, and efficient serving.
He earned his Ph.D. in Computer Science from the
University of Southern California, advised by Prof.
Cyrus Shahabi and Prof. Yan Liu. His research spans large language models and deep learning on graphs, with applications in spatiotemporal forecasting and relational inference.
Experience
-
Principal Research Scientist
Apr 2026 - Present
Gemini DeepThink and RL scaling.
Senior Staff Research Scientist
May 2024 - Apr 2026
RL scaling for advanced reasoning and competition-level problem solving.
Staff Research Engineer
May 2023 - Apr 2024
Co-led Gemini 1.0 and Gemini 1.5 fine-tuning.
-
Senior Research Engineer, Google Brain
Apr 2021 - Apr 2023
Led serving and efficiency work for LaMDA and Bard; contributed to LaMDA and PaLM 2 pretraining.
Research Engineer
Aug 2019 - Mar 2021
AutoML for time-series forecasting.
Selected Publications
Technical Reports and Preprints
-
Towards Autonomous Mathematics Research
Tony Feng,
Trieu H. Trinh,
Garrett Bingham, Dawsen Hwang, Yuri Chervonyi, Junehyuk Jung, Joonkyung Lee, Carlo Pagano,
Sang-hyun Kim, Federico Pasqualotto, Sergei Gukov,
Jonathan N. Lee, Junsu Kim, Kaiying Hou,
Golnaz Ghiasi, Yi Tay,
YaGuang Li, Chenkai Kuang, Yuan Liu, Hanzhao Lin, Evan Zheran Liu, Nigamaa Nayakanti, Xiaomeng Yang,
Heng-Tze Cheng,
Demis Hassabis, Koray Kavukcuoglu,
Quoc V. Le, Thang Luong
arXiv Preprint, 2026
-
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Gemini Team, Google
arXiv Preprint, 2025
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Gemini Team, Google, YaGuang Li (Lead, Finetuning - Text)
Preprint, 2024
-
Gemini: A Family of Highly Capable Multimodal Models
Gemini Team, Google, YaGuang Li (Co-Lead, Gemini App)
Preprint, 2023
-
PaLM-2 Technical Report
PaLM 2 Team, Google
arXiv Preprint, 2023
-
LaMDA: Language models for dialog applications
LaMDA Team, Google
arXiv Preprint, 2022
Refereed Papers
-
Controlled Decoding from Language Models
Sidharth Mudgal, Jong Lee, Harish Ganapathy, YaGuang Li, Tao Wang, Yanping Huang, Zhifeng Chen,
Heng-Tze Cheng,
Michael Collins,
Trevor Strohman, Jilin Chen, Alex Beutel, Ahmad Beirami
International Conference on Machine Learning, (ICML) 2024
-
HyperPrompt: Prompt-based Task-Conditioning of Transformers
Yun He, Steven Zheng, Yi Tay, Jai Gupta,
Yu Du, Vamsi Aribandi, Zhe Zhao, YaGuang Li,
Zhao Chen, Donald Metzler, Heng-Tze Cheng,
Ed H Chi
International Conference on Machine Learning, (ICML) 2022
-
Predicting Origin-Destination Flow via Multi-Perspective Graph Convolutional Network
Hongzhi Shi, Quanming Yao, Qi Guo,
YaGuang Li, Lingyu Zhang, Jieping Ye,
Yong Li,
Yan Liu
IEEE International Conference on Data Engineering, (ICDE), 2020 (Research Track, Short Paper)
-
DETECT: Deep Trajectory Clustering for Mobility-Behavior Analysis
Mingxuan Yue,
YaGuang Li, Haoze Yang, Ritesh Ahuja,
Yao-yi Chiang, and
Cyrus Shahabi
IEEE International Conference on Big Data, (IEEE Bigdata), 2019
-
CoSTCo: A Neural Tensor Completion Model for Sparse Tensors
Hanpeng Liu, YaGuang Li,
Michael Tsang,
Yan Liu
ACM SIGKDD Conference on Knowledge Discovery and Data Mining, (KDD), 2019 (Research Track, Oral Presentation)
[
Paper] [
Video] [
Code]
-
Spatiotemporal
Multi-Graph Convolution Network for Ride-hailing Demand Forecasting
YaGuang Li*, Xu Geng*, Leye Wang, Lingyu Zhang,
Qiang Yang, Jieping Ye and
Yan Liu (*Equal Contribution)
The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), 2019 (Oral Presentation)
[
Paper] [
Poster] [
Slides]
-
Multi-task Representation Learning for
Travel Time Estimation
YaGuang Li, Kun Fu, Zheng Wang, Cyrus
Shahabi, Jieping Ye and Yan Liu
ACM SIGKDD Conference on Knowledge Discovery and Data Mining, (KDD), 2018 (Research Track)
[
Paper] [
Video] [
Poster]
-
Diffusion Convolutional Recurrent Neural
Network: Data-Driven Traffic Forecasting
YaGuang Li, Rose Yu, Cyrus Shahabi and
Yan Liu
International Conference on Learning Representations, (ICLR), 2018
[
Paper] [
Code] [
Slides] [
Poster]
-
Exploiting
Spatiotemporal Patterns for Accurate Air Quality Forecasting using Deep
Learning
Yijun Lin, Nikhit Mago, Yu Gao, YaGuang Li, Yao-Yi Chiang, José Luis Ambite and Cyrus Shahabi.
ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL), 2018
[
Paper]
-
Deep
Learning: A Generic Approach for Extreme Condition Traffic Forecasting
YaGuang Li*, Rose Yu*, Cyrus Shahabi, Ugur Demiryurek and Yan Liu (*Equal Contribution)
Proceedings of the 2017 SIAM International Conference on Data Mining (SDM), 2017 (Best paper Nomination)
[
Paper]
Patents and Applications
-
Prompt Complexity for Large Language Models
Google LLC
US Patent Application, US20250086405A1, 2025
-
Efficient Training and Utilization of Large Language Models
Google LLC
US Patent Application, US20250045534A1, 2025
-
Large Language Model (LLM) Quantization
Google LLC
US Patent Application, US20240428006A1, 2024
-
Blockwise Controlled Decoding of Natural Language Output Generated Using a Large Language Model
Google LLC
US Patent Application, US20240330334A1, 2024
-
Generating Multi-modal Response(s) Through Utilization of Large Language Model(s)
Google LLC
US Patent, US11907674B1, 2024
-
Regression and Time Series Forecasting
Google LLC
US Patent Application, US20220383145A1, 2022