Long T. Le

Long T. Le is a Staff Research Engineer in Google Cloud AI Research with the mission to bring advance AI to the world. He's currently focusing in new LLM solution like distillation, RAG, Agent. Before that, he worked on a new deep learning method for tabular data, covid-19 forecasting and recommendation AI. Before joining Google, he was a machine learning engineer in Capital One in NYC. At Capital One, he developed different models in loan optimization and first-party fraud detection. He earned his Ph.D. in computer science from Rutgers University. Before that, he earned a bachelor in computing from National University at Singapore.

Research Areas

Authored Publications

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

Zilong Wang

Zifeng Wang

Long Le

Steven Zheng

Swaroop Mishra

Vincent Perot

Yuwei Zhang

Anush Mattapalli

Ankur Taly

Jingbo Shang

Chen-Yu Lee

Tomas Pfister

ICLR 2025

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

Wenda Xu

Rujun Han

Zifeng Wang

Long Le

Dhruv Madeka

Lei Li

William Wang

Rishabh Agarwal

Chen-Yu Lee

Tomas Pfister

2025

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

Rujun Han

Tomas Pfister

Chen-Yu Lee

Lei Li

Wenda Xu

Long Le

Rishabh Agarwal

William Wang

Dhruv Madeka

Zifeng Wang

ICLR 2025

In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialog Agents

Zhen Tan

Jun Yan

I-Hung Hsu

Rujun Han

Zifeng Wang

Long Le

Yiwen Song

Yanfei Chen

Hamid Palangi

George Lee

Anand Iyer

Tianlong Chen

Huan Liu

Chen-Yu Lee

Tomas Pfister

ACL 2025

Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization

Cheng-Yu Hsieh

Yung-Sung Chuang

Chun-Liang Li

Zifeng Wang

Long Le

Abhishek Kumar

James Glass

Alexander Ratner

Chen-Yu Lee

Ranjay Krishna

Tomas Pfister

2024

CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation

I-Hung Hsu

Zifeng Wang

Long Le

Lesly Miculicich

Nanyun Peng

Chen-Yu Lee

Tomas Pfister

2024

CodecLM: Aligning Language Models with Tailored Synthetic Data

Zifeng Wang

Chun-Liang Li

Vincent Perot

Long Le

Jin Miao

Zizhao Zhang

Chen-Yu Lee

Tomas Pfister

NAACL 2024

A prospective evaluation of AI-augmented epidemiology to forecast COVID-19 in the USA and Japan

Sercan Arik

Joel Shor

Joe Ledsam

Raj Sinha

Jinsung Yoon

Arkady Epshteyn

Ashwin Sura Ravi

Beth Luan

Chun-Liang Li

Daisuke Yoneoka

Dario Sava

Elli Kanal

Hiroaki Miyata

Hiroki Kayama

Isaac Jones

Joe Mckenna

Johan Euphrosine

Kris Popendorf

Leyou Zhang

Long T. Le

Michael W. Dusenberry

Mimi Sun

Nate Yoder

Shashank Singh

Shuhei Nomura

Thomas Tsai

Tomas Pfister

Vikas Menon

npj Digital Medicine (2021)

Interpretable Sequence Learning for Covid-19 Forecasting

Sercan Arik

Chun-Liang Li

Jinsung Yoon

Raj Sinha

Arkady Epshteyn

Long T. Le

Vik Menon

Shashank Singh

Leyou Zhang

Martin Nikoltchev

Yash Kumar Sonthalia

Hootan Nakhost

Elli Kanal

Tomas Pfister

NeurIPS (2020)

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Long T. Le

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Long T. Le

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us