Publications (11 total)
In-Place Test-Time Training (cited by 0, Semantic Scholar)
Efficient Reasoning for Large Reasoning Language Models via Certainty-Guided Reflection Suppression (cited by 17, Semantic Scholar)
Beyond Online Sampling: Bridging Offline-to-Online Alignment via Dynamic Data Transformation for LLMs (cited by 1, Semantic Scholar)
DPO Meets PPO: Reinforced Token Optimization for RLHF (cited by 115, Semantic Scholar)
Do Efficient Transformers Really Save Computation? (cited by 29, Semantic Scholar)
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation (cited by 25, Semantic Scholar)
How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs (cited by 23, Semantic Scholar)
Research Topics
Topic Modeling (3)
Advanced Graph Neural Networks (3)
Reinforcement Learning in Robotics (2)
Adversarial Robustness in Machine Learning (2)
Explainable Artificial Intelligence (XAI) (2)
Affiliations
Peking University