zhu's picture

5 33 1

zhu

xuekai

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

commented on a paper about 2 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

updated a model about 2 months ago

xuekai/FlowRL-DeepSeek-7B-code

View all activity

Organizations

Papers 16

arxiv:2509.15207

arxiv:2509.09674

arxiv:2509.08827

arxiv:2509.04419

models 3

xuekai/FlowRL-DeepSeek-7B-code

8B • Updated Oct 27 • 7

xuekai/FlowRL-Qwen2.5-32B-math

33B • Updated Oct 27 • 8

xuekai/FlowRL-Qwen2.5-7B-math

8B • Updated Oct 27 • 161

datasets 2

xuekai/flowrl-data-collection

Preview • Updated Sep 28 • 311

xuekai/pad_train

Viewer • Updated Mar 21, 2024 • 184k • 27