arxiv:2509.15207
zhu xuekai
AI & ML interests: None yet
Recent Activity
upvoted a paper 24 days ago
P1: Mastering Physics Olympiads with Reinforcement Learning
commented on a paper about 2 months ago
FlowRL: Matching Reward Distributions for LLM Reasoning
updated a model about 2 months ago
xuekai/FlowRL-DeepSeek-7B-code