- 👋 Hi, I’m @CSfufu
- I am currently focus on VLM Agentic reasoning and Reinforcement Learning.
Highlights
- Pro
Pinned Loading
-
XiaoYee/Awesome_Efficient_LRM_Reasoning
XiaoYee/Awesome_Efficient_LRM_Reasoning Public😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
-
Revisual-R1
Revisual-R1 Public[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement l…
-
hiyouga/EasyR1
hiyouga/EasyR1 PublicEasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
-
shawn0728/ARES
shawn0728/ARES Public[ICLR 2026]🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth…
-
verl-project/verl
verl-project/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
-
shawn0728/Unify-Agent
shawn0728/Unify-Agent Public🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.
If the problem persists, check the GitHub status page or contact support.

