SAGE Improves GRPO Under Sparse Rewards | Let's Data Science