Test-Time Training Scaling Laws for Chemical Exploration in Drug Design
Chemical language models (CLMs) leveraging reinforcement learning (RL) have shown promise in de novo molecular design, yet often suffer from mode collapse, limiting their explor...