-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
Description
Hi, I am trying to reproduce the CoMAS training process using Qwen2.5-3B-Instruct as the base model. The environment setup (Ray, VLLM, DeepSpeed) is successful, and the model weights are loaded correctly.
However, during the Simulating phase (rollout), I am encountering frequent parsing errors related to the scorer's response. It seems the scorer agent is either returning None or generating text that cannot be parsed as an integer.
Error Logs
Here are the specific error logs appearing in the console during Simulating round:
Failed to parse the scorer response: 'NoneType' object has no attribute 'text'
Failed to parse the scorer response: 'NoneType' object has no attribute 'text'
Failed to parse the scorer response: invalid literal for int() with base 10: '" and "'
Failed to parse the scorer response: 'NoneType' object has no attribute 'text'
Simulating round 2 out of 2...
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels