Support Qwen3.5 by chenyushuo · Pull Request #515 · agentscope-ai/Trinity-RFT

chenyushuo · 2026-03-05T06:55:39Z

Description

Support Qwen3.5
Upgrade vllm to 0.17.0 and transformers to 5.3.0.

Checklist

Please check the following items before code is ready to be reviewed.

Code has passed all tests
Docstrings have been added/updated in Google Style
Documentation has been updated
Code is ready for review

gemini-code-assist · 2026-03-05T06:55:42Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

chenyushuo · 2026-03-11T08:01:45Z

/unittest-all

chenyushuo · 2026-03-11T08:03:46Z

/gemini review

github-actions · 2026-03-11T09:41:57Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
261	256	0	5	0	0	1h 37m

Skipped

Tests	Status
tests/common/vllm_test.py::TestTinkerAsyncAPIServer::test_api_async	skipped ⏭️
tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	skipped ⏭️
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	skipped ⏭️
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	skipped ⏭️
tests/utils/swanlab_test.py::TestSwanlabMonitor::test_swanlab_monitor_smoke	skipped ⏭️

Tests

Test Name	Status	Duration
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_batch_level_std_grpo	✅	5ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_batch_level_step_wise_grpo_advantage	✅	3ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_duplicate_grpo	✅	5ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_grpo_advantage	✅	3ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_grpo_correct_bias	✅	2ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_grpo_reward_std	✅	1ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_step_wise_grpo_advantage	✅	2ms
tests/algorithm/advantage_fn_test.py::TestGroupedAdvantageFn::test_step_wise_grpo_with_std_threshold	✅	2ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_abs_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_fallback	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_loss	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_same_policy	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_corrected_k3_with_old_logprob	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_dummy_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_k1_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_k2_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_k3_kl_fn	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_kl_loss_aggregation_modes	✅	1ms
tests/algorithm/kl_fn_test.py::KLFnTest::test_low_var_kl_fn	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_dpo_policy_loss	✅	2ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_gspo_policy_loss	✅	2ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_mix_policy_loss	✅	3ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_opmd_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_ppo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_ppo_policy_loss_with_sequence_masking	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_sapo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_sft_policy_loss	✅	1ms
tests/buffer/experience_pipeline_test.py::TestExperiencePipeline::test_experience_pipeline	✅	10.7s
tests/buffer/experience_pipeline_test.py::TestExperiencePipeline::test_pass_rate_calculation	✅	7.2s
tests/buffer/experience_storage_test.py::ExperienceStorageTest::test_sql_experience_buffer	✅	2.8s
tests/buffer/experience_storage_test.py::ExperienceStorageTest::test_sql_storage_0_sft	✅	4.5s
tests/buffer/experience_storage_test.py::ExperienceStorageTest::test_sql_storage_1_dpo	✅	5.1s
tests/buffer/file_test.py::TestFileBuffer::test_file_reader	✅	416ms
tests/buffer/file_test.py::TestFileBuffer::test_file_writer	✅	1.7s
tests/buffer/formatter_test.py::TestFormatter::test_dpo_messages_formatter	✅	1.4s
tests/buffer/formatter_test.py::TestFormatter::test_dpo_plaintext_formatter	✅	1.3s
tests/buffer/formatter_test.py::TestFormatter::test_multi_modal_sft_formatter	✅	1.7s
tests/buffer/formatter_test.py::TestFormatter::test_sft_messages_formatter	✅	2.7s
tests/buffer/formatter_test.py::TestFormatter::test_sft_plaintext_formatter	✅	2.2s
tests/buffer/formatter_test.py::TestFormatter::test_task_formatter	✅	481ms
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_buffer_reuse	✅	6.6s
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_capacity	✅	2.3s
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_reuse_count_control	✅	4.0s
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_0_queue	✅	3.1s
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_1_priority_queue	✅	3.1s
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_capacity	✅	4.0s
tests/buffer/reader_test.py::TestBufferReader::test_buffer_reader_registration	✅	1.1s
tests/buffer/reward_shaping_mapper_test.py::TestRewardShapingMapper::test_basic_usage	✅	7ms
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_default_queue_default_sample_strategy	✅	2.1s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_default_queue_staleness_control_sample_strategy	✅	1.5s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_priority_queue_default_sample_strategy	✅	1.8s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_priority_queue_staleness_control_sample_strategy	✅	1.7s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_0::test_sql_staleness_control_sample_strategy	✅	4.7s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_default_queue_default_sample_strategy	✅	2.1s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_default_queue_staleness_control_sample_strategy	✅	1.7s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_priority_queue_default_sample_strategy	✅	1.6s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_priority_queue_staleness_control_sample_strategy	✅	1.8s
tests/buffer/sample_strategy_test.py::ExperienceStorageTest_1::test_sql_staleness_control_sample_strategy	✅	4.1s
tests/buffer/sql_test.py::TestSQLBuffer::test_sql_exp_buffer_read_write_0	✅	5.9s
tests/buffer/sql_test.py::TestSQLBuffer::test_sql_exp_buffer_read_write_1	✅	2.8s
tests/buffer/sql_test.py::TestSQLBuffer::test_sql_task_buffer_read_write	✅	3.4s
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_0	✅	84ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_1	✅	65ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_2	✅	114ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_3	✅	112ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_4	✅	113ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_5	✅	117ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_6	✅	130ms
tests/buffer/task_scheduler_test.py::TestTaskScheduler::test_task_scheduler_simple	✅	51ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_0_file	✅	347ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_1_sql	✅	2.7s
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_2_file	✅	42ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_3_sql	✅	3.1s
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_4_file	✅	43ms
tests/buffer/task_storage_test.py::TaskStorageTest::test_read_task_5_sql	✅	3.7s
tests/cli/launcher_test.py::TestLauncherMain::test_debug_mode	✅	1m 3s
tests/cli/launcher_test.py::TestLauncherMain::test_log_mode	✅	152ms
tests/cli/launcher_test.py::TestLauncherMain::test_main_run_command	✅	6.8s
tests/cli/launcher_test.py::TestLauncherMain::test_main_run_in_dlc	✅	1.2s
tests/cli/launcher_test.py::TestLauncherMain::test_main_studio_command	✅	693ms
tests/cli/launcher_test.py::TestLauncherMain::test_multi_stage_run	✅	4.8s
tests/common/config_test.py::TestConfig::test_all_examples_are_valid	✅	21.3s
tests/common/config_test.py::TestConfig::test_chat_template_path	✅	76ms
tests/common/config_test.py::TestConfig::test_config_flatten	✅	32ms
tests/common/config_test.py::TestConfig::test_continue_from_checkpoint_is_valid	✅	161ms
tests/common/config_test.py::TestConfig::test_default_workflow	✅	77ms
tests/common/config_test.py::TestConfig::test_load_default_config	✅	29.5s
tests/common/config_test.py::TestConfig::test_max_token_len_per_gpu_set_correctly	✅	78ms
tests/common/config_test.py::TestConfig::test_optimizer_config_propagation	✅	77ms
tests/common/config_test.py::TestConfig::test_update_config_from_ray_cluster	✅	434ms
tests/common/experience_test.py::TestEID::test_eid_properties	✅	1ms
tests/common/experience_test.py::TestExperience::test_action_mask_and_logprobs_type	✅	1ms
tests/common/experience_test.py::TestExperience::test_assertions	✅	1ms
tests/common/experience_test.py::TestExperience::test_deserialize_legacy_pickle_payload	✅	1ms
tests/common/experience_test.py::TestExperience::test_deserialize_single_rejects_batch_payload	✅	1ms
tests/common/experience_test.py::TestExperience::test_dpo_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_hf_datasets_conversion	✅	14ms
tests/common/experience_test.py::TestExperience::test_multi_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_serialize_deserialize	✅	1ms
tests/common/experience_test.py::TestExperience::test_serialize_many_deserialize_many	✅	1ms
tests/common/experience_test.py::TestExperience::test_serialize_many_with_shared_multimodal_tensor	✅	1ms
tests/common/experience_test.py::TestExperience::test_single_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_to_dict	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_experience_model_experience_conversion	✅	1ms
tests/common/sudoku_test.py::test_9x9_generator_produces_valid_solution	✅	1ms
tests/common/sudoku_test.py::test_9x9_generator_creates_holes	✅	1ms
tests/common/sudoku_test.py::test_9x9_solution_is_fully_filled	✅	1ms
tests/common/sudoku_test.py::test_judge_allows_incomplete_board	✅	1ms
tests/common/sudoku_test.py::test_judge_detects_row_violation	✅	1ms
tests/common/sudoku_test.py::test_judge_detects_column_violation	✅	1ms
tests/common/sudoku_test.py::test_judge_detects_block_violation	✅	1ms
tests/common/sudoku_test.py::test_4x4_generator_produces_valid_solution	✅	1ms
tests/common/sudoku_test.py::test_4x4_solution_is_fully_filled	✅	1ms
tests/common/sudoku_test.py::test_4x4_judge_detects_row_violation	✅	1ms
tests/common/sudoku_test.py::test_4x4_judge_detects_block_violation	✅	1ms
tests/common/vllm_test.py::ModelWrapperTest_0::test_generate	✅	1m 2s
tests/common/vllm_test.py::ModelWrapperTest_1::test_generate	✅	47.4s
tests/common/vllm_test.py::ModelWrapperTest_2::test_generate	✅	42.1s
tests/common/vllm_test.py::TestModelLen_0::test_model_len	✅	28.3s
tests/common/vllm_test.py::TestModelLen_1::test_model_len	✅	24.1s
tests/common/vllm_test.py::TestModelLen_2::test_model_len	✅	31.1s
tests/common/vllm_test.py::TestModelLenWithoutPromptTruncation::test_model_len	✅	28.2s
tests/common/vllm_test.py::TestMessageProcess::test_no_prompt_truncation	✅	27.6s
tests/common/vllm_test.py::TestMessageProcess::test_truncation_status	✅	27.8s
tests/common/vllm_test.py::TestAPIServer::test_api	✅	26.3s
tests/common/vllm_test.py::TestLogprobs::test_logprobs_api	✅	23.7s
tests/common/vllm_test.py::TestAsyncAPIServer::test_api_async	✅	25.5s
tests/common/vllm_test.py::TestTinkerAsyncAPIServer::test_api_async	⏭️	1ms
tests/common/vllm_test.py::TestTokenizer::test_action_mask	✅	540ms
tests/common/vllm_test.py::TestTokenizer::test_action_mask_with_tools	✅	1.0s
tests/common/vllm_test.py::TestAPIServerToolCall_0_deepseek_r1::test_api_tool_calls	✅	32.8s
tests/common/vllm_test.py::TestAPIServerToolCall_1::test_api_tool_calls	✅	25.8s
tests/common/vllm_test.py::TestSuperLongGeneration::test_generate	✅	1m 9s
tests/common/vllm_test.py::TestTinkerAPI::test_tinker_api	✅	42.5s
tests/explorer/explorer_test.py::TestExplorerCountdownEval::test_explorer	✅	1m 44s
tests/explorer/explorer_test.py::TestExplorerEvalDetailedStats::test_explorer	✅	1m 12s
tests/explorer/explorer_test.py::TestExplorerGSM8KRULERNoEval::test_explorer	✅	58.8s
tests/explorer/explorer_test.py::TestExplorerGSM8k::test_explorer	✅	3m 1s
tests/explorer/explorer_test.py::ServeTest::test_serve	✅	56.7s
tests/explorer/proxy_test.py::RecorderTest::test_recorder	✅	82ms
tests/explorer/scheduler_test.py::SchedulerTest::test_async_workflow	✅	5.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_concurrent_operations	✅	5.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_dynamic_timeout	✅	12.9s
tests/explorer/scheduler_test.py::SchedulerTest::test_get_results	✅	29.3s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_0	✅	4.8s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_1	✅	4.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_0	✅	4.8s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_1	✅	4.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_multi_step_execution	✅	5.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_non_repeatable_workflow	✅	5.3s
tests/explorer/scheduler_test.py::SchedulerTest::test_over_rollout_min_wait	✅	13.0s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_all_methods	✅	15.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_restart_after_stop	✅	9.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_split_tasks	✅	8.3s
tests/explorer/scheduler_test.py::SchedulerTest::test_stepwise_experience_eid	✅	25.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all	✅	7.9s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all_timeout_with_multi_batch	✅	14.0s
tests/explorer/scheduler_test.py::TestRunnerStateCollection::test_runner_state_collection	✅	9.8s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_0	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_1	✅	602ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_0	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_1	✅	1.0s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_raise_error	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_stop_at_max_env_steps	✅	1.0s
tests/explorer/workflow_test.py::WorkflowTest::test_gsm8k_workflow	✅	13ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_boxed_workflow	✅	17ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_complex_workflow	✅	783ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_eval_workflow	✅	4ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_fraction_workflow	✅	12ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow	✅	8ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_1	✅	101ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_1	✅	201ms
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_0::test_multi_turn_workflow	✅	20.4s
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_1::test_multi_turn_workflow	✅	20.8s
tests/explorer/workflow_test.py::TestWorkflowStateRecording::test_workflow_state_recording	✅	4.0s
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter_v0	✅	681ms
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter_v1	✅	15ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner	✅	137ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner_get_state	✅	8.1s
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_with_openai	✅	24.5s
tests/explorer/workflow_test.py::TestConcurrentWorkflowRunner::test_concurrent_workflow_runner	✅	42.0s
tests/manager/log_manager_test.py::TestLogManager::test_file_rotation	✅	1ms
tests/manager/log_manager_test.py::TestLogManager::test_init_and_tracking	✅	1ms
tests/manager/log_manager_test.py::TestLogManager::test_keyword_filter_and_search_pattern	✅	1ms
tests/manager/synchronizer_test.py::TestSynchronizerExit_0::test_synchronizer	✅	2m 12s
tests/manager/synchronizer_test.py::TestSynchronizerExit_1::test_synchronizer	✅	2m 28s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_0::test_synchronizer	✅	2m 6s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_1::test_synchronizer	✅	1m 51s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_2::test_synchronizer	✅	2m 6s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_3::test_synchronizer	✅	2m 43s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_4::test_synchronizer	✅	2m 25s
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_5::test_synchronizer	✅	2m 47s
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_0::test_synchronizer	✅	1m 7s
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_1::test_synchronizer	✅	1m 1s
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_2::test_synchronizer	✅	1m 2s
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_no_new_version_logs_warning	✅	3ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_0	✅	2ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_1	✅	3ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_2	✅	2ms
tests/manager/synchronizer_test.py::TestPullLatestWeights::test_pull_latest_weights_3	✅	2ms
tests/service/data_juicer_test.py::TestDataJuicer::test_config	✅	819ms
tests/service/data_juicer_test.py::TestDataJuicer::test_server_start	✅	21.0s
tests/service/data_juicer_test.py::TestDataJuicerExperiencePipeline::test_data_juicer_operators	✅	20.1s
tests/service/data_juicer_test.py::TestDataJuicerTaskPipeline::test_data_juicer_task_pipeline	✅	15.0s
tests/trainer/trainer_test.py::TestTrainerCountdown_0_fsdp::test_trainer	✅	3m 43s
tests/trainer/trainer_test.py::TestTrainerCountdown_1_megatron::test_trainer	✅	5m 5s
tests/trainer/trainer_test.py::TestStepAheadAsyncRL::test_trainer	✅	1m 41s
tests/trainer/trainer_test.py::TestTrainerGSM8K_0_fsdp::test_trainer	✅	1m 10s
tests/trainer/trainer_test.py::TestTrainerGSM8K_1_fsdp2::test_trainer	✅	1m 5s
tests/trainer/trainer_test.py::TestTrainerGSM8K_2_fsdp::test_trainer	✅	1m 8s
tests/trainer/trainer_test.py::TestTrainerGSM8K_3_fsdp2::test_trainer	✅	1m 17s
tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	⏭️	1ms
tests/trainer/trainer_test.py::TestTrainerDPO::test_trainer	✅	39.0s
tests/trainer/trainer_test.py::TestTrainerSFT::test_trainer	✅	36.1s
tests/trainer/trainer_test.py::TestTrainerToolsSFT::test_trainer_tools	✅	36.2s
tests/trainer/trainer_test.py::TestFullyAsyncMode_0_fsdp::test_fully_async_mode	✅	1m 41s
tests/trainer/trainer_test.py::TestFullyAsyncMode_1_fsdp::test_fully_async_mode	✅	1m 42s
tests/trainer/trainer_test.py::TestFullyAsyncMode_2_megatron::test_fully_async_mode	✅	2m 32s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_0_fsdp::test_trainer	✅	2m 54s
tests/trainer/trainer_test.py::TestTrainerCheckpointSave_1_megatron::test_trainer	✅	5m 43s
tests/trainer/trainer_test.py::TestTrainerMIX::test_trainer	✅	2m 4s
tests/trainer/trainer_test.py::TestServeWithTrainer::test_serve_with_trainer	✅	1m 52s
tests/trainer/trainer_test.py::TestMultiModalGRPO::test_trainer	✅	1m 55s
tests/trainer/trainer_test.py::TestMultiModalSFT::test_trainer	✅	1m 5s
tests/trainer/trainer_test.py::TestTrainerLoRA::test_trainer	✅	3m 15s
tests/trainer/trainer_test.py::TestOverRollout::test_trainer	✅	1m 5s
tests/trainer/trainer_test.py::TestTrainerPromptTruncation::test_trainer	✅	47.8s
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer	⏭️	1ms
tests/trainer/trainer_test.py::TestTinkerTrainer::test_trainer_class	⏭️	1ms
tests/trainer/trainer_test.py::AgentScopeTunerTest::test_agentscope_tuner	✅	1m 19s
tests/trainer/trainer_test.py::ColocateModeTest::test_trainer	✅	1m 58s
tests/utils/eval_utils_test.py::TestComputeScore::test_both_boxed_and_equivalent	✅	2ms
tests/utils/eval_utils_test.py::TestComputeScore::test_both_boxed_and_not_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_empty_ground_truth	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_empty_solution_string	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_multiple_boxed_answers_in_solution	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_boxed_truth_raw_and_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_boxed_truth_raw_and_not_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_not_boxed	✅	1ms
tests/utils/eval_utils_test.py::TestComputeScore::test_solution_raw_and_ground_truth_boxed_equivalent	✅	1ms
tests/utils/eval_utils_test.py::TestMathEvalUtils::test_extract_answer	✅	3ms
tests/utils/eval_utils_test.py::TestMathEvalUtils::test_verify_math_answer	✅	60ms
tests/utils/eval_utils_test.py::TestEvalUtils::test_is_equiv	✅	5ms
tests/utils/log_test.py::LogTest::test_actor_log	✅	2.0s
tests/utils/log_test.py::LogTest::test_group_by_node	✅	1.9s
tests/utils/log_test.py::LogTest::test_no_actor_log	✅	881ms
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_local_0__workspace_tests_utils_plugins	✅	80ms
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_local_1_tests_utils_plugins	✅	78ms
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_remote_0__workspace_tests_utils_plugins	✅	9.1s
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins_remote_1_tests_utils_plugins	✅	9.2s
tests/utils/plugin_test.py::TestPluginLoader::test_passing_custom_class_0__workspace_tests_utils_plugins	✅	5.3s
tests/utils/plugin_test.py::TestPluginLoader::test_passing_custom_class_1_tests_utils_plugins	✅	4.9s
tests/utils/registry_test.py::TestRegistryWithRay::test_dynamic_import	✅	2.4s
tests/utils/registry_test.py::TestRegistry::test_algorithm_registry_mapping	✅	8ms
tests/utils/registry_test.py::TestRegistry::test_buffer_module_registry_mapping	✅	2ms
tests/utils/registry_test.py::TestRegistry::test_common_module_registry_mapping	✅	50ms
tests/utils/registry_test.py::TestRegistry::test_register_module	✅	1ms
tests/utils/registry_test.py::TestRegistry::test_utils_module_registry_mapping	✅	1ms
tests/utils/swanlab_test.py::TestSwanlabMonitor::test_swanlab_monitor_smoke	⏭️	1ms
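
The durations in the table mix units (`5ms`, `10.7s`, `1m 3s`, `1h 37m`), which makes it awkward to rank tests by cost. Below is a minimal sketch, assuming the report has been saved as tab-separated text in the same three-column layout. `parse_duration` and `slowest` are hypothetical helpers written for illustration; they are not part of Trinity-RFT or the CTRF reporter.

```python
import re
from typing import List, Tuple

# Seconds per unit. "ms" is listed before "m" in the regex so it wins the match.
_UNIT_SECONDS = {"ms": 0.001, "s": 1.0, "m": 60.0, "h": 3600.0}
_TOKEN = re.compile(r"(\d+(?:\.\d+)?)\s*(ms|h|m|s)")


def parse_duration(text: str) -> float:
    """Convert a duration such as '1m 3s' or '416ms' to seconds."""
    tokens = _TOKEN.findall(text)
    if not tokens:
        raise ValueError(f"unrecognized duration: {text!r}")
    return sum(float(value) * _UNIT_SECONDS[unit] for value, unit in tokens)


def slowest(rows: List[str], top: int = 3) -> List[Tuple[str, float]]:
    """Return the `top` slowest (test name, seconds) pairs from tab-separated rows."""
    parsed = []
    for row in rows:
        name, _status, duration = row.split("\t")
        parsed.append((name, parse_duration(duration)))
    return sorted(parsed, key=lambda item: item[1], reverse=True)[:top]


# Three rows copied from the table above.
sample = [
    "tests/trainer/trainer_test.py::TestTrainerCheckpointSave_1_megatron::test_trainer\t✅\t5m 43s",
    "tests/buffer/file_test.py::TestFileBuffer::test_file_reader\t✅\t416ms",
    "tests/cli/launcher_test.py::TestLauncherMain::test_debug_mode\t✅\t1m 3s",
]

for name, seconds in slowest(sample, top=2):
    print(f"{seconds:8.1f}s  {name}")
```

Sorting on seconds rather than on the raw strings avoids ordering mistakes such as `416ms` ranking above `1m 3s`.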

Github Test Reporter by CTRF 💚

Copilot

Pull request overview

Adds support and compatibility patches for the Qwen3.5 model family, including updates to trainer data preparation, model monkey-patching, vLLM integration, and Megatron distributed checkpoint metadata handling.

Changes:

Add Qwen3.5-specific monkey patches (forward overrides, ulysses sequence-parallel hooks, FLOPs estimator wiring).
Adjust verl trainer preprocessing to use an “empty” HF model for multimodal rope-index computation.
Extend Megatron distributed checkpoint save/load metadata support and widen several dependency constraints.
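
The first item describes forward overrides applied by model type. The general pattern can be sketched as below. This is an illustration only, not the code in `trinity/trainer/verl/monkey_patch.py`: the registry, `DummyModel`, the `"qwen3_5"` key, and the doubled output are all hypothetical stand-ins.

```python
from typing import Callable, Dict

# Hypothetical registry: model_type string -> replacement forward function.
_FORWARD_PATCHES: Dict[str, Callable] = {}


def register_forward(model_type: str):
    """Decorator that records a replacement forward for one model type."""
    def decorator(fn: Callable) -> Callable:
        _FORWARD_PATCHES[model_type] = fn
        return fn
    return decorator


class DummyModel:
    """Stand-in for a real model class (assumption, not a Trinity-RFT class)."""
    model_type = "qwen3_5"  # hypothetical key used only in this sketch

    def forward(self, x):
        return x


@register_forward("qwen3_5")
def patched_forward(self, x):
    # A real override would dispatch to a fused or sequence-parallel kernel here.
    return x * 2


def apply_patch(model_cls) -> bool:
    """Swap in the registered forward; return False if no patch is registered."""
    fn = _FORWARD_PATCHES.get(getattr(model_cls, "model_type", None))
    if fn is None:
        return False
    if "_original_forward" not in vars(model_cls):
        # Keep the original once so the patch can be rolled back and is idempotent.
        model_cls._original_forward = model_cls.forward
    model_cls.forward = fn
    return True


apply_patch(DummyModel)
print(DummyModel().forward(3))
```

Keeping the original method on the class makes it possible to undo the patch in tests, and looking the patch up by `model_type` leaves classes without a registered override untouched.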

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
`trinity/trainer/verl/verl_trainer.py`	Builds an empty HF model (no weights) and passes it into `to_data_proto` for rope-index handling.
`trinity/trainer/verl/verl_config.py`	Adds Megatron distributed-checkpoint optimizer reshaping toggles.
`trinity/trainer/verl/utils.py`	Switches multimodal rope-index path from processor-based to model-based and removes local `hf_processor` shim.
`trinity/trainer/verl/monkey_patch.py`	Refactors fused-backend forward patching and adds Qwen3.5 model-type handling & related patches.
`trinity/trainer/verl/megatron_checkpoint_manager.py`	Adds content metadata generation/plumbing for dist-checkpointing save/load (compat workaround pending verl upgrade).
`trinity/common/patch/qwen3_5.py`	Introduces Qwen3.5 forward implementations for torch/triton backends plus ulysses linear-attn decorator.
`trinity/common/models/vllm_patch/worker_patch.py`	Expands supported vLLM version range for the prompt-logprobs patch.
`trinity/common/models/vllm_patch/__init__.py`	Adds a transformers>=5 compatibility monkey-patch for `PreTrainedConfig.__init__`.
`trinity/common/models/vllm_model.py`	Adjusts prompt truncation plumbing across vLLM versions (SamplingParams vs `tokenization_kwargs`).
`pyproject.toml`	Relaxes megatron-core / transformer-engine / flash-attn version constraints.
`.github/workflows/docker/docker-compose.yaml`	Updates CI docker image tags used by the workflow.
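
Several rows above describe patches that only apply within a range of library versions, such as the vLLM prompt-logprobs patch. Below is a minimal sketch of such a guard using only the standard library. The bounds are hypothetical: the lower bound in particular is not stated in this PR, and the function names are illustrative rather than taken from `worker_patch.py`.

```python
import re
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Return the leading numeric release, padded to three parts.

    Suffixes such as 'rc1' or '+cu128' are ignored, so a pre-release is
    treated like its final release (a simplification made in this sketch).
    """
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"not a version string: {version!r}")
    parts = tuple(int(p) for p in match.group(0).split("."))
    return parts + (0,) * (3 - len(parts))


def in_supported_range(installed: str, minimum: str, maximum: str) -> bool:
    """True when minimum <= installed <= maximum (both bounds inclusive)."""
    return parse_version(minimum) <= parse_version(installed) <= parse_version(maximum)


# Hypothetical bounds, chosen only to make the example concrete.
MIN_SUPPORTED = "0.10.0"
MAX_SUPPORTED = "0.17.0"

for candidate in ("0.17.0", "0.17.0rc1", "0.18.0"):
    print(candidate, in_supported_range(candidate, MIN_SUPPORTED, MAX_SUPPORTED))
```

In a real project, `packaging.version.Version` handles pre-release ordering properly and is the safer choice where that dependency is available.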

Support Qwen3.5

5f6d359

chenyushuo added 3 commits March 11, 2026 15:59

Upgrade the dependency versions of vllm, transformers, and megatron

c09b62f

Merge branch 'main' of github.com:modelscope/Trinity-RFT into dev/sup…

d296f8b

…port_qwen35

fix pre commit

5267498

chenyushuo requested a review from Copilot March 11, 2026 09:48

Copilot started reviewing on behalf of chenyushuo March 11, 2026 09:49

Copilot AI reviewed Mar 11, 2026

trinity/trainer/verl/megatron_checkpoint_manager.py (resolved)

trinity/common/models/vllm_patch/worker_patch.py (resolved)

pyproject.toml (resolved)

pyproject.toml (resolved)

upgrade vllm

4a15384

pan-x-c approved these changes Mar 11, 2026

pan-x-c merged commit cae0f9b into agentscope-ai:main Mar 11, 2026
1 check passed

pan-x-c merged 5 commits into agentscope-ai:main from chenyushuo:dev/support_qwen35

3 participants
