I upgraded from M2.5 to M2.7 and am running it with vLLM on 4 x A100 (80G). By comparison, M2.5 responded almost on the fly, but M2.7 is really verbose and slow, and it gets worse after some number of requests.