You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Dec 17, 2025. It is now read-only.
have fixed num_clients=1. Couldn't them be changed to:
importmultiprocessingnum_cores=multiprocessing.cpu_count() # Get total available CPU coresclients=construct_clients(llm_api=llm_api, num_clients=num_cores)
As I've seen in the docs, the parallelism in ray can be done 2 ways:
clients= [OpenAIChatCompletionsClient.remote() for_inrange(8)] # multiple actors # OR@ray.remote(num_cpus=2) # Each actor uses 2 CPUsclassOpenAIChatCompletionsClient(LLMClient):
pass
And in this case I would prefer the first way... Am I missing anything or the way the code is written does not use all the available CPUs??