`docs/content/docs/features/distributed_inferencing.md`
This functionality enables LocalAI to distribute inference requests across multiple worker nodes.

LocalAI supports two modes of distributed inferencing via p2p:
- **Federated Mode**: Requests are shared between the cluster and routed to a single worker node in the network based on the load balancer's decision.
- **Worker Mode** (aka "model sharding" or "splitting weights"): Requests are processed by all the workers, which contribute to the final inference result (by sharing the model weights).
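The two modes above can be sketched as shell invocations. This is a minimal sketch, not a definitive reference: the subcommand and flag names (`run --p2p`, `federated`, `worker p2p-llama-cpp-rpc`) and the `TOKEN` environment variable are assumptions based on LocalAI's p2p tooling and may differ between versions, so check the CLI help for your release.

```shell
# Start a LocalAI instance with p2p enabled; it prints a network token
# that other nodes use to join (flag name assumed).
./local-ai run --p2p

# Federated Mode (subcommand assumed): run a load-balancer node that
# routes each whole request to a single worker in the federation.
TOKEN=<token> ./local-ai federated

# Worker Mode (subcommand assumed): join as a llama.cpp RPC worker that
# holds a shard of the model weights and contributes to each inference.
TOKEN=<token> ./local-ai worker p2p-llama-cpp-rpc
```

In Federated Mode each request is served end-to-end by one node, so it scales throughput; in Worker Mode every worker participates in a single request, so it lets a model too large for one machine run across several.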