From c6da9533ac1424df29495f7d75b10528b02d9117 Mon Sep 17 00:00:00 2001
From: Bagatur <22008038+baskaryan@users.noreply.github.com>
Date: Mon, 20 May 2024 15:15:31 -0700
Subject: [PATCH] docs: correct langserve link (#21940)

---
 docs/docs/concepts.mdx | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/docs/concepts.mdx b/docs/docs/concepts.mdx
index d068e4b0cff..924be764ca3 100644
--- a/docs/docs/concepts.mdx
+++ b/docs/docs/concepts.mdx
@@ -66,7 +66,7 @@ LCEL was designed from day 1 to **support putting prototypes in production, with
 When you build your chains with LCEL you get the best possible time-to-first-token (time elapsed until the first chunk of output comes out). For some chains this means eg. we stream tokens straight from an LLM to a streaming output parser, and you get back parsed, incremental chunks of output at the same rate as the LLM provider outputs the raw tokens.
 
 **Async support**
-Any chain built with LCEL can be called both with the synchronous API (eg. in your Jupyter notebook while prototyping) as well as with the asynchronous API (eg. in a [LangServe](https://docs.smith.langchain.com) server). This enables using the same code for prototypes and in production, with great performance, and the ability to handle many concurrent requests in the same server.
+Any chain built with LCEL can be called both with the synchronous API (eg. in your Jupyter notebook while prototyping) as well as with the asynchronous API (eg. in a [LangServe](/docs/langserve/) server). This enables using the same code for prototypes and in production, with great performance, and the ability to handle many concurrent requests in the same server.
 
 **Optimized parallel execution**
 Whenever your LCEL chains have steps that can be executed in parallel (eg if you fetch documents from multiple retrievers) we automatically do it, both in the sync and the async interfaces, for the smallest possible latency.
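
The streaming behavior the hunk above describes (parsed chunks arriving at the provider's token rate) looks roughly like the following in LCEL. This is a minimal sketch, not part of the patch: the prompt, model name, and input are illustrative, and it assumes `langchain-openai` is installed with `OPENAI_API_KEY` set.

```python
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_openai import ChatOpenAI

# Prompt -> chat model -> string parser, composed with the LCEL pipe operator.
chain = (
    ChatPromptTemplate.from_template("Tell me a joke about {topic}")
    | ChatOpenAI(model="gpt-3.5-turbo")
    | StrOutputParser()
)

# .stream() yields parsed, incremental chunks as the LLM emits raw tokens.
for chunk in chain.stream({"topic": "bears"}):
    print(chunk, end="", flush=True)
```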
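
The sync/async claim in the changed line rests on paired methods that every LCEL runnable exposes: `invoke`/`ainvoke` (and likewise `stream`/`astream`, `batch`/`abatch`). A sketch under the same assumptions as above:

```python
import asyncio

from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_openai import ChatOpenAI

chain = (
    ChatPromptTemplate.from_template("Tell me a joke about {topic}")
    | ChatOpenAI(model="gpt-3.5-turbo")
    | StrOutputParser()
)

# Synchronous call, e.g. while prototyping in a notebook.
print(chain.invoke({"topic": "bears"}))


# The same chain awaited, e.g. inside an async server handler.
async def main() -> None:
    print(await chain.ainvoke({"topic": "bears"}))


asyncio.run(main())
```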
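
For the parallel-execution paragraph, dict-shaped steps (or an explicit `RunnableParallel`) are what LCEL runs concurrently. A self-contained sketch, with stand-in lambdas where real retrievers would go:

```python
from langchain_core.runnables import RunnableLambda, RunnableParallel

# Stand-ins for two retrievers; any runnables work here.
source_a = RunnableLambda(lambda q: f"docs about '{q}' from source A")
source_b = RunnableLambda(lambda q: f"docs about '{q}' from source B")

# The named steps are executed in parallel, in both the sync and async interfaces.
fetch_docs = RunnableParallel(a=source_a, b=source_b)
print(fetch_docs.invoke("LCEL"))  # {'a': "docs about 'LCEL' ...", 'b': ...}
```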