langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-18 00:51:18 +00:00

Author	SHA1	Message	Date
Bagatur	317fb86fd9	openai[patch]: fix int test (#29395 )	2025-01-23 21:23:01 +00:00
Bagatur	8d566a8fe7	openai[patch]: detect old models in with_structured_output (#29392 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2025-01-23 20:47:32 +00:00
ccurme	c20f7418c7	openai[patch]: fix Azure LLM test (#29302 ) The tokens I get are: ``` ['', '\n\n', 'The', ' sun', ' was', ' setting', ' over', ' the', ' horizon', ',', ' casting', ''] ``` so possibly an extra empty token is included in the output. lmk @efriis if we should look into this further.	2025-01-19 17:25:42 +00:00
ccurme	c616b445f2	anthropic[patch]: support `parallel_tool_calls` (#29257 ) Need to: - Update docs - Decide if this is an explicit kwarg of bind_tools - Decide if this should be in standard test with flag for supporting	2025-01-17 19:41:41 +00:00
Erick Friis	bbc3e3b2cf	openai: disable streaming for o1 by default (#29147 ) Currently 400s https://community.openai.com/t/streaming-support-for-o1-o1-2024-12-17-resulting-in-400-unsupported-value/1085043 o1-mini and o1-preview stream fine	2025-01-11 02:24:11 +00:00
ccurme	6e63ccba84	openai[minor]: release 0.3 (#29100 ) ## Goal Solve the following problems with `langchain-openai`: - Structured output with `o1` [breaks out of the box](https://langchain.slack.com/archives/C050X0VTN56/p1735232400232099). - `with_structured_output` by default does not use OpenAI’s [structured output feature](https://platform.openai.com/docs/guides/structured-outputs). - We override API defaults for temperature and other parameters. ## Breaking changes: - Default method for structured output is changing to OpenAI’s dedicated [structured output feature](https://platform.openai.com/docs/guides/structured-outputs). For schemas specified via TypedDict or JSON schema, strict schema validation is disabled by default but can be enabled by specifying `strict=True`. - To recover previous default, pass `method="function_calling"` into `with_structured_output`. - Models that don’t support `method="json_schema"` (e.g., `gpt-4` and `gpt-3.5-turbo`, currently the default model for ChatOpenAI) will raise an error unless `method` is explicitly specified. - To recover previous default, pass `method="function_calling"` into `with_structured_output`. - Schemas specified via Pydantic `BaseModel` that have fields with non-null defaults or metadata (like min/max constraints) will raise an error. - To recover previous default, pass `method="function_calling"` into `with_structured_output`. - `strict` now defaults to False for `method="json_schema"` when schemas are specified via TypedDict or JSON schema. - To recover previous behavior, use `with_structured_output(schema, strict=True)` - Schemas specified via Pydantic V1 will raise a warning (and use `method="function_calling"`) unless `method` is explicitly specified. - To remove the warning, pass `method="function_calling"` into `with_structured_output`. - Streaming with default structured output method / Pydantic schema no longer generates intermediate streamed chunks. - To recover previous behavior, pass `method="function_calling"` into `with_structured_output`. - We no longer override default temperature (was 0.7 in LangChain, now will follow OpenAI, currently 1.0). - To recover previous behavior, initialize `ChatOpenAI` or `AzureChatOpenAI` with `temperature=0.7`. - Note: conceptually there is a difference between forcing a tool call and forcing a response format. Tool calls may have more concise arguments vs. generating content adhering to a schema. Prompts may need to be adjusted to recover desired behavior. --------- Co-authored-by: Jacob Lee <jacoblee93@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2025-01-10 10:50:32 -05:00
ccurme	815bfa1913	openai[patch]: support streaming with json_schema response format (#29044 ) - Stream JSON string content. Final chunk includes parsed representation (following OpenAI [docs](https://platform.openai.com/docs/guides/structured-outputs#streaming)). - Mildly (?) breaking change: if you were using streaming with `response_format` before, usage metadata will disappear unless you set `stream_usage=True`. ## Response format Before: ![Screenshot 2025-01-06 at 11 59 01 AM](https://github.com/user-attachments/assets/e54753f7-47d5-421d-b8f3-172f32b3364d) After: ![Screenshot 2025-01-06 at 11 58 13 AM](https://github.com/user-attachments/assets/34882c6c-2284-45b4-92f7-5b5b69896903) ## with_structured_output For pydantic output, behavior of `with_structured_output` is unchanged (except for warning disappearing), because we pluck the parsed representation straight from OpenAI, and OpenAI doesn't return it until the stream is completed. Open to alternatives (e.g., parsing from content or intermediate dict chunks generated by OpenAI). Before: ![Screenshot 2025-01-06 at 12 38 11 PM](https://github.com/user-attachments/assets/913d320d-f49e-4cbb-a800-b394ae817fd1) After: ![Screenshot 2025-01-06 at 12 38 58 PM](https://github.com/user-attachments/assets/f7a45dd6-d886-48a6-8d76-d0e21ca767c6)	2025-01-09 10:32:30 -05:00
Erick Friis	187131c55c	Revert "integrations[patch]: remove non-required chat param defaults" (#29048 ) Reverts langchain-ai/langchain#26730 discuss best way to release default changes (esp openai temperature)	2025-01-06 14:45:34 -08:00
Bagatur	3d7ae8b5d2	integrations[patch]: remove non-required chat param defaults (#26730 ) anthropic: - max_retries openai: - n - temperature - max_retries fireworks - temperature groq - n - max_retries - temperature mistral - max_retries - timeout - max_concurrent_requests - temperature - top_p - safe_mode --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-01-06 22:26:22 +00:00
Bagatur	c3ccd93c12	patch openai json mode test (#28831 )	2024-12-19 21:43:32 +00:00
Bagatur	ce6748dbfe	xfail openai image token count test (#28828 )	2024-12-19 21:23:30 +00:00
Bagatur	1378ddfa5f	openai[patch]: type reasoning_effort (#28825 )	2024-12-19 19:36:49 +00:00
Bagatur	4a531437bb	core[patch], openai[patch]: Handle OpenAI developer msg (#28794 ) - Convert developer openai messages to SystemMessage - store additional_kwargs={"__openai_role__": "developer"} so that the correct role can be reconstructed if needed - update ChatOpenAI to read in openai_role --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-12-18 21:54:07 +00:00
Bagatur	e4d3ccf62f	json mode standard test (#25497 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-17 18:47:34 +00:00
Erick Friis	0eb7ab65f1	multiple: fix xfailed signatures (#28597 )	2024-12-06 15:39:47 -08:00
ccurme	b8e861a63b	openai[patch]: add standard tests for embeddings (#28540 )	2024-12-05 17:00:27 +00:00
ccurme	42b18824c2	openai[patch]: use max_completion_tokens in place of max_tokens (#26917 ) `max_tokens` is deprecated: https://platform.openai.com/docs/api-reference/chat/create#chat-create-max_tokens --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-11-26 16:30:19 +00:00
Erick Friis	29f8a79ebe	groq,openai,mistralai: fix unit tests (#28279 )	2024-11-22 04:54:01 +00:00
ccurme	56499cf58b	openai[patch]: unskip test and relax tolerance in embeddings comparison (#28262 ) From what I can tell response using SDK is not deterministic: ```python import numpy as np import openai documents = ["disallowed special token '<\|endoftext\|>'"] model = "text-embedding-ada-002" direct_output_1 = ( openai.OpenAI() .embeddings.create(input=documents, model=model) .data[0] .embedding ) for i in range(10): direct_output_2 = ( openai.OpenAI() .embeddings.create(input=documents, model=model) .data[0] .embedding ) print(f"{i}: {np.isclose(direct_output_1, direct_output_2).all()}") ``` ``` 0: True 1: True 2: True 3: True 4: False 5: True 6: True 7: True 8: True 9: True ``` See related discussion here: https://community.openai.com/t/can-text-embedding-ada-002-be-made-deterministic/318054 Found the same result using `"text-embedding-3-small"`.	2024-11-21 10:23:10 -08:00
Erick Friis	0dbaf05bb7	standard-tests: rename langchain_standard_tests to langchain_tests, release 0.3.2 (#28203 )	2024-11-18 19:10:39 -08:00
Bagatur	33dbfba08b	openai[patch]: default to invoke on o1 stream() (#27983 )	2024-11-08 19:12:59 -08:00
ccurme	66966a6e72	openai[patch]: release 0.2.6 (#27924 ) Some additions in support of [predicted outputs](https://platform.openai.com/docs/guides/latency-optimization#use-predicted-outputs) feature: - Bump openai sdk version - Add integration test - Add example to integration docs The `prediction` kwarg is already plumbed through model invocation.	2024-11-05 23:02:24 +00:00
Bagatur	ede953d617	openai[patch]: fix schema formatting util (#27685 )	2024-10-28 15:46:47 +00:00
Bagatur	655ced84d7	openai[patch]: accept json schema response format directly (#27623 ) fix #25460 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-24 18:19:15 +00:00
Erick Friis	7d65a32ee0	openai: audio modality, remove sockets from unit tests (#27436 )	2024-10-18 08:02:09 -07:00
Bagatur	ce33c4fa40	openai[patch]: default temp=1 for o1 (#27206 )	2024-10-08 15:45:21 -07:00
Bagatur	bd5b335cb4	standard-tests[patch]: fix oai usage metadata test (#27122 )	2024-10-04 20:00:48 +00:00
Bagatur	4935a14314	core,integrations[minor]: Dont error on fields in model_kwargs (#27110 ) Given the current erroring behavior, every time we've moved a kwarg from model_kwargs and made it its own field that was a breaking change. Updating this behavior to support the old instantiations / serializations. Assuming build_extra_kwargs was not something that itself is being used externally and needs to be kept backwards compatible	2024-10-04 11:30:27 -07:00
Erick Friis	e8e5d67a8d	openai: fix None token detail (#27091 ) happens in Azure	2024-10-04 01:25:38 +00:00
Bagatur	c09da53978	openai[patch]: add usage metadata details (#27080 )	2024-10-03 14:01:03 -07:00
ccurme	7091a1a798	openai[patch]: increase token limit in azure integration tests (#26901 ) `test_json_mode` occasionally runs into this	2024-09-26 14:31:33 +00:00
ccurme	2a4c5713cd	openai[patch]: fix azure integration tests (#26791 )	2024-09-23 17:49:15 -04:00
Bagatur	e1e4f88b3e	openai[patch]: enable Azure structured output, parallel_tool_calls=Fa… (#26599 ) …lse, tool_choice=required response_format=json_schema, tool_choice=required, parallel_tool_calls are all supported for gpt-4o on azure.	2024-09-22 22:25:22 -07:00
Anton Dubovik	3e2cb4e8a4	openai: embeddings: supported chunk_size when check_embedding_ctx_length is disabled (#23767 ) Chunking of the input array controlled by `self.chunk_size` is being ignored when `self.check_embedding_ctx_length` is disabled. Effectively, the chunk size is assumed to be equal 1 in such a case. This is suprising. The PR takes into account `self.chunk_size` passed by the user. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-20 16:58:45 -07:00
Bagatur	5ced41bf50	anthropic[patch]: fix tool call and tool res image_url handling (#26587 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-09-17 14:30:07 -07:00
Erick Friis	c2a3021bb0	multiple: pydantic 2 compatibility, v0.3 (#26443 ) Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Dan O'Donovan <dan.odonovan@gmail.com> Co-authored-by: Tom Daniel Grande <tomdgrande@gmail.com> Co-authored-by: Grande <Tom.Daniel.Grande@statsbygg.no> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: ZhangShenao <15201440436@163.com> Co-authored-by: Friso H. Kingma <fhkingma@gmail.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Morgante Pell <morgantep@google.com>	2024-09-13 14:38:45 -07:00
liuhetian	7fc9e99e21	openai[patch]: get output_type when using with_structured_output (#26307 ) - This allows pydantic to correctly resolve annotations necessary when using openai new param `json_schema` Resolves issue: #26250 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-09-13 11:42:01 -07:00
Harrison Chase	28ad244e77	community, openai: support nested dicts (#26414 ) needed for thinking tokens --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-12 21:47:47 -07:00
Bagatur	dba308447d	fmt	2024-09-04 11:28:04 -07:00
Bagatur	3ec93c2817	standard-tests[patch]: add Ser/Des test	2024-09-04 10:24:06 -07:00
Friso H. Kingma	af11fbfbf6	langchain_openai: Make sure the response from the async client in the astream method of ChatOpenAI is properly awaited in case of "include_response_headers=True" (#26031 ) - Description: This is a one line change. the `self.async_client.with_raw_response.create(payload)` call is not properly awaited within the `_astream` method. In `_agenerate` this is done already, but likely forgotten in the other method. - Issue: Not applicable - Dependencies:** No dependencies required. (If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-09-04 13:26:48 +00:00
Eugene Yurtsev	bc3b851f08	openai[patch]: Upgrade @root_validators in preparation for pydantic 2 migration (#25491 ) * Upgrade @root_validator in openai pkg * Ran notebooks for all but AzureAI embeddings --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-09-03 14:42:24 -07:00
Bagatur	bc3b02651c	standard-tests[patch]: test init from env vars (#25983 )	2024-09-03 19:05:39 +00:00
ccurme	2e5c379632	openai[patch]: fix get_num_tokens for function calls (#25785 ) Closes https://github.com/langchain-ai/langchain/issues/25784 See additional discussion [here](`0a4ee864e9 (r145147380)`).	2024-08-27 20:18:19 +00:00
Hyman	58e72febeb	openai:compatible with other llm usage meta data (#24500 ) - [ ] PR message: - Description: Compatible with other llm (eg: deepseek-chat, glm-4) usage meta data - Issue: N/A - Dependencies: no new dependencies added - [ ] Add tests and docs: libs/partners/openai/tests/unit_tests/chat_models/test_base.py ```shell cd libs/partners/openai poetry run pytest tests/unit_tests/chat_models/test_base.py::test_openai_astream poetry run pytest tests/unit_tests/chat_models/test_base.py::test_openai_stream poetry run pytest tests/unit_tests/chat_models/test_base.py::test_deepseek_astream poetry run pytest tests/unit_tests/chat_models/test_base.py::test_deepseek_stream poetry run pytest tests/unit_tests/chat_models/test_base.py::test_glm4_astream poetry run pytest tests/unit_tests/chat_models/test_base.py::test_glm4_stream ``` --------- Co-authored-by: hyman <hyman@xiaozancloud.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-23 16:59:14 -07:00
Yusuke Fukasawa	0258cb96fa	core[patch]: add additionalProperties recursively to oai function if strict (#25169 ) Hello. First of all, thank you for maintaining such a great project. ## Description In https://github.com/langchain-ai/langchain/pull/25123, support for structured_output is added. However, `"additionalProperties": false` needs to be included at all levels when a nested object is generated. error from current code: https://gist.github.com/fufufukakaka/e9b475300e6934853d119428e390f204 ``` BadRequestError: Error code: 400 - {'error': {'message': "Invalid schema for response_format 'JokeWithEvaluation': In context=('properties', 'self_evaluation'), 'additionalProperties' is required to be supplied and to be false", 'type': 'invalid_request_error', 'param': 'response_format', 'code': None}} ``` Reference: [Introducing Structured Outputs in the API](https://openai.com/index/introducing-structured-outputs-in-the-api/) ```json { "model": "gpt-4o-2024-08-06", "messages": [ { "role": "system", "content": "You are a helpful math tutor." }, { "role": "user", "content": "solve 8x + 31 = 2" } ], "response_format": { "type": "json_schema", "json_schema": { "name": "math_response", "strict": true, "schema": { "type": "object", "properties": { "steps": { "type": "array", "items": { "type": "object", "properties": { "explanation": { "type": "string" }, "output": { "type": "string" } }, "required": ["explanation", "output"], "additionalProperties": false } }, "final_answer": { "type": "string" } }, "required": ["steps", "final_answer"], "additionalProperties": false } } } } ``` In the current code, `"additionalProperties": false` is only added at the last level. This PR introduces the `_add_additional_properties_key` function, which recursively adds `"additionalProperties": false` to the entire JSON schema for the request. Twitter handle: `@fukkaa1225` Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-23 00:08:58 +00:00
ccurme	b83f1eb0d5	core, partners: implement standard tracing params for LLMs (#25410 )	2024-08-16 13:18:09 -04:00
ccurme	01ecd0acba	openai[patch]: fix json mode for Azure (#25488 ) https://github.com/langchain-ai/langchain/issues/25479 https://github.com/langchain-ai/langchain/issues/25485 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-16 09:50:50 -07:00
Bagatur	0b4608f71e	infra: temp skip oai embeddings test (#25148 )	2024-08-07 17:51:39 +00:00
Bagatur	09fbce13c5	openai[patch]: ChatOpenAI.with_structured_output json_schema support (#25123 )	2024-08-07 08:09:07 -07:00

1 2 3

102 Commits