langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-02-21 14:43:07 +00:00

Author	SHA1	Message	Date
Harrison Chase	a41e91d650	cr	2023-03-02 14:44:07 -08:00
Harrison Chase	7e2ae5570a	Harrison/new prompt abstraction (#1405 )	2023-03-02 12:04:12 -08:00
Harrison Chase	0abf4d4c7d	Harrison/new prompt abstraction (#1404 )	2023-03-02 11:58:55 -08:00
Harrison Chase	6db04cfe65	Merge branch 'master' into harrison/memory-chat	2023-03-02 11:55:40 -08:00
Eugene Yurtsev	a83a371069	Minor documentation update in initialize_agent (#1397 ) Updating documentation in initialize_agent. One thing that could benefit from further clarification is the responsibility breakdown by between an AgentExecutor vs. an Agent. The documentation for an AgentExecutor does not clarify that. From the class attributes, it appears that executor has access to the tools, while the agent is only aware of the tool names. Anyway, additional clarification would be beneficial on the AgentExecutor class.	2023-03-02 11:46:35 -08:00
Harrison Chase	34214f5fa2	Harrison/new prompt abstraction (#1399 )	2023-03-02 11:41:18 -08:00
Nuno Campos	499e76b199	Allow the regular openai class to be used for ChatGPT models (#1393 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> v0.0.100	2023-03-02 09:04:18 -08:00
Kacper Łukawski	8947797250	Return Cohere embeddings as lists of floats (#1394 ) This PR fixes the types returned by Cohere embeddings. Currently, Cohere client returns instances of `cohere.embeddings.Embeddings`. Since the transport layer relies on JSON, some numbers might be represented as ints, not floats, which happens quite often. While that doesn't seem to be an issue, it breaks some pydantic models if they require strict floats.	2023-03-02 09:02:10 -08:00
Jason Gill	1989e7d4c2	Update examples to prevent confusing missing _type warning (#1391 ) The YAML and JSON examples of prompt serialization now give a strange `No '_type' key found, defaulting to 'prompt'` message when you try to run them yourself or copy the format of the files. The reason for this harmless warning is that the _type key was not in the config files, which means they are parsed as a standard prompt. This could be confusing to new users (like it was confusing to me after upgrading from 0.0.85 to 0.0.86+ for my few_shot prompts that needed a _type added to the example_prompt config), so this update includes the _type key just for clarity. Obviously this is not critical as the warning is harmless, but it could be confusing to track down or be interpreted as an error by a new user, so this update should resolve that.	2023-03-02 07:39:57 -08:00
Harrison Chase	dda5259f68	bump version to 0.0.99 (#1390 ) v0.0.99	2023-03-02 07:25:59 -08:00
Kacper Łukawski	f032609f8d	Add `recursive` parameter to `DirectoryLoader` (#1389 ) This PR allows loading a directory recursively.	2023-03-02 07:06:26 -08:00
Kacper Łukawski	9ac442624c	Add Qdrant named arguments (#1386 ) This PR: - Increases `qdrant-client` version to 1.0.4 - Introduces custom content and metadata keys (as requested in #1087) - Moves all the `QdrantClient` parameters into the method parameters to simplify code completion	2023-03-02 07:05:14 -08:00
Francisco Ingham	34abcd31b9	remove limit clause from prompt for compatibility with ms sql server (#1385 ) For reference see: `8a35811556` Co-authored-by: Francisco Ingham <>	2023-03-02 07:02:42 -08:00
Harrison Chase	098a0ff568	cr	2023-03-01 23:08:09 -08:00
Harrison Chase	7d0502e964	cr	2023-03-01 22:47:15 -08:00
Harrison Chase	ae65e8c5f4	cr	2023-03-01 22:02:08 -08:00
Ankush Gola	fe30be6fba	add async and streaming support to `OpenAIChat` (#1378 ) title says it all	2023-03-01 21:55:43 -08:00
Harrison Chase	d6584fde16	cr	2023-03-01 21:28:30 -08:00
Lakshya Agarwal	cfed0497ac	Minor grammatical fixes (#1325 ) Fixed typos and links in a few places across documents	2023-03-01 21:18:09 -08:00
Ryan Dao	59157b6891	Bug: Fix Python version validation in PythonAstREPLTool (#1373 ) The current logic checks if the Python major version is < 8, which is wrong. This checks if the major and minor version is < 3.9.	2023-03-01 21:15:27 -08:00
Harrison Chase	e178008b75	Harrison/track token usage (#1382 ) Co-authored-by: Zak King <zaking17@gmail.com>	2023-03-01 21:15:13 -08:00
Harrison Chase	1cd8996074	Harrison/summarizer chain (#1356 ) Co-authored-by: Tim Asp <707699+timothyasp@users.noreply.github.com>	2023-03-01 20:59:07 -08:00
yakigac	cfae03042d	Fix the openaichat example (#1377 ) The example was wrong.	2023-03-01 18:24:32 -08:00
Harrison Chase	6cfd0ca73a	cr	2023-03-01 17:53:38 -08:00
Harrison Chase	522452adae	cr	2023-03-01 17:46:43 -08:00
Harrison Chase	007278a358	cr	2023-03-01 17:44:25 -08:00
Harrison Chase	79964e6409	cr	2023-03-01 17:38:48 -08:00
Harrison Chase	c3046309fb	cr	2023-03-01 17:06:23 -08:00
Harrison Chase	95cfd002a7	cr	2023-03-01 17:06:06 -08:00
Harrison Chase	acaa2d3ee4	cr	2023-03-01 16:44:45 -08:00
Harrison Chase	f635a31992	memory chat	2023-03-01 15:27:20 -08:00
Harrison Chase	156bdb6590	memory stuff	2023-03-01 14:11:07 -08:00
Harrison Chase	04220be616	stash	2023-03-01 14:04:28 -08:00
Harrison Chase	12aacdbfb4	stash	2023-03-01 13:24:36 -08:00
Harrison Chase	4b5e850361	chatgpt wrapper (#1367 ) v0.0.98	2023-03-01 11:47:01 -08:00
Harrison Chase	4d4b43cf5a	fix doc names (#1354 )	2023-03-01 09:40:31 -08:00
Harrison Chase	c01f9100e4	bump version to 0097 (#1365 ) v0.0.97	2023-03-01 08:20:24 -08:00
Christie Jacob	edb3915ee7	typo in vectorstores (#1362 )	2023-03-01 07:21:37 -08:00
Harrison Chase	fe7dbecfe6	pandas and csv agents (#1353 )	2023-02-28 22:19:11 -08:00
Harrison Chase	02ec72df87	improve docs (#1351 )	2023-02-28 21:37:18 -08:00
Jon Luo	92ab27e4b8	sql doc formatting (#1350 ) My bad, missed a few tabs between the two PRs	2023-02-28 19:54:46 -08:00
Ankush Gola	82baecc892	Add a SQL agent for interacting with SQL Databases and JSON Agent for interacting with large JSON blobs (#1150 ) This PR adds * `ZeroShotAgent.as_sql_agent`, which returns an agent for interacting with a sql database. This builds off of `SQLDatabaseChain`. The main advantages are 1) answering general questions about the db, 2) access to a tool for double checking queries, and 3) recovering from errors * `ZeroShotAgent.as_json_agent` which returns an agent for interacting with json blobs. * Several examples in notebooks --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-02-28 19:44:39 -08:00
Jon Luo	35f1e8f569	separate columns by tabs instead of single space in sql sample rows (#1348 ) Use tabs to separate columns instead of a single space - confusing when there are spaces in a cell	2023-02-28 18:59:53 -08:00
kurehajime	6c629b54e6	Fixed arguments passed to InvalidTool.run(). (#1340 ) [InvalidTool.run()](`72ef69d1ba/langchain/agents/tools.py (L43)`) returns "{arg}is not a valid tool, try another one.". However, no function name is actually given in the argument. This causes LLM to be stuck in a loop, unable to find the right tool. This may resolve these Issues. https://github.com/hwchase17/langchain/issues/998 https://github.com/hwchase17/langchain/issues/702	2023-02-28 18:58:23 -08:00
James Brotchie	3574418a40	Fix link in summarization.md (#1344 ) "Utilities for working with Documents" was linking to a non-useful page. Re-linked to the utils page that includes info about working with docs.	2023-02-28 18:58:12 -08:00
Jon Luo	5bf8772f26	add option to use user-defined SQL table info (#1347 ) Currently, table information is gathered through SQLAlchemy as complete table DDL and a user-selected number of sample rows from each table. This PR adds the option to use user-defined table information instead of automatically collecting it. This will use the provided table information and fall back to the automatic gathering for tables that the user didn't provide information for. Off the top of my head, there are a few cases where this can be quite useful: - The first n rows of a table are uninformative, or very similar to one another. In this case, hand-crafting example rows for a table such that they provide the good, diverse information can be very helpful. Another approach we can think about later is getting a random sample of n rows instead of the first n rows, but there are some performance considerations that need to be taken there. Even so, hand-crafting the sample rows is useful and can guarantee the model sees informative data. - The user doesn't want every column to be available to the model. This is not an elegant way to fulfill this specific need since the user would have to provide the table definition instead of a simple list of columns to include or ignore, but it does work for this purpose. - For the developers, this makes it a lot easier to compare/benchmark the performance of different prompting structures for providing table information in the prompt. These are cases I've run into myself (particularly cases 1 and 3) and I've found these changes useful. Personally, I keep custom table info for a few tables in a yaml file for versioning and easy loading. Definitely open to other opinions/approaches though!	2023-02-28 18:58:04 -08:00
Harrison Chase	924bba5ce9	bump version (#1342 ) v0.0.96	2023-02-28 08:48:32 -08:00
Harrison Chase	786852e9e6	partial variables (#1308 )	2023-02-28 08:40:35 -08:00
Tim Asp	72ef69d1ba	Add new iFixit document loader (#1333 ) iFixit is a wikipedia-like site that has a huge amount of open content on how to fix things, questions/answers for common troubleshooting and "things" related content that is more technical in nature. All content is licensed under CC-BY-SA-NC 3.0 Adding docs from iFixit as context for user questions like "I dropped my phone in water, what do I do?" or "My macbook pro is making a whining noise, what's wrong with it?" can yield significantly better responses than context free response from LLMs.	2023-02-27 20:40:20 -08:00
Matt Robinson	1aa41b5741	feat: document loader for image files (#1330 ) ### Summary Adds a document loader for image files such as `.jpg` and `.png` files. ### Testing Run the following using the example document from the [`unstructured` repo](https://github.com/Unstructured-IO/unstructured/tree/main/example-docs). ```python from langchain.document_loaders.image import UnstructuredImageLoader loader = UnstructuredImageLoader("layout-parser-paper-fast.jpg") loader.load() ```	2023-02-27 14:43:32 -08:00

1 2 3 4 5 ...

743 Commits