langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-07-12 02:55:21 +00:00

Author	SHA1	Message	Date
vowelparrot	7b96cf7a66	Add marathon times example	2023-04-19 07:24:01 -07:00
vowelparrot	42f28fb45a	Add Marathon Times Example	2023-04-19 07:23:30 -07:00
vowelparrot	b6943789ba	Merge branch 'master' into agents-4-18	2023-04-18 20:49:18 -07:00
vowelparrot	8aa784bfbd	stash	2023-04-18 20:40:16 -07:00
Zander Chase	90ef705ced	Update Tool Input (#3103 ) - Remove dynamic model creation in the `args()` property. _Only infer for the decorator (and add an argument to NOT infer if someone wishes to only pass as a string)_ - Update the validation example to make it less likely to be misinterpreted as a "safe" way to run a repl There is one example of "Multi-argument tools" in the custom_tools.ipynb from yesterday, but we could add more. The output parsing for the base MRKL agent hasn't been adapted to handle structured args at this point in time --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-18 18:18:33 -07:00
vowelparrot	a5c996189c	lint	2023-04-18 18:16:42 -07:00
vowelparrot	ee8ddc7d52	Updated Notebook	2023-04-18 18:11:41 -07:00
Harrison Chase	7331004e06	cr	2023-04-18 17:02:17 -07:00
Francesco	19116010ee	Add exeption for when version metadata cannot be found for package (#3107 ) Solves #3097 Already ran tests and lint.	2023-04-18 16:44:40 -07:00
Rahul Behal	d57c64abcf	Added task orchestration chains (#2878 ) - This PR essentially implements the chains outlined in the [LangChain Wiki](https://python.langchain.com/en/latest/use_cases/agents/baby_agi.html), some slight modifications and re-factoring was done to get it to work with the standard `VectorStore` class and to have all the relevant functions in the same class as BabyAGI so that the chain could run effectively independently - I think it is worth having these chains as a standalone on the LangChain repo, as orchestration is becoming huge, and there are a lot of other chains being built off of these as a base - `make lint` and `make format` have been run locally and are succeeding	2023-04-18 16:24:17 -07:00
vowelparrot	77058211b0	add browser	2023-04-18 16:22:13 -07:00
vowelparrot	17f029cdf9	add hil	2023-04-18 15:31:31 -07:00
vowelparrot	8a2578c5be	AutoGPT Draft	2023-04-18 15:01:35 -07:00
Carmen Sam	d54c88aa21	Add allowed and disallowed special arguments to BaseOpenAI (#3012 ) ## Background This PR fixes this error when there are special tokens when querying the chain: ``` Encountered text corresponding to disallowed special token '<\|endofprompt\|>'. If you want this text to be encoded as a special token, pass it to `allowed_special`, e.g. `allowed_special={'<\|endofprompt\|>', ...}`. If you want this text to be encoded as normal text, disable the check for this token by passing `disallowed_special=(enc.special_tokens_set - {'<\|endofprompt\|>'})`. To disable this check for all special tokens, pass `disallowed_special=()`. ``` Refer to the code snippet below, it breaks in the chain line. ``` chain = ConversationalRetrievalChain.from_llm( ChatOpenAI(openai_api_key=OPENAI_API_KEY), retriever=vectorstore.as_retriever(), qa_prompt=prompt, condense_question_prompt=condense_prompt, ) answer = chain({"question": f"{question}"}) ``` However `ChatOpenAI` class is not accepting `allowed_special` and `disallowed_special` at the moment so they cannot be passed to the `encode()` in `get_num_tokens` method to avoid the errors. ## Change - Add `allowed_special` and `disallowed_special` attributes to `BaseOpenAI` class. - Pass in `allowed_special` and `disallowed_special` as arguments of `encode()` in tiktoken. --------- Co-authored-by: samcarmen <“carmen.samkahman@gmail.com”>	2023-04-18 09:34:08 -07:00
Harrison Chase	9d23cfc7dd	bump version to 143 (#3095 ) v0.0.143	2023-04-18 09:12:57 -07:00
Harrison Chase	aad0a498ac	Harrison/output error (#3094 ) Co-authored-by: yummydum <sumita@nowcast.co.jp>	2023-04-18 08:59:56 -07:00
Harrison Chase	1c1b77bbfe	Harrison/discord (#3092 ) Co-authored-by: Rajtilak Bhattacharjee <rajtilak.blog@gmail.com>	2023-04-18 08:19:23 -07:00
Boris Feld	14e4d30659	Comet ml updates 17 04 2023 (#3074 ) I made a couple of improvements to the Comet tracker: * The Comet project name is configurable in various ways (code, environment variable or file), having a default value in code meant that users couldn't set the project name in an environment variable or in a file. * I added error catching when the `flush_tracker` is called in order to avoid crashing the whole process. Instead we are gonna display a warning or error log message (`extra={"show_traceback": True}` is an internal convention to force the display of the traceback when using our own logger). I decided to add the error catching after seeing the following error in the third example of the notebook: ``` COMET ERROR: Failed to export agent or LLM to Comet Traceback (most recent call last): File "/home/lothiraldan/project/cometml/langchain/langchain/callbacks/comet_ml_callback.py", line 484, in _log_model langchain_asset.save(langchain_asset_path) File "/home/lothiraldan/project/cometml/langchain/langchain/agents/agent.py", line 591, in save raise ValueError( ValueError: Saving not supported for agent executors. If you are trying to save the agent, please use the `.save_agent(...)` During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/lothiraldan/project/cometml/langchain/langchain/callbacks/comet_ml_callback.py", line 449, in flush_tracker self._log_model(langchain_asset) File "/home/lothiraldan/project/cometml/langchain/langchain/callbacks/comet_ml_callback.py", line 488, in _log_model langchain_asset.save_agent(langchain_asset_path) File "/home/lothiraldan/project/cometml/langchain/langchain/agents/agent.py", line 599, in save_agent return self.agent.save(file_path) File "/home/lothiraldan/project/cometml/langchain/langchain/agents/agent.py", line 145, in save agent_dict = self.dict() File "/home/lothiraldan/project/cometml/langchain/langchain/agents/agent.py", line 119, in dict _dict = super().dict() File "pydantic/main.py", line 449, in pydantic.main.BaseModel.dict File "pydantic/main.py", line 868, in _iter File "pydantic/main.py", line 743, in pydantic.main.BaseModel._get_value File "/home/lothiraldan/project/cometml/langchain/langchain/schema.py", line 381, in dict output_parser_dict["_type"] = self._type File "/home/lothiraldan/project/cometml/langchain/langchain/schema.py", line 376, in _type raise NotImplementedError NotImplementedError ``` I still need to investigate and try to fix it, it looks related to saving an agent to a file.	2023-04-18 07:32:29 -07:00
engkheng	fe68051d34	Fix typo in `docs/reference.rst` (#3081 ) fix typo	2023-04-18 07:31:00 -07:00
Azam Iftikhar	188e9b9beb	Allowing HuggingFaceEmbeddings from the cached weight (#3084 ) ### https://github.com/hwchase17/langchain/issues/3079 Allow initializing HuggingFaceEmbeddings from the cached weight	2023-04-18 07:30:35 -07:00
Roma	55f6f80a59	fix typo (#3085 )	2023-04-18 07:29:33 -07:00
TysBradford	7dae39b57d	slightly clearer docs (#3088 ) Took me a second to realise the examples required to manually print the output of the conversation predict. This might make it clearer for others	2023-04-18 07:28:29 -07:00
James O'Dwyer	0257829776	Bump Metal to use index_id (#3089 ) ## Use `index_id` over `app_id` We made a major update to index + retrieve based on Metal Indexes (instead of apps). With this change, we accept an index instead of an app in each of our respective core apis. [More details here](https://docs.getmetal.io/api-reference/core/indexing).	2023-04-18 07:28:13 -07:00
Hamza Kyamanywa	064a1db2b2	[Documentation] Show how to initiate pinecone from an existing index (#3070 ) ## What is this PR for: * This PR adds a commented line of code in the documentation that shows how someone can use the Pinecone client with an already existing Pinecone index * The documentation currently only shows how to create a pinecone index from langchain documents but not how to load one that already exists	2023-04-18 07:27:46 -07:00
Harrison Chase	894c272a56	tool validation logic	2023-04-17 21:59:32 -07:00
Harrison Chase	1920536d99	Harrison/obsidian (#3060 ) Co-authored-by: Ben Hofferber <hofferber.ben@gmail.com>	2023-04-17 21:57:32 -07:00
Zander Chase	93c0514105	Add Twitter Tweet Loader (#3050 ) Reformatted version of #3022 --------- Co-authored-by: LiaoKong <568250549@qq.com>	2023-04-17 21:44:54 -07:00
__Jay__	2984ad3964	updated llm response parsing action (#3058 ) Sometimes the LLM response (generated code) tends to miss the ending ticks "```". Therefore causing the text parsing to fail due to not enough values to unpack. The 2 extra `_` don't add value and can cause errors. Suggest to simply update the `_, action, _` to just `action` then with index. Fixes issue #3057	2023-04-17 21:42:13 -07:00
Harrison Chase	db968284f8	tools refactor (#2961 ) Co-authored-by: vowelparrot <130414180+vowelparrot@users.noreply.github.com>	2023-04-17 21:35:29 -07:00
Sebastian	7a8c935b90	Edited for better readability (#3059 ) It looks like some dropdown functionality was intended, but it caused the markdown code to glitch which hurt readability.	2023-04-17 21:34:57 -07:00
Matthieu	822cdb161b	Adding shared chromaDB client option (#2886 ) This pull request addresses the need to share a single `chromadb.Client` instance across multiple instances of the `Chroma` class. By implementing a shared client, we can maintain consistency and reduce resource usage when multiple instances of the `Chroma` classes are created. This is especially relevant in a web app, where having multiple `Chroma` instances with a `persist_directory` leads to these clients not being synced. This PR implements this option while keeping the rest of the architecture unchanged. Changes: 1. Add a client attribute to the `Chroma` class to store the shared `chromadb.Client` instance. 2. Modify the `from_documents` method to accept an optional client parameter. 3. Update the `from_documents` method to use the shared client if provided or create a new client if not provided. Let me know if anything needs to be modified - thanks again for your work on this incredible repo	2023-04-17 21:22:39 -07:00
Harrison Chase	b140d366e3	Harrison/jira (#3055 ) Co-authored-by: William Li <32046231+zywilliamli@users.noreply.github.com> Co-authored-by: William Li <twelvehertz@Williams-MacBook-Air.local>	2023-04-17 21:14:40 -07:00
Amir Karimi	ae7ed31386	Fix redundancy check about config_type in AGENT_TO_CLASS (#2934 ) Fix of issue #2874	2023-04-17 21:05:48 -07:00
J Wynia	b40f90ea04	Spelling to correct conservation to conservation (#3049 ) Issue #3048 corrected spelling	2023-04-17 21:03:03 -07:00
leo-gan	c33883a40e	fixed the Cohere example title (#3053 ) - fixed the Cohere example title (bug in #3041, sorry for it) - fixed the runhouse.ipynb file name inconsistency	2023-04-17 21:02:52 -07:00
Harrison Chase	5107fac656	Harrison/rec gd (#3054 ) Co-authored-by: Benjamin Scholtz <BenSchZA@users.noreply.github.com>	2023-04-17 21:02:35 -07:00
Harrison Chase	eee2f23a79	Harrison/qa eg (#3052 ) Co-authored-by: Sukhpal Saini <bdcorps@users.noreply.github.com>	2023-04-17 20:56:42 -07:00
Harrison Chase	db7106cb79	Harrison/image caption loader (#3051 ) Co-authored-by: Sean Saito <saitosean@ymail.com>	2023-04-17 20:49:10 -07:00
Benjamin Scholtz	36138f28c8	Add GoogleSQL prompt (#2992 ) This PR extends upon @jzluo 's PR #2748 which addressed dialect-specific issues with SQL prompts, and adds a prompt that uses backticks for column names when querying BigQuery. See [GoogleSQL quoted identifiers](https://cloud.google.com/bigquery/docs/reference/standard-sql/lexical#quoted_identifiers). Additionally, the SQL agent currently uses a generic prompt. Not sure how best to adopt the same optional dialect-specific prompts as above, but will consider making an issue and PR for that too. See [langchain/agents/agent_toolkits/sql/prompt.py](langchain/agents/agent_toolkits/sql/prompt.py).	2023-04-17 20:44:54 -07:00
Naveen Tatikonda	bb619cd535	Pass kwargs to get OpenSearch client from_texts (#2993 ) ### Description Pass kwargs to get OpenSearch client from `from_texts` function ### Issues Resolved https://github.com/hwchase17/langchain/issues/2819 Signed-off-by: Naveen Tatikonda <navtat@amazon.com>	2023-04-17 20:44:30 -07:00
Harutaka Kawamura	ba9cc230fa	Stringify `AgentType` before saving to yaml (#2998 ) Code to reproduce the issue (with `langchain==0.0.141`): ```python from langchain.agents import initialize_agent, load_tools from langchain.llms import OpenAI llm = OpenAI(temperature=0.9, verbose=True) tools = load_tools(["llm-math"], llm=llm) agent = initialize_agent(tools, llm, agent="zero-shot-react-description", verbose=True) agent.save_agent("agent.yaml") with open("agent.yaml") as f: print(f.read()) ``` Output: ``` _type: !!python/object/apply:langchain.agents.agent_types.AgentType - zero-shot-react-description allowed_tools: - Calculator ... ``` I expected `_type` to be `zero-shot-react-description` but it's actually not. This PR fixes it by stringifying `AgentType` (`Enum`). Signed-off-by: harupy <hkawamura0130@gmail.com>	2023-04-17 20:43:39 -07:00
Nuno Campos	e25528c4f0	Fix incorrect value of outputKeys on AnalyzeDocumentsChain (#3010 )	2023-04-17 20:32:46 -07:00
engkheng	19febc77d6	Support inference of `input_variables` from `jinja2` template (#3013 ) `langchain.prompts.PromptTemplate` is unable to infer `input_variables` from jinja2 template. ```python # Using langchain v0.0.141 template_string = """\ Hello world Your variable: {{ var }} {# This will not get rendered #} {% if verbose %} Congrats! You just turned on verbose mode and got extra messages! {% endif %} """ template = PromptTemplate.from_template(template_string, template_format="jinja2") print(template.input_variables) # Output ['# This will not get rendered #', '% endif %', '% if verbose %'] ``` --------- Co-authored-by: engkheng <ongengkheng929@example.com>	2023-04-17 20:31:03 -07:00
Nuno Campos	dac32c59e5	Nc/combining output parser (#3014 ) Co-authored-by: vowelparrot <130414180+vowelparrot@users.noreply.github.com>	2023-04-17 20:29:53 -07:00
Nuno Campos	79bb5c4f95	Port format instructions fix from js (#3015 )	2023-04-17 20:29:17 -07:00
Harrison Chase	e3cf00b88b	redis from url (#3024 )	2023-04-17 20:28:12 -07:00
Davis Chase	19c85aa990	Factor out doc formatting and add validation (#3026 ) @cnhhoang850 slightly more generic fix for #2944, works for whatever the expected metadata keys are not just `source`	2023-04-17 20:28:01 -07:00
Naveen Tatikonda	3453b7457c	OpenSearch: Add Support for Boolean Filter with ANN search (#3038 ) ### Description Add Support for Boolean Filter with ANN search Documentation - https://opensearch.org/docs/latest/search-plugins/knn/filter-search-knn/#boolean-filter-with-ann-search ### Issues Resolved https://github.com/hwchase17/langchain/issues/2924 Signed-off-by: Naveen Tatikonda <navtat@amazon.com>	2023-04-17 20:26:26 -07:00
leo-gan	5420a0e404	updated langchain/docs/modules/models/llms/integrations/ notebooks (#3041 ) - Updated `langchain/docs/modules/models/llms/integrations/` notebooks: added links to the original sites, the install information, etc. - Added the `nlpcloud` notebook. - Removed "Example" from Titles of some notebooks, so all notebook titles are consistent.	2023-04-17 20:25:32 -07:00
Azam Iftikhar	471ef84835	Examples fixed (#3042 ) ### https://github.com/hwchase17/langchain/issues/2997 Replaced `conversation.memory.store` to `conversation.memory.entity_store.store` As conversation.memory.store doesn't exist and re-ran the whole file.	2023-04-17 20:25:01 -07:00

1 2 3 4 5 ...

1399 Commits