langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-03-18 11:07:36 +00:00

Author	SHA1	Message	Date
vowelparrot	116201d4f7	Update decay rate	2023-04-16 15:47:43 -07:00
vowelparrot	c28524d817	Re-run to test	2023-04-16 13:31:45 -07:00
Harrison Chase	62254e1438	cr	2023-04-15 21:07:36 -07:00
Harrison Chase	5f5670eb00	cr	2023-04-15 21:06:39 -07:00
vowelparrot	4ffc58e07b	Add similarity_search_with_normalized_similarities (#2916 ) Add a method that exposes a similarity search with corresponding normalized similarity scores. Implement only for FAISS now. ### Motivation: Some memory definitions combine `relevance` with other scores, like recency, importance, etc. While many (but not all) of the `VectorStore`s expose a `similarity_search_with_score` method, they don't all interpret the units of that score the same way (it depends on the distance metric and whether or not the embeddings are normalized). This PR proposes a `similarity_search_with_normalized_similarities` method that lets consumers of the vector store not have to worry about the metric and embedding scale. Most providers default to euclidean distance, with Pinecone being one exception (defaults to cosine _similarity_). --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-15 21:06:08 -07:00
Harrison Chase	478b5038cc	Merge branch 'vwp/characters' of github.com:hwchase17/langchain into vwp/characters	2023-04-15 20:57:27 -07:00
vowelparrot	4681f1f59f	cr	2023-04-15 18:56:38 -07:00
vowelparrot	06a66a0e84	Merge branch 'vwp/similarity_search_with_distances' into vwp/characters	2023-04-15 18:50:12 -07:00
vowelparrot	f0ce4c8450	cr	2023-04-15 18:49:51 -07:00
Harrison Chase	1d09d15fdd	stash	2023-04-15 18:25:17 -07:00
vowelparrot	585b6654e2	Add Default	2023-04-15 17:15:28 -07:00
Tim Asp	b9db20481f	Fix wrong token counts from `get_num_tokens` from openai llms (#2952 ) The encoding fetch was out of date. Luckily OpenAI has a nice [`encoding_for_model`](`46287bfa49/tiktoken/model.py`) function in `tiktoken` we can use now.	2023-04-15 16:09:17 -07:00
Tim Asp	fea5619ce9	Add title, lang, description to Web loader document metadata (#2955 ) Title, lang and description are on almost every web page, and are incredibly useful pieces of information that aren't currently captured by the web base loader. I thought about adding the title and description to the content of the document, as that content could be useful in search, but I left it out for right now. If you think it'd be worth adding, happy to add it. I've found it's nice to have the title/description in the metadata to have some structured data when retrieving rows from vectordbs for use with summary and source citation, so if we do want to add it to the `page_content`, I'd advocate for it to also be included in metadata.	2023-04-15 16:07:08 -07:00
Maciej Pióro	f7bf917baf	Fix missing docker-compose (#2899 ) Fix missing `docker-compose` command if only `docker compose` (note space) is available.	2023-04-15 16:05:11 -07:00
Harrison Chase	b634489b2e	bump version to 141 (#2950 ) v0.0.141	2023-04-15 12:56:39 -07:00
Harrison Chase	274b25c010	SVM retriever (#2947 ) (#2949 ) Add SVM retriever class, based on https://github.com/karpathy/randomfun/blob/master/knn_vs_svm.ipynb. Testing still WIP, but the logic is correct (I have a local implementation outside of Langchain working). --------- Co-authored-by: Lance Martin <122662504+PineappleExpress808@users.noreply.github.com> Co-authored-by: rlm <31treehaus@31s-MacBook-Pro.local>	2023-04-15 12:49:59 -07:00
Harrison Chase	baf350e32b	parametrize redis (#2946 )	2023-04-15 12:47:36 -07:00
dev2049	36aa7f30e4	Move PythonRepl -> langchain.utilities (#2917 )	2023-04-15 10:50:25 -07:00
dev2049	7c73e9df5d	Add kwargs to VectorStore.maximum_marginal_relevance (#2921 ) Same as similarity_search, allows child classes to add vector store-specific args (this was technically already happening in couple places but now typing is correct).	2023-04-15 10:49:49 -07:00
Davit Buniatyan	b3a5b51728	[minor] Deep Lake auth improvements in docs, kwargs pass, faster tests (#2927 ) Minor cosmetic changes - Activeloop environment cred authentication in notebooks with `getpass.getpass` (instead of the CLI, which doesn't always work) - much faster tests with Deep Lake pytest mode on - Deep Lake kwargs pass Notes - I put pytest environment creds inside `vectorstores/conftest.py`, but feel free to suggest a better location. For context, if I put them in `test_deeplake.py`, `ruff` doesn't let me set them before `import deeplake` --------- Co-authored-by: Davit Buniatyan <d@activeloop.ai>	2023-04-15 10:49:16 -07:00
Harrison Chase	c4ae8c1d24	bump ver to 140 (#2895 ) v0.0.140	2023-04-15 09:23:19 -07:00
Nahin Khan	ad3973a3b8	Fix typo (#2942 )	2023-04-15 08:53:25 -07:00
Harrison Chase	cf2789d86d	delete anthropic chat notebook (#2945 )	2023-04-15 08:48:51 -07:00
Hai Nguyen Mau	0aa828b1dc	typo fix (#2937 ) missing w in link	2023-04-15 08:31:43 -07:00
vowelparrot	e9cc8db66a	Merge branch 'vwp/similarity_search_with_distances' into vwp/characters	2023-04-15 08:14:22 -07:00
vowelparrot	d31d92acdd	Update __from	2023-04-15 08:13:49 -07:00
vowelparrot	a742c2ca3d	Update the twr implementation	2023-04-15 08:10:21 -07:00
Ankush Gola	ec59e9d886	Fix ChatAnthropic stop_sequences error (#2919 ) (#2920 ) Note to self: Always run integration tests, even on "that last minute change you thought would be safe" :) --------- Co-authored-by: Mike Lambert <mike.lambert@anthropic.com>	2023-04-14 17:22:01 -07:00
vowelparrot	265fb5f829	Add tests	2023-04-14 16:42:04 -07:00
Akash NP	13a0ed064b	add encoding to avoid UnicodeDecodeError (#2908 ) About: Specify encoding to avoid UnicodeDecodeError when reading .txt for users who are following the tutorial. Reference: ``` return codecs.charmap_decode(input,self.errors,decoding_table)[0] UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 1205: character maps to <undefined> ``` Environment: OS: Win 11, Python: 3.8	2023-04-14 16:36:03 -07:00
vowelparrot	285bd56d11	Update nits	2023-04-14 15:26:08 -07:00
vowelparrot	42ea0a0b33	Rm method	2023-04-14 15:25:14 -07:00
vowelparrot	8bd2434862	Update	2023-04-14 15:24:21 -07:00
Mike Lambert	392f1b3218	Add Anthropic ChatModel to langchain (#2293 ) * Adds an Anthropic ChatModel * Factors out common code in our LLMModel and ChatModel * Supports streaming llm-tokens to the callbacks on a delta basis (until a future V2 API does that for us) * Some fixes	2023-04-14 15:09:07 -07:00
Kwuang Tang	66bef1d7ed	Ignore files from .gitignore in Git loader (#2909 ) fixes #2905 extends #2851	2023-04-14 15:02:21 -07:00
vowelparrot	fb677984f1	Merge branch 'vwp/similarity_search_with_distances' into vwp/characters	2023-04-14 14:48:46 -07:00
vowelparrot	8b74c6e54a	Add similarity_search_with_normalized_similarities Add a method that exposes a similarity search with corresponding normalized similarity scores.	2023-04-14 14:47:28 -07:00
Boris Feld	7ee87eb0c8	Comet callback updates (#2889 ) I'm working with @DN6 and I made some small fixes and improvements after playing with the integration.	2023-04-14 13:19:58 -07:00
dev2049	634358db5e	Fix OpenAI LLM docstring (#2910 )	2023-04-14 11:09:36 -07:00
pranjaldoshi96	30573b2e30	Correct instruction to use openweathermap utility in docstring (#2906 ) Co-authored-by: Pranjal Doshi <pranjald@nvidia.com>	2023-04-14 10:46:20 -07:00
Kwuang Tang	a508afa91c	Add file filter param to Git loader (#2904 ) Allows users to specify what files should be loaded instead of indiscriminately loading the entire repo. extends #2851 NOTE: for reviewers, `hide whitespace` option recommended since I changed the indentation of an if-block to use `continue` instead so it looks less like a Christmas tree :)	2023-04-14 10:45:54 -07:00
Ismail Pelaseyed	7e525a3b91	Add link to repo for deploying LangChain to DigitalOcean App Platform (#2894 ) This PR adds a link to a minimal example of deploying `LangChain` to `DigitalOcean App Platform`.	2023-04-14 08:55:21 -07:00
Peter Stolz	ccacf804a8	Fix format string in pinecone error handling (#2897 )	2023-04-14 08:53:02 -07:00
Francis Felici	86189cdcf9	Update load_qa_chain() docstring (#2900 ) Seems to be missing `map_rerank` as a potential argument of `chain_type`	2023-04-14 08:51:30 -07:00
Harrison Chase	8fef69296d	nits (#2873 )	2023-04-14 07:55:12 -07:00
Harrison Chase	0a38bbc750	updates to vectorstore memory (#2875 )	2023-04-14 07:54:57 -07:00
Ikko Eltociear Ashimine	203c0eb2ae	docs: update getting_started.ipynb (#2883 ) HuggingFace -> Hugging Face	2023-04-14 07:40:26 -07:00
ecneladis	1a44b71ddf	Fix Baby AGI notebooks (#2882 ) - fix broken notebook cell in `ae485b623d` - Python Black formatting	2023-04-14 07:40:04 -07:00
Harrison Chase	2c4138f80a	cr	2023-04-13 23:38:14 -07:00
Harrison Chase	ee377f4029	stash	2023-04-13 23:25:54 -07:00
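
Note on commit 4ffc58e07b: the message explains that raw vector store scores are hard to combine with other signals because their units depend on the distance metric and on whether the embeddings are normalized. The sketch below illustrates that idea only; it is not the code from the PR. The helper names (`euclidean_to_relevance`, `cosine_to_relevance`) and the choice of a [0, 1] target range are assumptions made for this example, and the Euclidean mapping assumes unit-length embeddings.

```python
import math


def normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def euclidean_distance(a, b):
    """Plain Euclidean (L2) distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors, in [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def euclidean_to_relevance(distance):
    """Hypothetical mapping: the distance between two unit vectors lies
    in [0, 2], so 1 - d/2 lies in [0, 1] with 1 meaning identical."""
    return 1.0 - distance / 2.0


def cosine_to_relevance(similarity):
    """Hypothetical mapping: shift cosine similarity from [-1, 1] to [0, 1]."""
    return (similarity + 1.0) / 2.0


query = normalize([3.0, 4.0])
same = normalize([6.0, 8.0])         # same direction as the query
opposite = normalize([-3.0, -4.0])   # opposite direction

# Both mappings agree at the extremes, so a consumer can treat either
# score as "higher is more relevant" without knowing the metric.
rel_same_l2 = euclidean_to_relevance(euclidean_distance(query, same))
rel_same_cos = cosine_to_relevance(cosine_similarity(query, same))
rel_opp_l2 = euclidean_to_relevance(euclidean_distance(query, opposite))
rel_opp_cos = cosine_to_relevance(cosine_similarity(query, opposite))
```

The two mappings agree at the endpoints but not in between, so scores from stores that use different metrics are comparable in direction and range, not in exact value.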


1350 Commits
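
Note on commit 13a0ed064b: the traceback quoted in that message (`'charmap' codec can't decode byte 0x9d`) is what Python raises when a UTF-8 file is read with a Windows code page such as cp1252. The sketch below reproduces the failure and the fix of passing an explicit encoding. The `read_text` helper and the sample text are hypothetical and are not taken from the commit.

```python
import os
import tempfile


def read_text(path, encoding="utf-8"):
    """Read a text file with an explicit encoding instead of the
    platform default (which is a legacy code page on many Windows setups)."""
    with open(path, encoding=encoding) as handle:
        return handle.read()


# A right double quotation mark encodes to the bytes e2 80 9d in UTF-8;
# byte 0x9d has no mapping in cp1252.
text = "She said \u201chello\u201d"

with tempfile.NamedTemporaryFile("wb", suffix=".txt", delete=False) as tmp:
    tmp.write(text.encode("utf-8"))
    sample_path = tmp.name

try:
    read_text(sample_path, encoding="cp1252")
    cp1252_failed = False
except UnicodeDecodeError:
    # Same class of error as the traceback quoted in the commit message.
    cp1252_failed = True

decoded = read_text(sample_path)  # explicit UTF-8 succeeds
os.remove(sample_path)
```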