langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-03-18 11:07:36 +00:00

Author	SHA1	Message	Date
Harrison Chase	e12e00df12	use output parsers in agents (#2987 )	2023-04-16 13:15:21 -07:00
Mauricio Scheffer	7302787a7b	Fix docs for parse_with_prompt (#2986 )	2023-04-16 12:57:04 -07:00
Azam Iftikhar	1e655d5ffd	Fixed Regular expression (#2933 ) ### https://github.com/hwchase17/langchain/issues/2898 Instead of `"Action" and "Action Input"` keywords, we are getting `"Action 1" and "Action 1 Input" or "Action Input 1" ` from gpt-3.5-turbo Updated the Regular expression to handle all these cases Attaching the screenshot of the result from the updated Regular expression. <img width="1036" alt="Screenshot 2023-04-16 at 1 39 00 AM" src="https://user-images.githubusercontent.com/55012400/232251184-23ca6cc2-7229-411a-b6e1-53b2f5ec18a5.png">	2023-04-16 09:16:50 -07:00
Harrison Chase	88d3ce12b8	Harrison/diffbot (#2984 ) Co-authored-by: Manuel Saelices <msaelices@gmail.com>	2023-04-16 09:11:24 -07:00
vowelparrot	5ca7ce77cd	Remove pythonrepl from LLM-MathChain (#2943 ) Use numexpr evaluate instead of the python REPL to avoid malicious code injection. Tested against the (limited) math dataset and got the same score as before. For more permissive tools (like the REPL tool itself), other approaches ought to be provided (some combination of Sanitizer + Restricted python + unprivileged-docker + ...), but for a calculator tool, only mathematical expressions should be permitted. See https://github.com/hwchase17/langchain/issues/814	2023-04-16 08:50:32 -07:00
Chetanya Rastogi	aead062a70	Add an example tutorial for using PDFMinerPDFasHTMLLoader (#2960 ) Last week I added the `PDFMinerPDFasHTMLLoader`. I am adding some example code in the notebook to serve as a tutorial for how that loader can be used to create snippets of a pdf that are structured within sections. All the other loaders only provide the `Document` objects segmented by pages but that's pretty loose given the amount of other metadata that can be extracted. With the new loader, one can leverage font-size of the text to decide when a new sections starts and can segment the text more semantically as shown in the tutorial notebook. The cell shows that we are able to find the content of entire section under Related Work for the example pdf which is spread across 2 pages and hence is stored as two separate documents by other loaders	2023-04-16 08:34:39 -07:00
Nahin Khan	9a03f00e6c	Fix typos (#2977 )	2023-04-16 08:28:36 -07:00
Harrison Chase	274b25c010	SVM retriever (#2947 ) (#2949 ) Add SVM retriever class, based on https://github.com/karpathy/randomfun/blob/master/knn_vs_svm.ipynb. Testing still WIP, but the logic is correct (I have a local implementation outside of Langchain working). --------- Co-authored-by: Lance Martin <122662504+PineappleExpress808@users.noreply.github.com> Co-authored-by: rlm <31treehaus@31s-MacBook-Pro.local>	2023-04-15 12:49:59 -07:00
Davit Buniatyan	b3a5b51728	[minor] Deep Lake auth improvements in docs, kwargs pass, faster tests (#2927 ) Minor cosmetic changes - Activeloop environment cred authentication in notebooks with `getpass.getpass` (instead of CLI which not always works) - much faster tests with Deep Lake pytest mode on - Deep Lake kwargs pass Notes - I put pytest environment creds inside `vectorstores/conftest.py`, but feel free to suggest a better location. For context, if I put in `test_deeplake.py`, `ruff` doesn't let me to set them before import deeplake --------- Co-authored-by: Davit Buniatyan <d@activeloop.ai>	2023-04-15 10:49:16 -07:00
Harrison Chase	c4ae8c1d24	bump ver to 140 (#2895 )	2023-04-15 09:23:19 -07:00
Nahin Khan	ad3973a3b8	Fix typo (#2942 )	2023-04-15 08:53:25 -07:00
Harrison Chase	cf2789d86d	delete antropic chat notebook (#2945 )	2023-04-15 08:48:51 -07:00
Hai Nguyen Mau	0aa828b1dc	typo fix (#2937 ) missing w in link	2023-04-15 08:31:43 -07:00
Ankush Gola	ec59e9d886	Fix ChatAnthropic stop_sequences error (#2919 ) (#2920 ) Note to self: Always run integration tests, even on "that last minute change you thought would be safe" :) --------- Co-authored-by: Mike Lambert <mike.lambert@anthropic.com>	2023-04-14 17:22:01 -07:00
Akash NP	13a0ed064b	add encoding to avoid UnicodeDecodeError (#2908 ) About Specify encoding to avoid UnicodeDecodeError when reading .txt for users who are following the tutorial. Reference ``` return codecs.charmap_decode(input,self.errors,decoding_table)[0] UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 1205: character maps to <undefined> ``` Environment OS: Win 11 Python: 3.8	2023-04-14 16:36:03 -07:00
Boris Feld	7ee87eb0c8	Comet callback updates (#2889 ) I'm working with @DN6 and I made some small fixes and improvements after playing with the integration.	2023-04-14 13:19:58 -07:00
Kwuang Tang	a508afa91c	Add file filter param to Git loader (#2904 ) Allows users to specify what files should be loaded instead of indiscriminately loading the entire repo. extends #2851 NOTE: for reviewers, `hide whitespace` option recommended since I changed the indentation of an if-block to use `continue` instead so it looks less like a Christmas tree :)	2023-04-14 10:45:54 -07:00
Ismail Pelaseyed	7e525a3b91	Add link to repo for deploying LangChain to Digitalocean App Platform (#2894 ) This PR adds a link to a minimal example of deploying `LangChain` to `Digitalocean App Platform`.	2023-04-14 08:55:21 -07:00
Harrison Chase	8fef69296d	nits (#2873 )	2023-04-14 07:55:12 -07:00
Harrison Chase	0a38bbc750	updates to vectorstore memory (#2875 )	2023-04-14 07:54:57 -07:00
Ikko Eltociear Ashimine	203c0eb2ae	docs: update getting_started.ipynb (#2883 ) HuggingFace -> Hugging Face	2023-04-14 07:40:26 -07:00
ecneladis	1a44b71ddf	Fix Baby AGI notebooks (#2882 ) - fix broken notebook cell in `ae485b623d` - Python Black formatting	2023-04-14 07:40:04 -07:00
Nicolas	3c7204d604	docs: Quick fix to Mendable Search (#2876 ) Fixed a small issue on the icon UI when using in Safari.	2023-04-13 23:15:57 -07:00
Harrison Chase	07d7096de6	Harrison/playwright (#2871 ) Co-authored-by: Manuel Saelices <msaelices@gmail.com>	2023-04-13 22:15:03 -07:00
ecneladis	74abeb8c53	Update output in Git notebook (#2868 ) Supplemental to https://github.com/hwchase17/langchain/pull/2851. Updates one notebook cell that I forgot to commit before.	2023-04-13 21:56:17 -07:00
Nicolas	0226b375d9	docs: Mendable Search integration (#2803 ) Mendable Seach Integration is Finally here! Hey yall, After various requests for Mendable in Python docs, we decided to get our hands dirty and try to implement it. Here is a version where we implement our floating button that sits on the bottom right of the screen that once triggered (via press or CMD K) will work the same as the js langchain docs. Super excited about this and hopefully the community will be too. @hwchase17 will send you the admin details via dm etc. The anon_key is fine to be public. Let me know if you need any further customization. I added the langchain logo to it.	2023-04-13 21:52:25 -07:00
ecneladis	016738e676	Add GitLoader (#2851 )	2023-04-13 21:39:20 -07:00
vowelparrot	bf0887c486	Add Slack Directory Loader (#2841 ) Fixes linting issue from #2835 Adds a loader for Slack Exports which can be a very valuable source of knowledge to use for internal QA bots and other use cases. ```py # Export data from your Slack Workspace first. from langchain.document_loaders import SLackDirectoryLoader SLACK_WORKSPACE_URL = "https://awesome.slack.com" loader = ("Slack_Exports", SLACK_WORKSPACE_URL) docs = loader.load() ```	2023-04-13 21:31:59 -07:00
Benjamin Tan Wei Hao	c26a259ba6	Fix tiny typo (#2863 )	2023-04-13 20:26:26 -07:00
Jon Luo	f3180f05f9	Update sql chain notebook to clarify use of SQLAlchemy for connections (#2850 ) Have seen questions about whether or not the `SQLDatabaseChain` supports more than just sqlite, which was unclear in the docs, so tried to clarify that and how to connect to other dialects.	2023-04-13 11:46:59 -07:00
leo-gan	ecc1a0c051	added code-analysis-deeplake.ipynb (#2844 ) This notebook is heavily copied from the `twitter-the-algorithm-analysis-deeplake.ipynb`	2023-04-13 11:29:59 -07:00
Tim Asp	70ffe470aa	Add easy print method to openai callback (#2848 ) Found myself constantly copying the snippet outputting all the callback tracking details. so adding a simple way to output the full context	2023-04-13 11:28:42 -07:00
vowelparrot	82d1d5f24e	Fix grammar in Vector Memory Docs (#2847 )	2023-04-13 11:00:09 -07:00
Tim Asp	53dc157145	[Docs] minor fixes to loaders links and rst warnings (#2846 ) The doc loaders index was picking up a bunch of subheadings because I mistakenly made the MD titles H1s. Fixed that. also the easy minor warnings from docs_build	2023-04-13 10:54:40 -07:00
Harrison Chase	1609950597	Harrison/retriever memory (#2804 ) Co-authored-by: vowelparrot <130414180+vowelparrot@users.noreply.github.com>	2023-04-13 10:03:43 -07:00
Rounak Datta	7688bf9182	WhatsApp document loader - update regex (#2776 ) I was testing out the WhatsApp Document loader, and noticed that sometimes the date is of the following format (notice the additional underscore): ``` 3/24/23, 1:54_PM - +91 99999 99999 joined using this group's invite link 3/24/23, 6:29_PM - +91 99999 99999: When are we starting then? ``` Wierdly, the underscore is visible in Vim, but not on editors like VSCode. I presume it is some unusual character/line terminator. Nevertheless, I think handling this edge case will make the document loader more robust.	2023-04-13 09:48:32 -07:00
vowelparrot	2db9b7a45d	Revert "Add Slack Directory Loader (#2835 )" (#2839 ) This reverts commit `a6f767ae7a`. To fix the linting error.	2023-04-13 09:42:54 -07:00
Azam Iftikhar	2a89dc8c1c	Fixing factually incorrect example (#2810 ) ### https://github.com/hwchase17/langchain/issues/2802 It appears that Google's Flan model may not perform as well as other models, I used a simple example to get factually correct answer.	2023-04-13 08:42:39 -07:00
vowelparrot	a6f767ae7a	Add Slack Directory Loader (#2835 ) Adds a loader for Slack Exports which can be a very valuable source of knowledge to use for internal QA bots and other use cases. ```py # Export data from your Slack Workspace first. from langchain.document_loaders import SLackDirectoryLoader SLACK_WORKSPACE_URL = "https://awesome.slack.com" loader = ("Slack_Exports", SLACK_WORKSPACE_URL) docs = loader.load() ``` --------- Co-authored-by: Mikhail Dubov <mikhail@chattermill.io>	2023-04-13 08:39:07 -07:00
Preetesh Jain	61858c5a08	Fix headings in docs (ClearML and Comet) (#2808 ) This PR fixes the document structure in the [Ecosystem](https://python.langchain.com/en/latest/ecosystem.html) page. Also adds a fix for the heading on the [Comet](https://python.langchain.com/en/latest/ecosystem/comet_tracking.html) page for more consistency with other ecosystem tools. ## Screenshot <img width="878" alt="image" src="https://user-images.githubusercontent.com/6207830/231674921-9bf25376-cf14-4dba-be3c-08e0abda6154.png"> <img width="869" alt="image" src="https://user-images.githubusercontent.com/6207830/231675105-d8e42df4-2d01-435b-9e09-3371522fd2ce.png">	2023-04-13 08:24:16 -07:00
Harrison Chase	9a96691803	cr	2023-04-13 08:23:33 -07:00
Harrison Chase	1bb0706955	Harrison/comet ml (#2799 ) Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: Boris Feld <lothiraldan@gmail.com>	2023-04-12 21:21:51 -07:00
Harrison Chase	b2bc5ef56a	agent refactor (#2801 )	2023-04-12 21:21:41 -07:00
Harrison Chase	e49f1e628c	Harrison/gpt cache (#2744 ) Co-authored-by: SimFG <bang.fu@zilliz.com>	2023-04-12 14:16:58 -07:00
Harrison Chase	425c437cd3	cr	2023-04-12 13:46:58 -07:00
Harrison Chase	a2d729e537	cr	2023-04-12 13:44:21 -07:00
Harrison Chase	7adbc4fbb4	agent memory (#2792 )	2023-04-12 12:51:15 -07:00
wangml999	fa0c9390c2	Update custom_agent.ipynb (#2767 ) Fixed an issue the agent is not taking the user's question as input.	2023-04-12 09:13:46 -07:00
Nuhman Pk	789cc314c5	Typo (#2747 )	2023-04-12 09:06:30 -07:00
Harrison Chase	b92a89e29f	cr	2023-04-11 23:52:14 -07:00

1 2 3 4 5 ...

629 Commits