langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-04-03 19:04:23 +00:00

Author	SHA1	Message	Date
Mohammad Mohtashim	f3dd4a10cf	DROP BOX Loader Documentation Update (#14047 ) - Description: Update the document for drop box loader + made the messages more verbose when loading pdf file since people were getting confused - Issue: #13952 - Tag maintainer: @baskaryan, @eyurtsev, @hwchase17, --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-29 17:25:35 -08:00
Cheng (William) Huang	a00db4b28f	Add multi-input Reddit search tool (#13893 ) - Description: Added a tool called RedditSearchRun and an accompanying API wrapper, which searches Reddit for posts with support for time filtering, post sorting, query string and subreddit filtering. - Issue: #13891 - Dependencies: `praw` module is used to search Reddit - Tag maintainer: @baskaryan , and any of the other maintainers if needed - Twitter handle: None. Hello, This is our first PR and we hope that our changes will be helpful to the community. We have run `make format`, `make lint` and `make test` locally before submitting the PR. To our knowledge, our changes do not introduce any new errors. Our PR integrates the `praw` package which is already used by RedditPostsLoader in LangChain. Nonetheless, we have added integration tests and edited unit tests to test our changes. An example notebook is also provided. These changes were put together by me, @Anika2000, @CharlesXu123, and @Jeremy-Cheng-stack Thank you in advance to the maintainers for their time. --------- Co-authored-by: What-Is-A-Username <49571870+What-Is-A-Username@users.noreply.github.com> Co-authored-by: Anika2000 <anika.sultana@mail.utoronto.ca> Co-authored-by: Jeremy Cheng <81793294+Jeremy-Cheng-stack@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-29 20:16:40 -05:00
Jawad Arshad	00a6e8962c	langchain[minor]: Add serpapi tools (#13934 ) - Description: Added some of the more endpoints supported by serpapi that are not suported on langchain at the moment, like google trends, google finance, google jobs, and google lens - Issue: [Add support for many of the querying endpoints with serpapi #11811](https://github.com/langchain-ai/langchain/issues/11811) --------- Co-authored-by: zushenglu <58179949+zushenglu@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Ian Xu <ian.xu@mail.utoronto.ca> Co-authored-by: zushenglu <zushenglu1809@gmail.com> Co-authored-by: KevinT928 <96837880+KevinT928@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 14:02:57 -08:00
h3l	dbaeb163aa	langchain[minor]: add volcengine endpoint as LLM (#13942 ) - Description: Volc Engine MaaS serves as an enterprise-grade, large-model service platform designed for developers. You can visit its homepage at https://www.volcengine.com/docs/82379/1099455 for details. This change will facilitate developers to integrate quickly with the platform. - Issue: None - Dependencies: volcengine - Tag maintainer: @baskaryan - Twitter handle: @he1v3tica --------- Co-authored-by: lvzhong <lvzhong@bytedance.com>	2023-11-29 13:16:42 -08:00
Mohammad Ahmad	1600ebe6c7	langchain[patch]: Mask API key for ForeFrontAI LLM (#14013 ) - Description: Mask API key for ForeFrontAI LLM and associated unit tests - Issue: https://github.com/langchain-ai/langchain/issues/12165 - Dependencies: N/A - Tag maintainer: @eyurtsev - Twitter handle: `__mmahmad__` I made the API key non-optional since linting required adding validation for None, but the key is required per documentation: https://python.langchain.com/docs/integrations/llms/forefrontai	2023-11-29 13:12:19 -08:00
yoch	a0e859df51	langchain[patch]: fix cohere reranker init #12899 (#14029 ) - Description: use post field validation for `CohereRerank` - Issue: #12899 and #13058 - Dependencies: - Tag maintainer: @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 12:57:06 -08:00
123-fake-st	9bd6e9df36	update pdf document loaders' metadata source to url for online pdf (#13274 ) - Description: Update 5 pdf document loaders in `langchain.document_loaders.pdf`, to store a url in the metadata (instead of a temporary, local file path) if the user provides a web path to a pdf: `PyPDFium2Loader`, `PDFMinerLoader`, `PDFMinerPDFasHTMLLoader`, `PyMuPDFLoader`, and `PDFPlumberLoader` were updated. - The updates follow the approach used to update `PyPDFLoader` for the same behavior in #12092 - The `PyMuPDFLoader` changes required additional work in updating `langchain.document_loaders.parsers.pdf.PyMuPDFParser` to be able to process either an `io.BufferedReader` (from local pdf) or `io.BytesIO` (from online pdf) - The `PDFMinerPDFasHTMLLoader` change used a simpler approach since the metadata is assigned by the loader and not the parser - Issue: Fixes #7034 - Dependencies: None ```python # PyPDFium2Loader example: # old behavior >>> from langchain.document_loaders import PyPDFium2Loader >>> loader = PyPDFium2Loader('https://arxiv.org/pdf/1706.03762.pdf') >>> docs = loader.load() >>> docs[0].metadata {'source': '/var/folders/7z/d5dt407n673drh1f5cm8spj40000gn/T/tmpm5oqa92f/tmp.pdf', 'page': 0} # new behavior >>> from langchain.document_loaders import PyPDFium2Loader >>> loader = PyPDFium2Loader('https://arxiv.org/pdf/1706.03762.pdf') >>> docs = loader.load() >>> docs[0].metadata {'source': 'https://arxiv.org/pdf/1706.03762.pdf', 'page': 0} ```	2023-11-29 15:07:46 -05:00
Toshish Jawale	6f64cb5078	Remove deprecated param and flexibility for prompt (#13310 ) - Description: Updated to remove deprecated parameter penalty_alpha, and use string variation of prompt rather than json object for better flexibility. - Issue: the issue # it fixes (if applicable), - Dependencies: N/A - Tag maintainer: @eyurtsev - Twitter handle: @symbldotai --------- Co-authored-by: toshishjawale <toshish@symbl.ai> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-29 14:48:25 -05:00
Tomaz Bratanic	3eb391561b	langchain[minor]: Reduce the number of tokens required to describe a Cypher/Neo4j schema (#13851 ) Instead of using JSON-like syntax to describe node and relationship properties we changed to a shorter and more concise schema description Old: ``` Node properties are the following: [{'properties': [{'property': 'name', 'type': 'STRING'}], 'labels': 'Movie'}, {'properties': [{'property': 'name', 'type': 'STRING'}], 'labels': 'Actor'}] Relationship properties are the following: [] The relationships are the following: ['(:Actor)-[:ACTED_IN]->(:Movie)'] ``` New: ``` Node properties are the following: Movie {name: STRING},Actor {name: STRING} Relationship properties are the following: The relationships are the following: (:Actor)-[:ACTED_IN]->(:Movie) ```	2023-11-29 11:13:12 -08:00
Sauhaard	7ec4dbeb80	langchain[minor]: Add StackExchange API integration (#14002 ) Implements [#12115](https://github.com/langchain-ai/langchain/issues/12115) Who can review? @baskaryan , @eyurtsev , @hwchase17 Integrated Stack Exchange API into Langchain, enabling access to diverse communities within the platform. This addition enhances Langchain's capabilities by allowing users to query Stack Exchange for specialized information and engage in discussions. The integration provides seamless interaction with Stack Exchange content, offering content from varied knowledge repositories. A notebook example and test cases were included to demonstrate the functionality and reliability of this integration. - Add StackExchange as a tool. - Add unit test for the StackExchange wrapper and tool. - Add documentation for the StackExchange wrapper and tool. If you have time, could you please review the code and provide any feedback as necessary! My team is welcome to any suggestions. --------- Co-authored-by: Yuval Kamani <yuvalkamani@gmail.com> Co-authored-by: Aryan Thakur <aryanthakur@Aryans-MacBook-Pro.local> Co-authored-by: Manas1818 <79381912+manas1818@users.noreply.github.com> Co-authored-by: aryan-thakur <61063777+aryan-thakur@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 10:32:07 -08:00
Bagatur	d4405bc94e	langchain[patch]: Release 0.0.343 (#14037 )	2023-11-29 10:31:03 -08:00
Yves Zumbühl	9c0ad0cebb	langchain[patch]: Improve HyDe with custom prompts and ability to supply the run_manager (#14016 ) - Description: The class allows to only select between a few predefined prompts from the paper. That is not ideal, since other use cases might need a custom prompt. The changes made allow for this. To be able to monitor those, I also added functionality to supply a custom run_manager. - Issue: no issue, but a new feature, - Dependencies: none, - Tag maintainer: @hwchase17, - Twitter handle: @yvesloy --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 09:40:53 -08:00
Chad Norvell	1c4bfb8c5f	langchain[patch]: Mathpix PDF loader supports arbitrary extra params (#13950 ) - Description: Support providing whatever extra parameters you want to the Mathpix PDF loader API request. - Issue: #12773 - Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 02:12:32 -08:00
Unai Garay Maestre	9e2ae866c4	langchain[patch]: Adds progress bar to GooglePalmEmbeddings (#13812 ) - Description: Adds a tqdm progress bar to GooglePalmEmbeddings when embedding a list. - Issue: #13637 - Dependencies: TQDM as a main dependency (instead of extra) Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> --------- Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-29 01:58:53 -08:00
David Norman	a578076aea	Mask api key for Together LLM (#13981 ) - Description: Add unit tests and mask api key for Together LLM - Issue: the issue https://github.com/langchain-ai/langchain/issues/12165 , - Dependencies: N/A - Tag maintainer: ?, - Twitter handle: N/A --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-11-28 22:57:40 -05:00
Johnny	6463d2d0bd	small fix matching engine AttributeError - object has no attribute (#13763 ) This PR is fixing an attributeError: object endpoint has no attribute "_public_match_client" when using gcp matching engine with private VPC network. @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-28 22:42:29 -05:00
Amyh102	750485eaa8	Add object parsing functionality (#13864 ) * Description: Parses huggingface dataset Sequence objects into strings for Document loading. * Issue: Fixes #10674 * Tag maintainter: @baskaryan @eyurtsev --------- Co-authored-by: Amy Han <amyhan@Amys-Air.lan> Co-authored-by: Amy Han <amyhan@Amys-MacBook-Air.local>	2023-11-28 22:33:16 -05:00
ggeutzzang	981f78f920	Fix: (issue #13825 ) Getting an error with DallEAPIWrapper (#13874 ) - Description: As of OpenAI's Python package 1.0, the existing DallEAPIWrapper does not work correctly, so the example in the LangChain Documentation link below does not work either. https://python.langchain.com/docs/integrations/tools/dalle_image_generator Also, since OpenAI only supports DALL-E version 2 or version 3, I modified the DallEAPIWrapper to support it. - Issue: #13825 - Twitter handle: ggeutzzang	2023-11-28 22:31:25 -05:00
Kunal	74045bf5c0	max length attribute for spacy splitter for large docs (#13875 ) For large size documents spacy splitter doesn't work it throws an error as shown in below screenshot. Reason its default max_length is 1000000 and there is no option to increase it. So i added it in this PR. ![image](https://github.com/langchain-ai/langchain/assets/73680423/613625c3-0e21-4834-9aad-2a73cf56eecc) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-28 22:30:26 -05:00
Wang Wei	fe9341a29c	feat: Add ERNIE-Bot-8K model support for ErnieBotChat. (#13716 ) - Description: According to the document https://cloud.baidu.com/doc/WENXINWORKSHOP/s/6lp69is2a, add ERNIE-Bot-8K model support for ErnieBotChat. - Dependencies: Before using the ERNIE-Bot-8K, you should have the model's access authority.	2023-11-28 22:22:23 -05:00
Burak Ömür	0e462b72ef	Update openai/create_llm_result function to consider kwargs (#13815 ) Replace this entire comment with: - Description: updates `create_llm_result` function within `openai.py` to consider latest `params`, - Issue: #8928 - Dependencies: -, - Tag maintainer: - - Twitter handle: [burkomr](https://twitter.com/burkomr) <!-- If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Burak Ömür <burakomur@retorio.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-28 22:02:38 -05:00
chyroc	f97ab84c6b	Merge pull request #13907 * feat: mask api_key for jina	2023-11-28 21:24:50 -05:00
nhywieza	9b86fb3fcb	secretStr for baichuan chat model api key (#13946 ) Merge pull request #13946 * secretStr for baichuan chat model api key	2023-11-28 21:20:23 -05:00
卢靖轩	aff1dba252	Merge pull request #13945 * feat: mask api key for nlpcloud	2023-11-28 21:16:36 -05:00
Leonid Kuligin	85bb3a418c	Switched VertexAI models from preview (#13657 ) Replace this entire comment with: - Description: VertexAI models are now GA, moved away from using preview ones from the SDK - Issue: #13606 --------- Co-authored-by: Nuno Campos <nuno@boringbits.io>	2023-11-28 20:38:04 -05:00
Erick Friis	5eca1bd93f	Library Licenses (#13300 ) Same change as #8403 but in other libs also updates (c) LangChain Inc. instead of @hwchase17	2023-11-28 17:34:27 -08:00
Bagatur	14799b139a	infra[patch]: add base deps and fix docs lint (#13998 )	2023-11-28 17:27:37 -08:00
Théo LEBRUN	926d4cfda7	Set default region from boto3 session for Bedrock (#13694 ) - Description: Set default region from boto3 session for Bedrock - Issue: #13683	2023-11-28 20:26:54 -05:00
Snow	1a33e5b500	Repair Wikipedia document loader `load_max_docs` and improve test coverage. (#13769 ) Description: Repair Wikipedia document loader `load_max_docs` and improve test coverage. Issue: The Wikipedia document loader was not respecting the `load_max_docs` paramater (not reported) and would always return a maximum of 10 documents. This is because the API wrapper (in `utilities/wikipedia.py`) wasn't passing `top_k_results` to the underlying [Wikipedia library](https://wikipedia.readthedocs.io/en/latest/code.html#module-wikipedia). By default this library returns 10 results. The default number of results for the document loader has been reduced from 100 to 25. This is because loading 100 results takes a very long time and is an inconvenient default. It should possibly be 10. In addition, the documentation for the loader reported that there was a hard limit (300) on the number of documents returned. In actuality 300 is the maximum Wikipedia query character length set by the API wrapper. Tests have been added for the document loader (previously missing) and to test the correct numbers of documents are being returned by each class, both by default, and when overridden. Also repaired is the `assert_docs` test which has been updated to correctly test for the default metadata (which includes `source` in recent releases). Dependencies: nil Tag maintainer: @leo-gan Twitter handle: @queenvictoria	2023-11-28 20:26:40 -05:00
Bob Lin	04c4878306	Remove `python_repl` from _BASE_TOOLS (#13962 ) ### Description: Previously `python_repl` was a built-in tool, but now it has been moved to `langchain_experimental`. When I use `load_tools` I get an error: ```python In [1]: from langchain.agents import load_tools In [2]: load_tools(["python_repl"]) --------------------------------------------------------------------------- ImportError Traceback (most recent call last) Cell In[2], line 1 ----> 1 load_tools(["python_repl"]) File ~/workspace/langchain/libs/langchain/langchain/agents/load_tools.py:530, in load_tools(tool_names, llm, callbacks, kwargs) 528 tool_names.extend(requests_method_tools) 529 elif name in _BASE_TOOLS: --> 530 tools.append(_BASE_TOOLS[name]()) 531 elif name in _LLM_TOOLS: 532 if llm is None: File ~/workspace/langchain/libs/langchain/langchain/agents/load_tools.py:84, in _get_python_repl() 83 def _get_python_repl() -> BaseTool: ---> 84 raise ImportError( 85 "This tool has been moved to langchain experiment. " 86 "This tool has access to a python REPL. " 87 "For best practices make sure to sandbox this tool. " 88 "Read https://github.com/langchain-ai/langchain/blob/master/SECURITY.md " 89 "To keep using this code as is, install langchain experimental and " 90 "update relevant imports replacing 'langchain' with 'langchain_experimental'" 91 ) ImportError: This tool has been moved to langchain experiment. This tool has access to a python REPL. For best practices make sure to sandbox this tool. Read https://github.com/langchain-ai/langchain/blob/master/SECURITY.md To keep using this code as is, install langchain experimental and update relevant imports replacing 'langchain' with 'langchain_experimental' ``` In this case, it will be very confusing. I think it is no longer a built-in tool now, so it can be removed from `_BASE_TOOLS` ### Issue: https://github.com/langchain-ai/langchain/issues/13858, https://github.com/langchain-ai/langchain/issues/13859, https://github.com/langchain-ai/langchain/issues/13856 ### Twitter handle:** [lin_bob57617](https://twitter.com/lin_bob57617)	2023-11-28 20:13:54 -05:00
Leonid Ganeline	52eee458bb	renamed `google_vertex_ai_vector_search` notebook (#13484 ) The `integrations/vectorstores/matchingengine.ipynb` example has the "Google Vertex AI Vector Search" title. This place this Title in the wrong order in the ToC (it is sorted by the file name). - Renamed `integrations/vectorstores/matchingengine.ipynb` into `integrations/vectorstores/google_vertex_ai_vector_search.ipynb`. - Updated a correspondent comment in docstring - Rerouted old URL to a new URL --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-28 16:58:29 -08:00
Leonid Ganeline	bf5787f58b	experimental[patch]: fixed namespace bug (#13585 ) It was : `from langchain.schema.prompts import BasePromptTemplate` but because of the breaking change in the ns, it is now `from langchain.schema.prompt_template import BasePromptTemplate` This bug prevents building the API Reference for the langchain_experimental	2023-11-28 16:40:27 -08:00
Taqi Jaffri	144710ad9a	langchain[minor]: Updated DocugamiLoader, includes breaking changes (#13265 ) There are the following main changes in this PR: 1. Rewrite of the DocugamiLoader to not do any XML parsing of the DGML format internally, and instead use the `dgml-utils` library we are separately working on. This is a very lightweight dependency. 2. Added MMR search type as an option to multi-vector retriever, similar to other retrievers. MMR is especially useful when using Docugami for RAG since we deal with large sets of documents within which a few might be duplicates and straight similarity based search doesn't give great results in many cases. We are @docugami on twitter, and I am @tjaffri --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>	2023-11-28 15:56:22 -08:00
Bagatur	a20e8f8bb0	experimental[patch]: release 0.0.43 (#13570 )	2023-11-28 15:38:09 -08:00
Bagatur	d8fe987ef5	langchain[patch]: release 0.0.342 (#13992 )	2023-11-28 14:34:57 -08:00
david qiu	9fb6805be4	langchain[minor]: Add retriever for Knowledge Bases for Amazon Bedrock (#13980 ) - Description: Adds a retriever implementation for [Knowledge Bases for Amazon Bedrock](https://aws.amazon.com/bedrock/knowledge-bases/), a new service announced at AWS re:Invent, shortly before this PR was opened. This depends on the `bedrock-agent-runtime` service, which will be included in a future version of `boto3` and of `botocore`. We will open a follow-up PR documenting the minimum required versions of `boto3` and `botocore` after that information is available. - Issue: N/A - Dependencies: `boto3>=1.33.2, botocore>=1.33.2` - Tag maintainer: @baskaryan - Twitter handles: `@pjain7` `@dead_letter_q` This PR includes a documentation notebook under `docs/docs/integrations/retrievers`, which I (@dlqqq) have verified independently. EDIT: `bedrock-agent-runtime` service is now included in `boto3>=1.33.2`: `5cf793f493` --------- Co-authored-by: Piyush Jain <piyushjain@duck.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-28 14:10:23 -08:00
Bagatur	1aed2d1f08	core[patch]: release 0.0.7 (#13989 )	2023-11-28 14:05:01 -08:00
David Duong	eb67f07e32	Track RunnableAssign as a separate run trace (#13972 ) Addressing incorrect order being sent to callbacks / tracers, due to the nature of threading --------- Co-authored-by: Nuno Campos <nuno@boringbits.io>	2023-11-28 22:02:31 +00:00
Nuno Campos	0f255bb6c4	In Runnable.stream_log build up final_output from adding output chunks (#12781 ) Add arg to omit streamed_output list, in cases where final_output is enough this saves bandwidth <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-11-28 21:50:41 +00:00
Nuno Campos	970fe23feb	Fixes for opengpts release (#13960 )	2023-11-28 21:49:43 +00:00
David Duong	947daaf833	Exclude Bedrock client and credentials_profile_name fields from serialisation (#13603 )	2023-11-28 16:34:46 -05:00
Bagatur	48fbc5513d	infra[patch], langchain[patch]: fix test deps and upper bound langchain dep on core(#13984 )	2023-11-28 13:26:15 -08:00
Stefano Lottini	1fd724293b	Astra DB vector store, move constructor docstring to class docstring (#13784 ) This PR rearranges the docstring for the `AstraDB` vector store class so as to have all useful information in the _class_ docstring for ease of reading. (incidentally, due to an oversight, the docstring that was in the constructor ended up buried below some lines of code, thereby disappearing altogether from accessibility. Apologies.)	2023-11-28 16:25:44 -05:00
Johannes Foulds	fc40bd4cdb	AnthropicFunctions function_call compatibility (#13901 ) - Description: Updates to `AnthropicFunctions` to be compatible with the OpenAI `function_call` functionality. - Issue: The functionality to indicate `auto`, `none` and a forced function_call was not completely implemented in the existing code. - Dependencies: None - Tag maintainer: @baskaryan , and any of the other maintainers if needed. - Twitter handle: None I have specifically tested this functionality via AWS Bedrock with the Claude-2 and Claude-Instant models.	2023-11-28 16:22:55 -05:00
mengjincn	05ea4fd37d	fix merge None value and non None value error (#13703 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-11-28 15:49:56 -05:00
Ali Orozgani	32d794f5a3	iMessage loader: implement message content extraction from attributed… (#13634 ) - Description: We are adding functionality to extract message content from the `attributedBody` field of the database, in case the content is not in the `text` field. - Issue: Closes #13326 and #10680 - Dependencies: None. - Tag maintainer: @eyurtsev, @hwchase17 --------- Co-authored-by: onotate <johnp.pham@mail.utoronto.ca>	2023-11-28 15:45:43 -05:00
William FH	e5256bcb69	[Evals] Add Project Tags (#13982 ) Add them to project extra	2023-11-28 11:38:59 -08:00
Nuno Campos	e0bcc98436	infra[patch]: Use langchain core in-tree as a dev dependency (#13957 ) Using the published version means master is broken for contributors whenever we make changes in one lib that depend on the other.	2023-11-28 09:23:43 -08:00
unifyh	2703a1b061	Fix `MarkdownHeaderTextSplitter` not recognizing tilde-fenced code blocks (#13511 ) - Description: Previously `MarkdownHeaderTextSplitter` did not consider tilde-fenced code blocks (https://spec.commonmark.org/0.30/#fenced-code-blocks). This PR fixes that. ````md # Bug caused by previous implementation: ~~~py foo() # This is a comment that would be considered header bar() ~~~ ```` - Tag maintainer: @baskaryan	2023-11-28 11:52:38 -05:00
Leonid Ganeline	7929b26017	office365 toolkit bug fixes (#13618 ) Several bug fixes: - emails: instead of `bcc` the `cc` is used. - errors in the truncation descriptions - no truncation of the `message_search` Several updates: - generalized UTC format - truncation limit can be changed now in _call()	2023-11-28 11:49:24 -05:00

... 24 25 26 27 28 ...

3208 Commits