langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-07-11 23:40:24 +00:00

Author	SHA1	Message	Date
ccurme	9fa17bfabe	docs; fix links in v0.2.0 (#21483 )	2024-05-09 11:05:17 -04:00
Erick Friis	5542eacad8	docs: sidebar autogen hidden support (#21454 )	2024-05-09 00:23:52 +00:00
Erick Friis	74044e44a5	docs: useBaseUrl on svg paths (#21446 )	2024-05-08 21:55:42 +00:00
Sokolov Fedor	f4ddf64faa	community: Add MarkdownifyTransformer to langchain_community.document_transformers (#21247 ) - Added a new document transformer, MarkdownifyTransformer, that uses the `markdownify` package with customizable options to convert HTML to Markdown. It is similar to Html2TextTransformer but has more flexible options, and I've noticed that MarkdownifyTransformer sometimes performs better than the html2text one, which is why I use markdownify in my project. - Added docs and tests - Usage: ```python from langchain_community.document_transformers import MarkdownifyTransformer markdownify = MarkdownifyTransformer() docs_transform = markdownify.transform_documents(docs) ``` - Example of better performance on a simple task that I've noticed: ``` <html> <head><title>Reports on product movement</title></head> <body> <p data-block-key="2wst7">The reports on product movement will be useful for forming supplier orders and controlling outcomes.</p> </body> ``` Html2TextTransformer: ```python [Document(page_content='The reports on product movement will be useful for forming supplier orders and\ncontrolling outcomes.\n\n')] # Here we can see 'and\ncontrolling', which has an extra '\n' in it ``` MarkdownifyTransformer: ```python [Document(page_content='Reports on product movement\n\nThe reports on product movement will be useful for forming supplier orders and controlling outcomes.')] ``` --------- Co-authored-by: Sokolov Fedor <f.sokolov@sokolov-macbook.bbrouter> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Sokolov Fedor <f.sokolov@sokolov-macbook.local> Co-authored-by: Sokolov Fedor <f.sokolov@192.168.1.6>	2024-05-08 14:45:13 -07:00
Erick Friis	21d14549a9	docs: v0.2 docs in master (#21438 ) current python.langchain.com is building from branch `v0.1`. Iterate on v0.2 docs here. --------- Signed-off-by: Weichen Xu <weichen.xu@databricks.com> Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: jacoblee93 <jacoblee93@gmail.com> Co-authored-by: Leonid Ganeline <leo.gan.57@gmail.com> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru> Co-authored-by: Averi Kitsch <akitsch@google.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Nuno Campos <nuno@boringbits.io> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Martín Gotelli Ferenaz <martingotelliferenaz@gmail.com> Co-authored-by: Fayfox <admin@fayfox.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Dawson Bauer <105886620+djbauer2@users.noreply.github.com> Co-authored-by: Ravindu Somawansa <ravindu.somawansa@gmail.com> Co-authored-by: Dhruv Chawla <43818888+Dominastorm@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: WeichenXu <weichen.xu@databricks.com> Co-authored-by: Benito Geordie <89472452+benitoThree@users.noreply.github.com> Co-authored-by: kartikTAI <129414343+kartikTAI@users.noreply.github.com> Co-authored-by: Kartik Sarangmath <kartik@thirdai.com> Co-authored-by: Sevin F. Varoglu <sfvaroglu@octoml.ai> Co-authored-by: MacanPN <martin.triska@gmail.com> Co-authored-by: Prashanth Rao <35005448+prrao87@users.noreply.github.com> Co-authored-by: Hyeongchan Kim <kozistr@gmail.com> Co-authored-by: sdan <git@sdan.io> Co-authored-by: Guangdong Liu <liugddx@gmail.com> Co-authored-by: Rahul Triptahi <rahul.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: pjb157 <84070455+pjb157@users.noreply.github.com> Co-authored-by: Eun Hye Kim <ehkim1440@gmail.com> Co-authored-by: kaijietti <43436010+kaijietti@users.noreply.github.com> Co-authored-by: Pengcheng Liu <pcliu.fd@gmail.com> Co-authored-by: Tomer Cagan <tomer@tomercagan.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com>	2024-05-08 12:29:59 -07:00
JuHyung Son	710e57d779	upstage: deprecate UPSTAGE_DOCUMENT_AI_API_KEY (#21363 ) Description: We are merging UPSTAGE_DOCUMENT_AI_API_KEY and UPSTAGE_API_KEY into one, and only UPSTAGE_API_KEY will be used going forward. And we changed the base class of ChatUpstage to BaseChatOpenAI. --------- Co-authored-by: Sean <chosh0615@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-08 18:02:26 +00:00
Kevin Zhang	0715545378	docs: fix typo in text (#21393 ) Description: The previous text had an unclosed parenthesis, this fix adds the closing parenthesis	2024-05-08 15:58:15 +00:00
Tomaz Bratanic	dd70f2f473	Update graph docs (#21414 ) Update the deprecated docs and added node properties to graph construction	2024-05-08 09:05:39 -04:00
Erick Friis	893f06b5de	infra: rewrite ipynb links to md (#21392 )	2024-05-07 23:16:52 +00:00
Hassan El Mghari	225ceedcb6	docs: Add together docs in chat models & update provider docs (#21391 ) - Added Together docs in chat models section - Update Together provider docs to match the LLM & chat models sections --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-07 22:40:57 +00:00
Heidi Steen	af97d58c9e	docs: update docs/integrations/retrievers/azure_ai_search.ipynb (#21160 ) This is a doc update. It fixes up formatting and product name references. The example code is updated to use a local built-in text file. @mmhangami Please take a look --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-05-07 22:33:46 +00:00
snova-jamesv	ca753e7c15	community: updated performance limitation wording in sambanova.ipynb (#21390 ) - Description: updated performance limitation wording in sambanova.ipynb - Issue: NA - Dependencies: NA - Twitter handle: NA	2024-05-07 22:21:46 +00:00
Hassan El Mghari	416549bed2	docs: Updated Together integration docs (#21388 ) Description: Updated the together integration docs by leading with the streaming example, explicitly specifying a model to show users how to do that, and updating the sections to more closely match other integrations.	2024-05-07 21:51:42 +00:00
Leonid Ganeline	7cbf1c31aa	docs: table legend updated (#21351 ) Compacted the table column legends. Added links. Similar to #21259	2024-05-07 14:45:04 -07:00
Erick Friis	d5bde4fa91	infra: use nbconvert for docs build (#21135 ) todo - [x] remove quarto build semantics - [x] remove quarto download/install - [x] make `uv` not verbose	2024-05-07 12:30:17 -07:00
Ikko Eltociear Ashimine	80170da6c5	docs: update cassandra_database.ipynb (#21145 ) Enviroment -> Environment	2024-05-07 15:00:24 -04:00
Ikko Eltociear Ashimine	c34419e200	docs: update quick_start.ipynb (#21358 ) initalize -> initialize - [x] PR title: "package: description"	2024-05-07 08:44:48 -07:00
snova-jamesv	c2ed484653	community: add Sambaverse rate limitation info to sambanova.ipynb (#21379 ) - Description: add Sambaverse rate limitation info to sambanova.ipynb - Issue: NA - Dependencies: NA	2024-05-07 15:42:44 +00:00
Hassan El Mghari	d6ef5fe86a	together: add chat models, use openai base (#21337 ) Description: Adding chat completions to the Together AI package, which is our most popular API. Also staying backwards compatible with the old API so folks can continue to use the completions API as well. Also moved the embedding API to use the OpenAI library to standardize it further. Twitter handle: @nutlope --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 17:47:06 -07:00
Jorge Piedrahita Ortiz	e65652c3e8	community: add SambaNova embeddings integration (#21227 ) - Description: SambaNova hosted embeddings integration	2024-05-06 13:29:59 -07:00
Mark Cusack	060987d755	community[minor]: Add indexing via locality sensitive hashing to the Yellowbrick vector store (#20856 ) - Description: Add LSH-based indexing to the Yellowbrick vector store module - Twitter handle: @markcusack --------- Co-authored-by: markcusack <markcusack@markcusacksmac.lan> Co-authored-by: markcusack <markcusack@Mark-Cusack-sMac.local> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-05-06 20:18:02 +00:00
Leonid Ganeline	62559b20b3	docs: `chains` page format (#21259 ) Compacted the table column descriptions.	2024-05-06 11:33:38 -07:00
Daniel Glogowski	27e73ebe57	docs: update nvidia docs v2 (#21288 ) More doc updates por favor @baskaryan!	2024-05-06 11:29:02 -07:00
Pengcheng Liu	144f2821af	docs: add example for loading data from LarkSuite wiki. (#21311 ) Description: Update LarkSuite loader doc to give an example for loading data from LarkSuite wiki. Issue: None Dependencies: None Twitter handle: None	2024-05-06 09:56:12 -07:00
Jagadish Krishnamoorthy	c038991590	docs: Update pandas.ipynb (#21289 ) Remove the redundant comment.	2024-05-05 20:22:17 +00:00
tanersekmen	d310f9c71e	docs:update code structure (#21302 ) update the structure of llm_chain variables Co-authored-by: tanersemenn <0418>	2024-05-05 17:18:15 +00:00
Christophe Bornet	ba9dc04ffa	docs: Add doc for hybrid search (#21245 ) See [preview](https://langchain-git-fork-cbornet-doc-hybrid-search-langchain.vercel.app/docs/use_cases/question_answering/hybrid/) In the model of [per user retrieval](https://python.langchain.com/docs/use_cases/question_answering/per_user/)	2024-05-04 08:22:56 -04:00
Rohan Aggarwal	8021d2a2ab	community[minor]: Oraclevs integration (#21123 ) - Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads and allows you to query data based on semantics rather than keywords. One of its biggest benefits is that semantic search on unstructured data can be combined with relational search on business data in a single system. This is not only powerful but also significantly more effective because you don't need to add a specialized vector database, eliminating the pain of data fragmentation between multiple systems. This pull request adds the following functionality: Oracle AI Vector Search: Vector Store; Oracle AI Vector Search: Document Loader; Oracle AI Vector Search: Document Splitter; Oracle AI Vector Search: Summary; Oracle AI Vector Search: Oracle Embeddings. - We have added unit tests and have our own local unit test suite, which verifies that all the code is correct. We have added guides for each of the components and one end-to-end guide that shows how the entire thing runs. - We have made sure that make format and make lint run clean. --------- Co-authored-by: skmishraoracle <shailendra.mishra@oracle.com> Co-authored-by: hroyofc <harichandan.roy@oracle.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-04 03:15:35 +00:00
andyjessen	64e17bd793	docs: Fix comment within "handle long text" example (#21248 ) The current doc-string comment is referring to the wrong schema.	2024-05-03 12:36:53 +00:00
Daniel Glogowski	c3d169ab00	docs: Update Nvidia documentation (#21240 ) Updating Nvidia docs ahead for 5/15 competition. Thanks!	2024-05-03 12:29:03 +00:00
Bagatur	70bde15480	docs: add tool choice to tool calling (#21229 )	2024-05-03 03:10:22 -04:00
Erick Friis	aa9faa8512	docs: model table keywords, remove tool calling from llm (#21225 )	2024-05-02 21:04:29 +00:00
Aditya	ee2c55ca09	docs: Added documentation on Anthropic models on vertex (#21070 ) Description:Added documentation on Anthropic models on Vertex @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-05-02 13:12:01 -04:00
Erick Friis	cd4c54282a	infra: cleanup docs build (#21134 ) Refactors the docs build in order to: - run the same `make build` command in both vercel and local build - incrementally build artifacts in 2 distinct steps, instead of building all docs in-place (in vercel) or in a _dist dir (locally) Highlights: - introduces `make build` in order to build the docs - collects and generates all files for the build in `docs/build/intermediate` - renders those jupyter notebook + markdown files into `docs/build/outputs` And now the outputs to host are in `docs/build/outputs`, which will need a vercel settings change. Todo: - [ ] figure out how to point the right directory (right now deleting and moving docs dir in vercel_build.sh isn't great)	2024-05-01 17:34:05 -07:00
Bagatur	8b4b75e543	docs: standardize vertexai params (#20167 ) Related to #20085 Requires https://github.com/langchain-ai/langchain-google/pull/121	2024-05-01 11:42:18 -04:00
Jacob Lee	bd38073d76	👥 Update LangChain people data (#21143 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-05-01 11:01:43 -04:00
East Agile	2a6f78a53f	community[minor]: Rememberizer retriever (#20052 ) Description: This pull request introduces a new feature for LangChain: integration with the Rememberizer API through a custom retriever. This enables LangChain applications to let users load and sync their data from Dropbox, Google Drive, Slack, and their hard drive into a vector database that LangChain can query. Queries involve sending text chunks generated within LangChain and retrieving a collection of semantically relevant user data for inclusion in LLM prompts. User knowledge dramatically improves AI applications. The Rememberizer integration will also allow users to access general-purpose vectorized data such as Reddit channel discussions and US patents. Issue: N/A Dependencies: N/A Twitter handle: https://twitter.com/Rememberizer	2024-05-01 10:41:44 -04:00
Abhishek Bhagwat	86fe484e24	docs: Docs (sample notebook) for Vertex DIY RAG Ranking API (#21054 ) Vertex DIY RAG APIs help build complex RAG systems, provide more granular control, and are suited for custom use cases. The Ranking API takes in a list of documents and reranks them based on how relevant they are to a given query. Compared to embeddings, which look purely at the semantic similarity of a document and a query, the Ranking API can give you a more precise score for how well a document answers a given query. [Reference](https://cloud.google.com/generative-ai-app-builder/docs/ranking) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 05:39:39 +00:00
Ismail Hossain Polas	1fdf63fa6c	community[patch]: update package name to bagelML (#19948 ) Description This pull request updates the Bagel Network package name from "betabageldb" to "bagelML" to align with the latest changes made by the Bagel Network team. The following modifications have been made: - Updated all references to the old package name ("betabageldb") with the new package name ("bagelML") throughout the codebase. - Modified the documentation, and any relevant scripts to reflect the package name change. - Tested the changes to ensure that the functionality remains intact and no breaking changes were introduced. By merging this pull request, our project will stay up to date with the latest Bagel Network package naming convention, ensuring compatibility and smooth integration with their updated library. Please review the changes and provide any feedback or suggestions. Thank you!	2024-05-01 01:17:33 -04:00
Erick Friis	67e6744e0f	docs: fix some notebook formatting (#21136 )	2024-04-30 21:39:03 -07:00
Leonid Kuligin	a36935b520	docs: updated docs on langchain_google_community (#21064 ) Description: updated docs on langchain_google_community	2024-04-30 20:20:49 -04:00
junkeon	8d2909ee25	upstage[minor]: Update few codes and add upstage loader in pdf section (#21085 ) Description: Update UpstageLayoutAnalysisParser and Loader and add upstage loader example in pdf section Dependencies: langchain_community Twitter handle: [@upstageai](https://twitter.com/upstageai)	2024-04-30 20:15:49 -04:00
MacanPN	0f7f448603	community[patch]: add delete() method to AzureSearch vector store (#21127 ) Issue: Currently `AzureSearch` vector store does not implement `delete` method. This PR implements it. This also makes it compatible with LangChain indexer. Dependencies: None Twitter handle: @martintriska1 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:46:18 +00:00
Jorge Piedrahita Ortiz	3441a11b21	docs: minor changes in sambanova community integration docs (#21129 ) - Description: minor changes in sambanova community integration notebook docs --------- Co-authored-by: Renate Kempf <165940384+renate-snova@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:44:26 +00:00
Bagatur	6d3e9eaf84	docs: format (#21132 )	2024-04-30 23:32:41 +00:00
Christophe Bornet	d6e9bd3011	docs: Bump cassio min version in docs (#21081 ) Cassio 0.6+ is recommended for async vector store (not blocking on getting the embedding dimension) and for hybrid search support.	2024-04-30 10:25:37 -04:00
Kuro Denjiro	fa4124b821	community[minor]: add mintbase loader to langchain (#20089 ) - [x] Add Near NFT loader: "community: Load NFT near block chain using mintbase graph API" - [x] PR message: - Description: a description of the change - Twitter handle:Kurodenjiro --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 04:11:56 +00:00
Rahul Triptahi	c172611647	community[patch]: Add classifier_url argument in PebbloSafeLoader and documentation update. (#21030 ) Description: Add classifier_url argument in PebbloSafeLoader. Documentation: Updated PebbloSafeLoader documentation with above change and new links for pebblo github pages. --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-29 17:41:09 -04:00
Rodrigo Nogueira	90f19028e5	community[patch]: Add maritalk streaming (sync and async) (#19203 ) Co-authored-by: RosevalJr <rdmalajr@gmail.com> Co-authored-by: Roseval Donisete Malaquias Junior <roseval@maritaca.ai>	2024-04-29 21:31:14 +00:00
Cahid Arda Öz	cc6191cb90	community[minor]: Add support for Upstash Vector (#20824 ) ## Description Adding `UpstashVectorStore` to utilize [Upstash Vector](https://upstash.com/docs/vector/overall/getstarted)! #17012 was opened to add Upstash Vector to langchain but was closed to wait for filtering. Now that filtering has been added to Upstash Vector, we are opening a new PR. Additionally, an [embedding feature](https://upstash.com/docs/vector/features/embeddingmodels) was added, and we add this to our vectorstore as well. ## Dependencies [upstash-vector](https://pypi.org/project/upstash-vector/) should be installed to use `UpstashVectorStore`. Didn't update dependencies because of [this comment in the previous PR](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1876522450). ## Tests Tests are added and they pass. Tests are naturally network bound since Upstash Vector is offered through an API. There was [a discussion in the previous PR about mocking the unit tests](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1891820567). We haven't made changes to this end yet. We can update the tests if you can explain how the tests should be mocked. --------- Co-authored-by: ytkimirti <yusuftaha9@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 17:25:01 -04:00

3592 Commits