langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-07-11 15:35:09 +00:00

History

Lance Martin 7a00f17033 Web research retriever (#8102 ) Given a user question, this will - * Use LLM to generate a set of queries. * Query for each. * The URLs from search results are stored in self.urls. * A check is performed for any new URLs that haven't been processed yet (not in self.url_database). * Only these new URLs are loaded, transformed, and added to the vectorstore. * The vectorstore is queried for relevant documents based on the questions generated by the LLM. * Only unique documents are returned as the final result. This code will avoid reprocessing of URLs across multiple runs of similar queries, which should improve the performance of the retriever. It also keeps track of all URLs that have been processed, which could be useful for debugging or understanding the retriever's behavior. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>		2023-07-25 19:58:00 -07:00
..
agents	mv module integrations docs (#8101 )	2023-07-23 23:23:16 -07:00
callbacks	mv module integrations docs (#8101 )	2023-07-23 23:23:16 -07:00
chains	ArangoDB/AQL support for Graph QA Chain (#7880 )	2023-07-24 15:16:52 -07:00
data_connection	Web research retriever (#8102 )	2023-07-25 19:58:00 -07:00
memory	Add LLMChain example of memory with chat models (#8250 )	2023-07-25 15:20:32 -07:00
model_io	mv module integrations docs (#8101 )	2023-07-23 23:23:16 -07:00
paul_graham_essay.txt	codespell: workflow, config + some (quite a few) typos fixed (#6785 )	2023-07-12 16:20:08 -04:00
state_of_the_union.txt	Doc refactor (#6300 )	2023-06-16 11:52:56 -07:00