mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 18:50:33 +00:00

Files

Kaparthy Reddy 2d4f00a451 fix(openai): Respect 300k token limit for embeddings API requests (#33668)

## Description

Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds
OpenAI's 300,000 token per request limit, causing 400 BadRequest errors.

## Problem

When embedding large document sets, LangChain could send a batch
containing more than 300,000 tokens in a single API request, causing
this error:
```
openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}}
```

The issue occurred because:
- The code splits texts into chunks of at most `embedding_ctx_length`
  tokens (8191 per chunk)
- It then batches chunks by `chunk_size` (default 1000 chunks per request)
- **But it didn't check** the total tokens per batch against OpenAI's
  300k limit
- Result: in the worst case, `1000 chunks × 8191 tokens = 8,191,000 tokens`
  in one request, far above the limit
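
The worst-case arithmetic above can be checked directly. This is only an illustrative calculation using the numbers quoted in this description; the lowercase variable names are made up for the example and are not identifiers from the library.

```python
# Values quoted in the description above.
EMBEDDING_CTX_LENGTH = 8191        # max tokens per chunk
CHUNK_SIZE = 1000                  # default chunks per request
MAX_TOKENS_PER_REQUEST = 300_000   # OpenAI's per-request token limit

# Worst case: every chunk in a full batch is at the context length.
worst_case_tokens = CHUNK_SIZE * EMBEDDING_CTX_LENGTH

# How many full-length chunks actually fit under the limit.
max_full_chunks = MAX_TOKENS_PER_REQUEST // EMBEDDING_CTX_LENGTH

print(worst_case_tokens)  # 8191000
print(max_full_chunks)    # 36
```

So a batch of full-length chunks can hold only 36 of them before crossing the limit, far fewer than the default `chunk_size` of 1000.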

## Solution

This PR implements dynamic batching that respects the 300k token limit:

1. **Added constant**: `MAX_TOKENS_PER_REQUEST = 300000`
2. **Track token counts**: Calculate actual tokens for each chunk
3. **Dynamic batching**: Instead of fixed `chunk_size` batches,
accumulate chunks until approaching the 300k limit
4. **Applied to both sync and async**: Fixed both
`_get_len_safe_embeddings` and `_aget_len_safe_embeddings`
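
The steps above can be sketched roughly as follows. This is a minimal illustration of token-aware batching, not the code merged in this PR: the helper name `batch_by_token_budget` is hypothetical, and still capping each batch at `chunk_size` chunks is an assumption of the sketch.

```python
from typing import Iterator

MAX_TOKENS_PER_REQUEST = 300_000


def batch_by_token_budget(
    token_counts: list[int],
    chunk_size: int = 1000,
    max_tokens: int = MAX_TOKENS_PER_REQUEST,
) -> Iterator[tuple[int, int]]:
    """Yield (start, end) index ranges, one per API request.

    A batch is closed when adding the next chunk would exceed
    `max_tokens`, or when it already holds `chunk_size` chunks
    (the latter cap is an assumption of this sketch).
    """
    start = 0
    batch_tokens = 0
    for i, n_tokens in enumerate(token_counts):
        batch_full = (i - start) >= chunk_size
        over_budget = batch_tokens + n_tokens > max_tokens
        # Never emit an empty batch: an oversized single chunk goes alone.
        if i > start and (batch_full or over_budget):
            yield start, i
            start, batch_tokens = i, 0
        batch_tokens += n_tokens
    if start < len(token_counts):
        yield start, len(token_counts)


# 500 chunks of 1000 tokens each: 300 fit in the first request.
print(list(batch_by_token_budget([1000] * 500)))  # [(0, 300), (300, 500)]
```

Yielding index ranges rather than copies of the chunks keeps the helper cheap and lets the caller slice its own token list for each request.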

## Changes

- Modified `langchain_openai/embeddings/base.py`:
  - Added `MAX_TOKENS_PER_REQUEST` constant
  - Replaced fixed-size batching with token-aware dynamic batching
  - Applied to both sync (line ~478) and async (line ~527) methods
- Added test in `tests/unit_tests/embeddings/test_base.py`:
  - `test_embeddings_respects_token_limit()` - Verifies large document
    sets are properly batched

## Testing

All existing tests pass (280 passed, 4 xfailed, 1 xpassed).

New test verifies:
- Large document sets (500 texts × 1000 tokens = 500k tokens) are split
into multiple API calls
- Each API call respects the 300k token limit
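
The shape of such a check can be sketched with a recording stub. This is not the test added in this PR: `RecordingClient`, `embed_all`, and `check_token_limit_respected` are hypothetical names, and `embed_all` is a simplified stand-in for the patched batching logic.

```python
MAX_TOKENS_PER_REQUEST = 300_000


class RecordingClient:
    """Hypothetical stand-in for the embeddings client; records tokens per call."""

    def __init__(self) -> None:
        self.tokens_per_call: list[int] = []

    def create(self, batch: list[list[int]]) -> list[list[float]]:
        self.tokens_per_call.append(sum(len(chunk) for chunk in batch))
        return [[0.0] for _ in batch]  # one dummy vector per chunk


def embed_all(chunks, client, max_tokens=MAX_TOKENS_PER_REQUEST):
    """Simplified token-aware batching (a stand-in, not the library method)."""
    vectors, batch, used = [], [], 0
    for chunk in chunks:
        if batch and used + len(chunk) > max_tokens:
            vectors.extend(client.create(batch))
            batch, used = [], 0
        batch.append(chunk)
        used += len(chunk)
    if batch:
        vectors.extend(client.create(batch))
    return vectors


def check_token_limit_respected() -> list[int]:
    chunks = [[0] * 1000 for _ in range(500)]  # 500 texts x 1000 token ids
    client = RecordingClient()
    vectors = embed_all(chunks, client)
    assert len(vectors) == 500                  # nothing dropped
    assert len(client.tokens_per_call) > 1      # split into multiple calls
    assert all(n <= MAX_TOKENS_PER_REQUEST for n in client.tokens_per_call)
    return client.tokens_per_call


print(check_token_limit_respected())  # [300000, 200000]
```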

## Usage

After this fix, users can embed large document sets without errors:
```python
from langchain_openai import OpenAIEmbeddings
from langchain_chroma import Chroma
from langchain_text_splitters import CharacterTextSplitter

# `large_documents` is assumed to be a list of `Document` objects loaded elsewhere.
# This will now work without exceeding token limits
embeddings = OpenAIEmbeddings()
documents = CharacterTextSplitter().split_documents(large_documents)
Chroma.from_documents(documents, embeddings)
```

Resolves #31227

---------

Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local>
Co-authored-by: Chester Curme <chester.curme@gmail.com>
Co-authored-by: Mason Daugherty <mason@langchain.dev>
Co-authored-by: Mason Daugherty <github@mdrxy.com>

2025-11-14 18:12:07 -05:00

anthropic

fix(anthropic): execute bash + file tools via tool node (#33960)

2025-11-14 13:17:01 -05:00

chroma

fix(chroma): resolve OpenCLIP + Chroma image embedding test regression (#33899)

2025-11-09 21:24:33 -05:00

deepseek

release(deepseek): 1.0.1 (#33946)

2025-11-13 11:24:39 -05:00

exa

chore: update pyproject.toml url entries (#33587)

2025-10-17 17:16:55 -04:00

fireworks

chore: update README.md files (#33919)

2025-11-10 22:51:35 -05:00

groq

fix(groq): bump min ver for core dep (#33949)

2025-11-13 11:46:54 -05:00

huggingface

chore: update README.md files (#33919)

2025-11-10 22:51:35 -05:00

mistralai

chore: update README.md files (#33919)

2025-11-10 22:51:35 -05:00

nomic

release(nomic): 1.0.1 (#33948)

2025-11-13 11:25:39 -05:00

ollama

chore: update README.md files (#33919)

2025-11-10 22:51:35 -05:00

openai

fix(openai): Respect 300k token limit for embeddings API requests (#33668)

2025-11-14 18:12:07 -05:00

perplexity

chore: update README.md files (#33919)

2025-11-10 22:51:35 -05:00

prompty

chore: update pyproject.toml url entries (#33587)

2025-10-17 17:16:55 -04:00

qdrant

style: misc refs work (#33771)

2025-10-31 18:29:53 -04:00

xai

chore: update README.md files (#33919)

2025-11-10 22:51:35 -05:00

README.md

docs: update package READMEs (#33488)

2025-10-15 10:49:35 -04:00

README.md

FAQ

Looking for an integration not listed here? Check out the integrations documentation and the note in the libs/ README about third-party maintained packages.

Integration docs

For full documentation, see the primary and API reference docs for integrations.