mirror of https://github.com/hwchase17/langchain.git synced 2026-05-18 05:26:09 +00:00

Files

Dayna Blackwell 3b9750f0a4 fix(text-splitters): remove incorrect C# and Elixir separator keywords (#37037)

## Summary

Removes two incorrect separators from `get_separators_for_language()` in
`RecursiveCharacterTextSplitter`:

- **C#**: `"\nimplements "` targets `implements`, which is a Java keyword. C# uses `:` for
interface implementation, so this separator never matches valid C# source code.
- **Elixir**: `"\nwhile "` targets a `while` construct that does not exist in Elixir. The
language uses recursion and `Enum.reduce_while/3` instead of while loops.

Both are dead separators that silently degrade chunking quality by
occupying positions in the separator priority list without contributing
useful split points.

## Tests

Added two targeted tests:
- `test_csharp_separators_no_java_keywords`: verifies `"\nimplements "`
is not in the C# separator list
- `test_elixir_separators_no_while`: verifies `"\nwhile "` is not in the
Elixir separator list

Existing `test_csharp_code_splitter` continues to pass (no change to
expected output since `implements` never matched valid C# code).

Full suite: 129 passed, 0 failed.

Fixes #37030

2026-04-27 13:48:19 -04:00

| Name | Last commit | Date |
| --- | --- | --- |
| langchain_text_splitters | fix(text-splitters): remove incorrect C# and Elixir separator keywords (#37037) | 2026-04-27 13:48:19 -04:00 |
| scripts | ci: avoid unnecessary dep installs in lint targets (#36046) | 2026-03-17 21:23:29 -04:00 |
| tests | fix(text-splitters): remove incorrect C# and Elixir separator keywords (#37037) | 2026-04-27 13:48:19 -04:00 |
| extended_testing_deps.txt | fix: support python 3.14 in various projects (#33575) | 2025-10-17 11:06:23 -04:00 |
| Makefile | fix(infra): correct lint_diff relative paths in package makefiles (#36333) | 2026-03-28 02:32:02 -04:00 |
| pyproject.toml | hotfix: bump min core versions (#36996) | 2026-04-24 15:23:28 -04:00 |
| README.md | chore: update twitter URLs (#34736) | 2026-01-13 01:54:11 -05:00 |
| uv.lock | release(openai): 1.2.1 (#36995) | 2026-04-24 15:04:36 -04:00 |

README.md

🦜✂️ LangChain Text Splitters

Looking for the JS/TS version? Check out LangChain.js.

Quick Install

pip install langchain-text-splitters

🤔 What is this?

LangChain Text Splitters contains utilities for splitting a wide variety of text documents into chunks.

📖 Documentation

For full documentation, see the API reference.

📕 Releases & Versioning

See our Releases and Versioning policies.

We encourage pinning to a specific version to avoid breaking your CI when we publish new tests. We recommend upgrading to the latest version periodically so that you have the latest tests.

Not pinning your version will ensure you always have the latest tests, but it may also break your CI if we introduce tests that your integration doesn't pass.

💁 Contributing

As an open-source project in a rapidly developing field, we are extremely open to contributions, whether it be in the form of a new feature, improved infrastructure, or better documentation.

For detailed information on how to contribute, see the Contributing Guide.