Sourced from nltk's changelog.
Version 3.9.4 2026-03-24
- Support Python 3.14
- Fix bug in Levenshtein distance when substitution_cost > 2
- Fix bug in Treebank detokeniser re quote ordering
- Fix bug in Jaro similarity for empty strings
- Several security enhancements
- Fix GHSA-rf74-v2fm-23pw: unbounded recursion in JSONTaggedDecoder
- Implement TextTiling vocabulary introduction method (Hearst 1997)
- Fix ALINE feature matrix errors and add comprehensive tests
- Support multiple VerbNet versions, fix longid/shortid regex for VerbNet ids
- Let downloader fallback to md5 when sha256 is unavailable
- Several other minor bugfixes and code cleanups
Thanks to the following contributors to 3.9.4: Min-Yen Kan, Eric Kafe, Emily Voss, bowiechen, Hrudhai01, jancallewaert, Mr-Neutr0n, pollak.peter89, ylwango613,
Version 3.9.3 2026-02-21
- Fix CVE-2025-14009: secure ZIP extraction in nltk.downloader (#3468)
- Block path traversal/arbitrary reads in nltk.data for protocol-less refs (#3467)
- Block path traversal/abs paths in corpus readers and FS pointers (#3479, #3480)
- Validate external StanfordSegmenter JARs using SHA256 (#3477)
- Add optional sandbox enforcement for filestring() (#3485)
- Maintenance: downloader/zipped models, CI/tooling updates
Thanks to the following contributors to 3.9.3: Chris Clauss, Eric Kafe, HyperPS, purificant, Shivansh-Game, Christopher Smith
Version 3.9.2 2025-10-01
- Update download checksums to use SHA256 in built index
- Fix percentage escape in new-style string formatting
- replace shortened URLs using goo.gl
- Make Wordnet interoperable with various taggers and tagged corpora
- Fix saving PerceptronTagger
- Document how to reproduce old Wordnet studies
- properly initialize Portuguese corpus reader
- support for mixed rules conversion into Chomsky Normal Form
- only import tkinter if a GUI is needed
- issue #2112 with Corenlp
- new environment variable NLTK_DOWNLOADER_FORCE_INTERACTIVE_SHELL
- Lesk defaults to most frequent sense in case of ties
Thanks to the following contributors to 3.9.2: Jose Cols, Peter de Blanc, GeneralPoxter, Eric Kafe, William LaCroix, Jason Liu, Samer Masterson, Mike014, purificant, Andrew Ernest Ritz, samertm, Ikram Ul Haq, Christopher Smith, Ryan Mannion
Version 3.9.1 2024-08-19
... (truncated)
ad9c96b
Update copyright year7edcddf
Updates for 3.9.4 release67a2736
Merge pull request #3180 from
yzhaoinuw/bug-on-edit_distance_align2b17ac5
Fix edit_distance_align backtrace for high substitution costs4b72976
Merge pull request #3018 from
JuanIMartinezB/bug/shortid-longid8a5619f
Merge pull request #3222 from
Syzygy2048/feature/texttiling-vocabulary-introd...c6574d7
Merge pull request #3289 from
ihitamandal/codeflash/optimize-windowdiff-2024-...98ff5d9
Merge pull request #3435 from
Hrudhai01/fix-3260-detokenize-quotesaec4fce
Merge pull request #3522 from
ekaf/pathseceec4ee3
Merge pull request #3526 from
nltk/update-contributing