Bump Tiktoken from 2.0.3 to 3.1.5 #147

Open
dependabot[bot] wants to merge 1 commit into main from dependabot/nuget/src/MarkZither.Rag.Chunking/Tiktoken-3.1.5

Conversation

dependabot[bot] commented on behalf of github on May 21, 2026

Updated Tiktoken from 2.0.3 to 3.1.5.

Release notes

Sourced from Tiktoken's releases.

3.1.5

What's Changed

Full Changelog: tryAGI/Tiktoken@v3.1.4...v3.1.5

3.1.4

Full Changelog: tryAGI/Tiktoken@v3.1.3...v3.1.4

3.1.3

Full Changelog: tryAGI/Tiktoken@v3.1.2...v3.1.3

3.1.2

Full Changelog: tryAGI/Tiktoken@v3.1.1...v3.1.2

3.1.1

Full Changelog: tryAGI/Tiktoken@v3.1.0...v3.1.1

3.1.0

Full Changelog: tryAGI/Tiktoken@v3.0.0...v3.1.0

3.1.0-rc.1

Full Changelog: tryAGI/Tiktoken@v3.0.0...v3.1.0-rc.1

3.0.0

Highlights

Peak throughput: 618 MiB/s — up to 42x faster than other .NET tokenizers on multilingual/CJK text.

Performance improvements

  • O(n log n) min-heap BPE merge — replaces O(n²) linear scan in FindParts, improving cold-path CJK/multilingual by ~13%
  • Restored ASCII fast path — fixes a 35% regression on cached CJK/multilingual text introduced in v2.3.0
  • Cache speedup: 5-13x on repeated multilingual/CJK text (was 4-9x)
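
The min-heap idea in the first bullet can be illustrated with a small sketch. This is not Tiktoken's code: the function name `bpe_merge` and the toy `ranks` table are hypothetical, and the real `FindParts` implementation may differ in its details. The sketch keeps candidate pairs in a heap keyed by merge rank and discards stale entries lazily, so each merge costs O(log n) instead of a rescan of every adjacent pair.

```python
import heapq


def bpe_merge(piece: bytes, ranks: dict[bytes, int]) -> list[bytes]:
    """Merge adjacent parts in ascending rank order using a min-heap.

    `ranks` maps a merged byte string to its priority (lower merges first).
    Hypothetical helper for illustration only.
    """
    n = len(piece)
    if n == 0:
        return []

    parts = [piece[i:i + 1] for i in range(n)]  # token stored at each position
    prev = list(range(-1, n - 1))               # linked list: predecessor index
    nxt = list(range(1, n + 1))                 # linked list: successor (n = none)
    alive = [True] * n
    heap: list[tuple[int, int, bytes, bytes]] = []

    def push(i: int) -> None:
        # Queue the pair starting at position i if it is mergeable.
        j = nxt[i]
        if j < n:
            rank = ranks.get(parts[i] + parts[j])
            if rank is not None:
                heapq.heappush(heap, (rank, i, parts[i], parts[j]))

    for i in range(n - 1):
        push(i)

    while heap:
        _, i, left, right = heapq.heappop(heap)
        if not alive[i]:
            continue  # left part was absorbed by an earlier merge
        j = nxt[i]
        if j >= n or parts[i] != left or parts[j] != right:
            continue  # stale entry: a neighbour changed since it was queued
        # Merge position j into position i and unlink j.
        parts[i] = left + right
        alive[j] = False
        nxt[i] = nxt[j]
        if nxt[j] < n:
            prev[nxt[j]] = i
        # Only the two pairs touching the merged part need re-queuing.
        if prev[i] >= 0:
            push(prev[i])
        push(i)

    out = []
    i = 0
    while i < n:
        out.append(parts[i])
        i = nxt[i]
    return out


toy_ranks = {b"ab": 0, b"bc": 1, b"abc": 2}  # made-up merge table
print(bpe_merge(b"abcab", toy_ranks))
```

Each merge pushes at most two new heap entries, so the heap holds O(n) entries overall and the total work is O(n log n).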

CountTokens — zero allocation

| Input | Tiktoken | Throughput | vs competitors |
| --- | --- | --- | --- |
| Hello, World! (13 B) | 88 ns | 141 MiB/s | 1.9-3.6x |
| Multilingual (382 B) | 1.1 us | 339 MiB/s | 4.7-13.6x |
| CJK-heavy (1,676 B) | 2.6 us | 618 MiB/s | 14.3-42.3x |
| Python code (879 B) | 5.5 us | 153 MiB/s | 1.8-4.0x |
| Multilingual long (4,312 B) | 9.0 us | 458 MiB/s | 7.9-31.6x |
| Bitcoin whitepaper (19,884 B) | 105.1 us | 180 MiB/s | 2.4-3.8x |
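
As a sanity check on the figures above, the throughput column can be recomputed from the input size and the reported time. The helper name `throughput_mib_per_s` is made up for this sketch, and because the published times are rounded, the recomputed values are only expected to land within a few percent of the published ones.

```python
MIB = 1024 * 1024  # bytes per mebibyte


def throughput_mib_per_s(size_bytes: int, time_ns: float) -> float:
    """Convert 'size_bytes processed in time_ns' into MiB per second."""
    return size_bytes / (time_ns * 1e-9) / MIB


# (input, size in bytes, reported time in ns, reported MiB/s) from the table above
rows = [
    ("Hello, World!", 13, 88, 141),
    ("Multilingual", 382, 1_100, 339),
    ("CJK-heavy", 1_676, 2_600, 618),
    ("Python code", 879, 5_500, 153),
    ("Multilingual long", 4_312, 9_000, 458),
    ("Bitcoin whitepaper", 19_884, 105_100, 180),
]

for name, size, time_ns, reported in rows:
    computed = throughput_mib_per_s(size, time_ns)
    print(f"{name}: computed {computed:.1f} MiB/s, reported {reported} MiB/s")
```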

New

  • ColdPath benchmark project — dedicated benchmark for cache vs no-cache comparison
  • SmokeTest project — local perf regression guard using machine-independent cache speedup ratios
  • Aho-Corasick investigation — research document on alternative tokenization approaches
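
A ratio-based regression guard of the kind the SmokeTest bullet describes might look like the sketch below. The names `best_time`, `speedup_ratio`, and `passes_guard`, and the 5.0 threshold, are assumptions for illustration and are not taken from the Tiktoken repository. The idea is that cold and cached timings are measured on the same machine, so their ratio is far less sensitive to hardware speed than an absolute time.

```python
import time
from typing import Callable


def best_time(fn: Callable[[], object], repeats: int = 5) -> float:
    """Return the fastest of `repeats` wall-clock timings of fn(), in seconds."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - start)
    return best


def speedup_ratio(cold_seconds: float, cached_seconds: float) -> float:
    """How many times faster the cached path is than the cold path."""
    if cached_seconds <= 0:
        raise ValueError("cached_seconds must be positive")
    return cold_seconds / cached_seconds


def passes_guard(ratio: float, min_ratio: float) -> bool:
    """True when the measured speedup has not dropped below the floor."""
    return ratio >= min_ratio


# Hypothetical usage: in a real guard the two timings would come from
# best_time(...) on the cold and cached code paths; fixed numbers stand in here.
ratio = speedup_ratio(cold_seconds=6.5e-6, cached_seconds=1.0e-6)
print(passes_guard(ratio, min_ratio=5.0))
```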

Cross-language context (Apple M4 Max, o200k_base)

| Implementation | Encode Throughput | CountTokens Throughput |
| --- | --- | --- |
| Tiktoken .NET (cached) | 114-484 MiB/s | 141-618 MiB/s |
| Tiktoken .NET (no cache) | 44-145 MiB/s | 47-155 MiB/s |
| tiktoken Rust v3 | 34-88 MiB/s | |
| GitHub bpe Rust v0.3 | 33-64 MiB/s | 29-66 MiB/s |
| OpenAI tiktoken Python 0.12 | 7-20 MiB/s | |

Commits viewable in compare view.

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

---
updated-dependencies:
- dependency-name: Tiktoken
  dependency-version: 3.1.5
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
dependabot[bot] added the .NET and dependencies labels on May 21, 2026
@github-actions

Dependency Review

The following issues were found:
  • ✅ 0 vulnerable package(s)
  • ✅ 0 package(s) with incompatible licenses
  • ✅ 0 package(s) with invalid SPDX license definitions
  • ⚠️ 8 package(s) with unknown licenses.
See the Details below.

License Issues

src/MarkZither.Rag.Chunking/MarkZither.Rag.Chunking.csproj

| Package | Version | License | Issue Type |
| --- | --- | --- | --- |
| Tiktoken | 3.1.5 | Null | Unknown License |
| Tiktoken.Core | 3.1.5 | Null | Unknown License |
| Tiktoken.Encodings.Abstractions | 3.1.5 | Null | Unknown License |
| Tiktoken.Encodings.Tokenizer | 3.1.5 | Null | Unknown License |
| Tiktoken.Encodings.cl100k | 3.1.5 | Null | Unknown License |
| Tiktoken.Encodings.o200k | 3.1.5 | Null | Unknown License |
| Tiktoken.Encodings.p50k | 3.1.5 | Null | Unknown License |
| Tiktoken.Encodings.r50k | 3.1.5 | Null | Unknown License |

OpenSSF Scorecard

Scorecard details
| Package | Version | Score | Details |
| --- | --- | --- | --- |
| nuget/System.IO.Pipelines | 10.0.7 | Unknown | Unknown |
| nuget/System.Text.Encodings.Web | 10.0.7 | Unknown | Unknown |
| nuget/System.Text.Json | 10.0.7 | Unknown | Unknown |
| nuget/Tiktoken | 3.1.5 | Unknown | Unknown |
| nuget/Tiktoken.Core | 3.1.5 | Unknown | Unknown |
| nuget/Tiktoken.Encodings.Abstractions | 3.1.5 | Unknown | Unknown |
| nuget/Tiktoken.Encodings.Tokenizer | 3.1.5 | Unknown | Unknown |
| nuget/Tiktoken.Encodings.cl100k | 3.1.5 | Unknown | Unknown |
| nuget/Tiktoken.Encodings.o200k | 3.1.5 | Unknown | Unknown |
| nuget/Tiktoken.Encodings.p50k | 3.1.5 | Unknown | Unknown |
| nuget/Tiktoken.Encodings.r50k | 3.1.5 | Unknown | Unknown |

Scanned Files

  • src/MarkZither.Rag.Chunking/MarkZither.Rag.Chunking.csproj
