:bug: fix decode dot in keys by techouse · Pull Request #25 · techouse/qs_codec

techouse · 2025-08-23T17:48:42Z

This pull request refines query string decoding logic, focusing on how percent-encoded dots (%2E/%2e) and list limits are handled, and improves compatibility with custom and legacy decoders. The changes clarify documentation, enhance top-level dot splitting, and add convenience methods for decoding keys and values. The most significant changes are grouped below:

Dot Decoding and Key Splitting Improvements:

Introduced a new dot_to_bracket_top_level method in DecodeUtils to convert top-level dots in keys to bracket groups, preserving dots inside brackets and handling edge cases. This replaces the previous regex-based approach and ensures percent-encoded dots are never split at the top level.
Updated key splitting (split_key_into_segments) to use the new character-scanner for top-level dots, ensuring correct handling of degenerate cases and parity with other language ports. Unterminated brackets now produce a synthetic segment matching Kotlin/qs behavior. [1] [2] [3]

Percent-Encoding and Decoder Behavior:

Keys and values are now decoded identically by the default decoder; whether a literal . acts as a key separator is determined by parsing options, not by decoding. The previous logic that preserved %2E in keys has been removed for consistency. [1] [2]
Documentation in DecodeOptions and decode-related methods has been clarified to reflect these behaviors and the role of decode_dot_in_keys for top-level dot splitting only. [1] [2]

Decoder Configuration and Compatibility:

Added support for a legacy_decoder option in DecodeOptions, enabling back-compatibility with older two-argument decoders. The decoder precedence is now: custom decoder > legacy_decoder > library default. [1] [2]
Added convenience methods decode, decode_key, and decode_value to DecodeOptions for unified scalar decoding, mirroring Kotlin API parity.

List Limit and Parsing Semantics:

Improved documentation and logic around negative list_limit values, clarifying that numeric-index parsing is disabled but comma-splitting still returns lists, and that list-growth operations raise immediately when raise_on_limit_exceeded=True. [1] [2]

General Documentation and Clarity:

Enhanced docstrings and inline comments throughout the decoding code to clarify behavior, edge cases, and cross-language parity. [1] [2] [3]

These changes together improve the correctness, configurability, and maintainability of the query string decoding logic.

Summary by CodeRabbit

New Features
- Pluggable custom decoders with backward-compat compatibility and new decode/decode_key/decode_value APIs.
- Top-level dot-to-bracket splitting that preserves dots inside brackets and respects encoded dots.
- Case-insensitive handling of percent-encoded brackets and uniform token pre-decoding.
Bug Fixes
- Clarified/enforced negative list_limit and list-growth semantics; numeric bracket indices honor list limits or become map keys.
- Preserve original keys when depth limits prevent splitting; unified key/value decoding behavior.
Documentation
- Updated docstrings describing decoding, dot/bracket, and decoder-signature semantics.
Tests
- Expanded unit tests covering dot/decoder interactions, parity, and segmentation edge cases.

…decode methods

…ndle degenerate cases

…_keys is enabled

… decoder results in DecodeOptions

…to_bracket_top_level

… configurations

…inated brackets in top-level dot splitting

…oded dot handling in decode logic

coderabbitai · 2025-08-23T17:48:50Z

Warning

Rate limit exceeded

@techouse has exceeded the limit for the number of commits or files that can be reviewed per hour. Please wait 2 minutes and 32 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

📥 Commits

Reviewing files that changed from the base of the PR and between b0e17c5 and a710b6d.

📒 Files selected for processing (2)

src/qs_codec/enums/decode_kind.py (2 hunks)
src/qs_codec/utils/decode_utils.py (7 hunks)

Walkthrough

Preprocesses query-string tokens, normalizes percent-encoded brackets/dots, introduces kind-aware decoding via DecodeOptions (with legacy_decoder compatibility), replaces regex dot-splitting with a top-level dot scanner, refines comma-split and negative list_limit semantics, and expands tests for decoder-kind, dot, and list behaviors.

Changes

Cohort / File(s)	Summary
Core decoding flow `src/qs_codec/decode.py`	Preprocesses `%5B`/`%5D` case-insensitively, applies `DecodeOptions.decode(..., kind=KEY
Decode options & adapter `src/qs_codec/models/decode_options.py`	Adds `legacy_decoder` and a signature-aware adapter that normalizes various decoder signatures (positional/keyword `charset`, optional `kind` as str or enum). Precedence: `decoder` > `legacy_decoder` > default. Adds `decode`, `decode_key`, and `decode_value` helpers.
Utilities: top-level dot scanner `src/qs_codec/utils/decode_utils.py`	Removes `DOT_TO_BRACKET` regex and adds `DecodeUtils.dot_to_bracket_top_level` to convert only top-level dots to bracket segments while preserving dots inside brackets and percent-encoded dots; `split_key_into_segments` uses this scanner and improves max-depth / unterminated-bracket handling; unify key/value decode behavior.
Enum docs `src/qs_codec/enums/decode_kind.py`	Updates `DecodeKind.KEY` docstring to state the scalar decoder fully decodes percent-encoded dots and that dot-splitting is handled later by higher-level parsing.
Tests: decode options & custom decoders `tests/unit/decode_options_test.py`	Adds tests covering `allow_dots` vs `decode_dot_in_keys` interactions, percent-encoded-dot handling, custom decoder signature/precedence/return behaviors (including `None` and non-string returns), and legacy-decoder compatibility.
Tests: decode behavior & parity `tests/unit/decode_test.py`	Adds extensive C#‑parity and edge-case tests for encoded-dot behavior across top-level and bracketed segments, Latin‑1 paths, leading/trailing/double dots, nesting, kind-aware decoder observations, and `split_key_into_segments` remainder/strict-depth cases.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant Client as Raw query string
  participant Decode as src/qs_codec/decode.py
  participant Utils as DecodeUtils
  participant Opts as DecodeOptions
  participant Parser as _parse_object/_parse_array_value

  Client->>Decode: raw "k=v" tokens
  Decode->>Utils: normalize %5B/%5D (case-insensitive)
  Decode->>Utils: dot_to_bracket_top_level(key_token) / split_key_into_segments
  Utils-->>Decode: key segments (top-level dots -> brackets)
  Decode->>Opts: decode(key_token, kind=KEY)
  Opts->>Opts: select/adapt decoder (decoder > legacy_decoder > default)
  Opts-->>Decode: decoded key (or None)
  Decode->>Opts: decode(value_token, kind=VALUE)
  Opts-->>Decode: decoded value (or None)
  alt token decode returned None
    Decode-->>Client: skip pair
  else
    Decode->>Parser: apply bracket vs numeric-index heuristics, list_limit & comma-split rules
    Parser-->>Decode: intermediate entries (flat dict)
    Decode-->>Client: accumulate results
  end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

🐛 preserve percent-encoded dots in keys during decoding #23 — Overlaps kind-aware decoding, decoder-adapter logic, and shared decode-path changes.
✨ decode: add raise_on_limit_exceeded option #11 — Overlaps list/bracket parsing and raise_on_limit_exceeded/list_limit semantics.
🐛 fix list parsing behavior and improve test cases for DecodeOptions #19 — Related changes to list-parsing and list_limit enforcement in decode.py.

Poem

I hopped through brackets, dots, and percent signs,
Tuned decoders to greet keys and values fine.
Lists count gently, indices find their art,
Bracketed dots return to their true heart.
My whiskers twitch — the parser's part. 🐇✨

✨ Finishing Touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch fix/decode-dot-in-keys

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR/Issue comments)

Type @coderabbitai help to get the list of available commands.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Status, Documentation and Community

Visit our Status Page to check the current availability of CodeRabbit.
Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

codecov · 2025-08-23T17:49:59Z

Codecov Report

❌ Patch coverage is 85.51724% with 21 lines in your changes missing coverage. Please review.
✅ Project coverage is 93.82%. Comparing base (2c3e103) to head (a710b6d).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
src/qs_codec/models/decode_options.py	79.74%	16 Missing ⚠️
src/qs_codec/utils/decode_utils.py	92.42%	5 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #25      +/-   ##
==========================================
- Coverage   94.01%   93.82%   -0.20%     
==========================================
  Files          16       16              
  Lines        1070     1134      +64     
==========================================
+ Hits         1006     1064      +58     
- Misses         64       70       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

…acy_decoder signatures

codacy-production · 2025-08-23T17:56:12Z

Coverage summary from Codacy

See diff coverage on Codacy

Coverage variation	Diff coverage
✅ -0.19% (target: -1.00%)	✅ 85.52%

Coverage variation details

	Coverable lines	Covered lines	Coverage
Common ancestor commit (`2c3e103`)	1070	1006	94.02%
Head commit (`a710b6d`)	1134 (+64)	1064 (+58)	93.83% (-0.19%)

Coverage variation is the difference between the coverage for the head and common ancestor commits of the pull request branch: <coverage of head commit> - <coverage of common ancestor commit>

Diff coverage details

	Coverable lines	Covered lines	Diff coverage
Pull request (#25)	145	124	85.52%

Diff coverage is the percentage of lines that are covered by tests out of the coverable lines that the pull request added or modified: <covered lines added or modified>/<coverable lines added or modified> * 100%

See your quality gate settings Change summary preferences

coderabbitai

Actionable comments posted: 4

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

tests/unit/decode_options_test.py (1)

232-244: Coercion test: replace PEP 604 unions

Apply this diff:

-        def dec(
-            v: str | None,
-            charset: Charset | None = None,
+        def dec(
+            v: t.Optional[str],
+            charset: t.Optional[Charset] = None,
             *,
             kind: DecodeKind = DecodeKind.VALUE,
-        ) -> object | None:
+        ) -> t.Optional[object]:
             return 42 if v is not None else None

🧹 Nitpick comments (6)

src/qs_codec/utils/decode_utils.py (2)
7-8: Doc: great clarification of top‑level dot rules; consider cross‑file consistency note

The new docstring clearly enumerates edge cases (leading/trailing/double dots, skip before “[”) and the “don’t split on %2E” rule. One follow‑up: DecodeKind’s docstring (enums/decode_kind.py) still mentions preserving encoded dots for KEY decoding, which no longer reflects the unified key/value decoding here. Recommend aligning that doc to avoid confusion.

32-108: Top‑level dot scanner logic looks solid and O(n); minor comment cleanup suggested

The depth tracking and handling of “.[”, leading “.”, double dots, and trailing “.” are correct and match the stated rules. Small nit: the “normal split (also handles leading '.')” comment duplicates the explicit leading‑dot branch above and may mislead future readers.

Proposed comment tweak only:
-                        # normal split (also handles leading '.'): convert a.b → a[b]
-                        # and '.a' → '[a]' at top level
+                        # normal split at top level: convert a.b → a[b]
src/qs_codec/models/decode_options.py (1)

202-215: Kind passing heuristic: robust; add Literal/typing annotation caveat (nit)

The Enum vs str decision is thoughtful. Minor future‑proofing: if someone annotates kind as typing.Literal["key","value"], it will be treated as “pass string” (fine). Consider a short inline comment to document this for maintainers. No code change required.

src/qs_codec/decode.py (3)

126-145: list_limit wording vs implementation: behavior is consistent; nit on doc phrasing

The doc explains negative list_limit semantics. The implementation enforces immediate errors for comma‑splits when raise_on_limit_exceeded=True (since any split length > negative limit), and otherwise proceeds. That matches the note. Consider clarifying that index‑based growth is handled in _parse_object while comma‑splits are handled here.

186-188: Case‑insensitive bracket normalization: OK, but note that values are affected too

Replacing %5B/%5D globally is pragmatic and keeps splitting simple. It also affects value tokens, which is acceptable given values are then decoded through the decoder. Add a brief comment to acknowledge this intentional trade‑off.

324-336: Current list length pre‑compute for trailing []: logic is conservative; consider simplification

The attempt to infer current_list_length by parsing chain[:-1] into an int usually yields None (e.g., "a[0]" → ValueError), so the branch rarely triggers. Since _parse_array_value already enforces list limits at the leaf and you pass explicit lengths for existing containers in _parse_query_string_values, this pre‑compute could be simplified or documented as best‑effort.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 2c3e103 and d0dd3c7.

📒 Files selected for processing (5)

src/qs_codec/decode.py (4 hunks)
src/qs_codec/models/decode_options.py (4 hunks)
src/qs_codec/utils/decode_utils.py (7 hunks)
tests/unit/decode_options_test.py (1 hunks)
tests/unit/decode_test.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (4)

tests/unit/decode_options_test.py (3)

src/qs_codec/models/decode_options.py (3)

DecodeOptions (20-286)

decode_key (279-282)

decode_value (284-286)

src/qs_codec/enums/charset.py (1)

Charset (25-36)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

tests/unit/decode_test.py (5)

src/qs_codec/models/decode_options.py (2)

DecodeOptions (20-286)

decode (263-277)

src/qs_codec/decode.py (1)

decode (31-101)

src/qs_codec/utils/decode_utils.py (1)

decode (143-176)

src/qs_codec/enums/charset.py (1)

Charset (25-36)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

src/qs_codec/models/decode_options.py (3)

src/qs_codec/utils/decode_utils.py (1)

decode (143-176)

src/qs_codec/enums/charset.py (1)

Charset (25-36)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

src/qs_codec/utils/decode_utils.py (1)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

🪛 GitHub Actions: Test

tests/unit/decode_options_test.py

[error] 186-186: pytest -v --cov=src/qs_codec --cov-report=xml failed with exit code 1. Test 'test_decoder_is_used_for_key_and_value' failed due to TypeError: unsupported operand type(s) for |: 'type' and 'NoneType' in annotation 'str | None'. Python 3.9 does not support PEP 604 union types; upgrade to Python 3.10+ or replace with Optional[str].

[error] 199-199: pytest -v --cov=src/qs_codec --cov-report=xml failed with exit code 1. Test 'test_decoder_null_return_is_honored' failed due to TypeError: unsupported operand type(s) for |: 'type' and 'NoneType' in annotation 'str | None'. Python 3.9 does not support PEP 604 union types; upgrade to Python 3.10+ or replace with Optional[str].

[error] 207-207: pytest -v --cov=src/qs_codec --cov-report=xml failed with exit code 1. Test 'test_single_decoder_acts_like_legacy_when_ignoring_kind' failed due to TypeError: unsupported operand type(s) for |: 'type' and 'NoneType' in annotation 'str | None'. Python 3.9 does not support PEP 604 union types; upgrade to Python 3.10+ or replace with Optional[str].

[error] 217-217: pytest -v --cov=src/qs_codec --cov-report=xml failed with exit code 1. Test 'test_decoder_wins_over_legacy_decoder_when_both_provided' failed due to TypeError: unsupported operand type(s) for |: 'type' and 'NoneType' in annotation 'str | None'. Python 3.9 does not support PEP 604 union types; upgrade to Python 3.10+ or replace with Optional[str].

[error] 235-235: pytest -v --cov=src/qs_codec --cov-report=xml failed with exit code 1. Test 'test_decode_key_coerces_non_string_decoder_result' failed due to TypeError: unsupported operand type(s) for |: 'type' and 'NoneType' in annotation 'str | None'. Python 3.9 does not support PEP 604 union types; upgrade to Python 3.10+ or replace with Optional[str].

🪛 Ruff (0.12.2)

tests/unit/decode_test.py

1556-1556: Dictionary key literal "a" repeated

(F601)

🪛 Flake8 (7.2.0)

tests/unit/decode_test.py

[error] 1556-1556: dictionary key 'a' repeated with different values

(F601)

[error] 1556-1556: dictionary key 'a' repeated with different values

(F601)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Codacy Static Code Analysis

🔇 Additional comments (12)

src/qs_codec/utils/decode_utils.py (3)

147-156: Unified decoding path: OK; confirm intentional no‑op kind

Using the same decoding logic for keys and values is consistent with the new tests. The kind parameter is intentionally unused here (pylint‑disabled). Just confirming this is by design now that dot handling moved entirely to the splitter/object builder.

188-201: Behavioral change in split: depth<=0 returns original key — matches tests

Returning [original_key] when max_depth <= 0 preserves keys unsplit (parity with tests like “uses original key when depth is 0”). Good defensive default.

240-246: Remainder wrapping: behavior matches expected “[c][d]” literal segment

Wrapping the remainder as [[...]] to yield a single literal key like "[c][d]" matches test expectations (e.g., depth limit with no strict depth). Looks good.

src/qs_codec/models/decode_options.py (5)

31-35: Option semantics are clear: good note on top‑level vs in‑bracket behavior

The doc clarifies that decode_dot_in_keys affects top‑level splitting semantics and not percent‑decoding inside brackets. This aligns with the new unified decoding pipeline.

138-142: New legacy_decoder: good back‑compat surface and precedence note

The explicit field and doc about precedence (decoder > legacy_decoder > default) will save integrators time.

155-162: Decoder precedence and fallback: sensible defaults

The precedence and defaulting to DecodeUtils.decode avoid None paths later. Good.

250-261: Adapter wrapper: nice consolidation point

Wrapping user decoders into a uniform (s, charset, *, kind) signature is clean and keeps call sites simple.

262-287: New convenience methods: API ergonomics improved

decode, decode_key, decode_value provide a clear, single entrypoint. Coercing decode_key to str safeguards callers from custom decoders returning non‑strings.

tests/unit/decode_test.py (1)

1530-1533: Guardrail expectations match splitter behavior — LGTM

Tests around trailing dots, leading dots, and double dots align with dot_to_bracket_top_level rules (preserve trailing, skip “.[”, convert “a..b” → “a.[b]”).

Also applies to: 1619-1625

src/qs_codec/decode.py (2)

238-270: Decoder usage: correct KEY/VALUE signaling and None‑skipping

The key/value decode with kind and the “skip pair when key decodes to None” behavior is clean and matches tests.

352-365: In‑bracket %2E normalization under decode_dot_in_keys: good parity with custom decoders

Normalizing %2E/%2e inside bracket segments here covers the case where a custom decoder returns raw tokens. Clear comments — nice.
tests/unit/decode_options_test.py (1)
215-231: Ensure decoder wins over legacy_decoder: replace PEP 604 unions

Apply this diff:
-        def legacy(v: str | None, charset: Charset | None = None) -> str | None:
+        def legacy(v: t.Optional[str], charset: t.Optional[Charset] = None) -> t.Optional[str]:
             return f"L:{'null' if v is None else v}"

-        def dec(
-            v: str | None,
-            charset: Charset | None = None,
+        def dec(
+            v: t.Optional[str],
+            charset: t.Optional[Charset] = None,
             *,
             kind: DecodeKind = DecodeKind.VALUE,
-        ) -> str | None:
+        ) -> t.Optional[str]:
             return f"K:{kind.name}:{'null' if v is None else v}"
Likely an incorrect or invalid review comment.

… decoder invocation for dot-encoded and bracketed keys

…e logic

…ind parameter in decode logic

…t in decode function

… query string prior to splitting

…cement in decode logic

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/qs_codec/utils/decode_utils.py (1)

213-244: Strict-depth raising also triggers on unterminated bracket groups; differentiate overflow vs. untermination
Today, any leftover '[' after the main loop triggers an IndexError when strict_depth=True, even if the loop exited due to an unterminated group (close < 0) rather than exceeding max_depth. The docstring promises a synthetic segment for unterminated groups (qs/Kotlin parity). Tighten the condition so we only raise on “more groups beyond max_depth” and not on unterminated input.

Apply this diff to track untermination and only raise on true depth overflow:

@@
-        while open_idx >= 0 and depth < max_depth:
+        unterminated = False
+        while open_idx >= 0 and depth < max_depth:
@@
-            if close < 0:
-                break  # unterminated group; stop collecting; remainder handled below
+            if close < 0:
+                unterminated = True  # unterminated group; stop collecting; remainder handled below
+                break
@@
-        if open_idx >= 0:
-            if strict_depth:
+        if open_idx >= 0:
+            # We only want to raise for true depth overflow under strict_depth,
+            # not for unterminated bracket groups.
+            depth_overflow = (depth >= max_depth) and not unterminated
+            if strict_depth and depth_overflow:
                 raise IndexError(f"Input depth exceeded depth option of {max_depth} and strict_depth is True")
             # Stash the remainder as a single segment (qs/Kotlin parity)
             segments.append("[" + key[open_idx:] + "]")

🧹 Nitpick comments (5)

src/qs_codec/utils/decode_utils.py (1)

51-57: Edge cases: leading '.' without following token and empty bracket name

For the lone key "." (no following token), the code treats it as a trailing dot and preserves ".". That matches “trailing '.' is preserved,” but differs from “leading '.' starts a bracket segment.” If this is intentional, consider clarifying the docstring that ".a → [a]" but "." remains ".".

For inputs like ".[ ]" (space inside brackets after a skipped dot), behavior is consistent, but a short test would lock this down.

Would you like me to add targeted unit tests for "." and for whitespace after a skipped dot (e.g., "a.[ b ]=x")?

Also applies to: 68-99, 103-107
tests/unit/decode_test.py (3)
1485-1500: Mixed-case encoded brackets + encoded dots: ensure option consistency path is covered
Good coverage of mixed-case %5B/%5D and %2E. One nit: the “inconsistent options raises” case ties the error to options construction, not parsing. Consider asserting the ValueError at options init time to decouple from decode() and make the intent explicit.
-        with pytest.raises(ValueError):
-            decode("a%5Bb%5D%5Bc%5D%2Ed=x", DecodeOptions(allow_dots=False, decode_dot_in_keys=True))
+        with pytest.raises(ValueError):
+            DecodeOptions(allow_dots=False, decode_dot_in_keys=True)
1513-1521: Case-insensitive %2E handling assertion is good; add control with allow_dots=False
You demonstrate both uppercase and lowercase %2E variants. Add a control asserting that with allow_dots=False, both remain literal and do not split to increase confidence.
         opt = DecodeOptions(allow_dots=True, decode_dot_in_keys=True)
         assert decode("a[b]%2Ec=x", opt) == {"a": {"b": {"c": "x"}}}
         assert decode("a[b]%2ec=x", opt) == {"a": {"b": {"c": "x"}}}
+        # Control: no splitting when allow_dots=False
+        no_dots = DecodeOptions(allow_dots=False, decode_dot_in_keys=False)
+        assert decode("a[b]%2Ec=x", no_dots) == {"a[b].c": "x"}
1564-1622: Additional parity tests mix two different models for encoded-dot behavior
Parts of this block assume “encoded dots decode to literal '.' and then split under allow_dots” (model A), others assume “encoded dots never split unless decode_dot_in_keys=True” (model B). Please unify to one model and update expectations accordingly (see earlier comment).

I can submit a cohesive test matrix once you confirm whether keys are pre-decoded before splitting or not.
src/qs_codec/models/decode_options.py (1)
283-291: decode_key coerces any scalar to str; callers relying on non-str keys should be aware
Minor note: decode_key forces str() on any non-None result. If a custom decoder returns non-strings for keys (e.g., bytes), they’ll be coerced. This is reasonable, but worth a one-liner in docs to avoid surprises.
     def decode_key(self, value: t.Optional[str], charset: t.Optional[Charset] = None) -> t.Optional[str]:
-        """Decode a key (or key segment). Always returns a string or ``None``."""
+        """Decode a key (or key segment). Always returns a string or ``None``.
+        Note: custom decoders returning non-strings for keys are coerced via ``str()``."""
         out = self.decode(value, charset, kind=DecodeKind.KEY)
         return None if out is None else str(out)

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between f04d9c6 and 9beee76.

📒 Files selected for processing (4)

src/qs_codec/decode.py (4 hunks)
src/qs_codec/models/decode_options.py (4 hunks)
src/qs_codec/utils/decode_utils.py (7 hunks)
tests/unit/decode_test.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

src/qs_codec/decode.py

🧰 Additional context used

🧬 Code graph analysis (3)

src/qs_codec/utils/decode_utils.py (1)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

src/qs_codec/models/decode_options.py (4)

src/qs_codec/utils/decode_utils.py (2)

DecodeUtils (18-245)

decode (142-175)

src/qs_codec/decode.py (1)

decode (31-101)

src/qs_codec/enums/charset.py (1)

Charset (25-36)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

tests/unit/decode_test.py (5)

src/qs_codec/models/decode_options.py (2)

DecodeOptions (20-290)

decode (267-281)

src/qs_codec/decode.py (1)

decode (31-101)

src/qs_codec/utils/decode_utils.py (1)

decode (142-175)

src/qs_codec/enums/charset.py (1)

Charset (25-36)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

🔇 Additional comments (4)

src/qs_codec/utils/decode_utils.py (1)

32-107: Top-level dot scanner: solid O(n) pass with well-scoped splitting; preserves bracket depth and encoded dots
The character-scanner correctly avoids splitting inside brackets, handles degenerate cases, and never treats percent-encoded dots as separators. Clear improvement over regex.

tests/unit/decode_test.py (1)

1554-1561: Nice fix for F601 duplicate dict key in no-op assertion
The bool(decode(...)) approach avoids constructing a dict with duplicate keys while still exercising the code path.

src/qs_codec/models/decode_options.py (2)

27-35: Docstring implies decode_dot_in_keys governs top-level %2E splitting; verify against actual pipeline
Given DecodeUtils.decode treats keys and values identically, whether a top-level %2E participates in splitting depends on where decoding occurs relative to dot-splitting. Please make the contract explicit here:

If keys are decoded before splitting, then allow_dots alone causes decoded %2E→"." to split; decode_dot_in_keys may be redundant.

If decode_dot_in_keys is the only switch that enables top-level %2E splitting, ensure keys are not pre-decoded into literal '.' before calling the top-level scanner.

I can propose precise wording after you confirm the intended order of operations (decode vs. split).

155-166: Signature-adaptive decoder adapter: robust and pragmatic
The adapter correctly handles (s), (s, charset), and (s, charset, kind) forms, including varargs/kwargs and Enum-vs-str kind representations. Nice touch using Enum base to decide string coercion.

Also applies to: 220-263

… encoded dots in dot_to_bracket_top_level

…et groups in decode logic

…der outputs via str()

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (7)

src/qs_codec/utils/decode_utils.py (7)
80-111: Leading “..” currently produces an empty “[]” segment; either document or preserve the first dot

For inputs like “..a”, the branch for a leading dot builds an empty bracket segment “[]” before proceeding. That may be surprising given the documented degenerate rule focuses on “a..b → a.[b]”. If the intended rule is “preserve the first dot” even at the start, consider this small guard; otherwise, please add a test and doc note for “..a” explicitly.
                 elif i == 0:
+                    # If input starts with '..', preserve the first dot like the 'a..b' case.
+                    if has_next and next_ch == ".":
+                        sb.append(".")
+                        i += 1
+                        continue
                     # leading '.' starts a bracket segment: ".a" -> "[a]"
                     start = i + 1
                     j = start
                     while j < n and s[j] != "." and s[j] != "[":
                         j += 1
Examples to cover in tests: ".a", "..a", "...", ".[b]".

112-119: Comment mentions preserving percent sequences; code already does so implicitly

The implementation doesn’t special‑case “%2E” (it just appends chars), which is fine. The comment is correct but could be tightened to “We only split on literal '.'; '%' sequences are never treated as separators.” Optional.

184-185: Latin‑1 fast path is correct; minor micro‑opt optional

Current regex+lambda is clear and fine. If you ever need to squeeze a bit more performance, replacing the lambda with a small local fast converter (avoids repeated int/chr in a closure) can help, but not necessary now.

199-205: Docs: clarify remainder-wrapping example for max_depth and unterminated groups

The behavior is sound. Suggest adding an explicit example so users know the remainder is bracket‑wrapped as a single synthetic segment, e.g., 'a[b][c][d]' with max_depth=2 → ['a','[b]','[c]','[[d]]']; 'a[b' (unterminated) → ['a','[[b]'].
         - If there are more groups beyond ``max_depth`` and ``strict_depth`` is True, an ``IndexError`` is raised. Otherwise, the remainder is added as one final segment (again mirroring qs).
-        - Unterminated '[': the remainder after the first unmatched '[' is captured as a single synthetic bracket segment (qs/Kotlin parity).
+        - Unterminated '[': the remainder after the first unmatched '[' is captured as a single synthetic bracket segment (qs/Kotlin parity).
+
+        Examples
+        --------
+        max_depth=2: "a[b][c][d]" -> ["a", "[b]", "[c]", "[[d]]"]
+        unterminated: "a[b" -> ["a", "[[b]"]
225-247: Unterminated bracket handling is correct and avoids strict_depth false positives

The separate ‘unterminated’ flag prevents raising for malformed input under strict_depth—good parity detail. Consider adding unit tests that assert no exception is raised for unterminated input when strict_depth=True.

253-261: Depth overflow vs unterminated remainder: confirm intended double-bracket remainder

Appending "[" + key[open_idx:] + "]" intentionally creates a synthetic “[ … ]” segment whose content contains the original bracket tokens, e.g., remainder “[d][e]” → “[[d][e]]”. That matches the doc comment; just ensure downstream consumers treat “[[…]]” as an opaque segment. If not, you may need a sentinel wrapper.

I can help add tests to lock this in:

allow_dots=True, max_depth=1: "a.b.c" → ["a", "[b][c]"] vs ["a","[[b][c]]"] depending on consumer expectations.

strict_depth=True overflow raises for well-formed keys, but not for unterminated.

154-166: Document kind as a no-op in decode() and refresh DecodeKind docs

To keep API compatibility clear and align cross-file documentation:

In src/qs_codec/utils/decode_utils.py (lines 154–166), extend the decode docstring to note that the kind parameter is accepted but ignored, and may be removed in a future major release.

In src/qs_codec/enums/decode_kind.py (lines 18–22), update the KEY attribute doc to reflect that the default scalar decoder ignores kind and fully decodes percent-encoded dots; dot-splitting semantics run later via parser options.

Suggested diffs:
--- a/src/qs_codec/utils/decode_utils.py
+++ b/src/qs_codec/utils/decode_utils.py
@@     def decode(
-        ) -> t.Optional[str]:
-        """Decode a URL-encoded scalar.
+        ) -> t.Optional[str]:
+        """Decode a URL-encoded scalar.
+
+        Notes
+        -----
+        The `kind` parameter is accepted for API compatibility but is currently
+        ignored; keys and values are decoded identically. It may be removed in
+        a future major release.
--- a/src/qs_codec/enums/decode_kind.py
+++ b/src/qs_codec/enums/decode_kind.py
@@ class DecodeKind(str, Enum):
-    KEY
-        Decode a *key* (or key segment). Implementations typically preserve
-        percent-encoded dots (``%2E``/``%2e``) so that dot-splitting semantics can
-        be applied later according to parser options.
+    KEY
+        Decode a *key* (or key segment). Note that the default scalar decoder
+        (``qs_codec.utils.decode_utils.decode``) ignores `kind` and fully
+        decodes percent-encoded dots (``%2E``/``%2e``). Dot-splitting behavior is
+        applied later by higher-level parser options.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 51b3c75 and 5887b7a.

📒 Files selected for processing (2)

src/qs_codec/models/decode_options.py (4 hunks)
src/qs_codec/utils/decode_utils.py (7 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

src/qs_codec/models/decode_options.py

🧰 Additional context used

🧬 Code graph analysis (1)

src/qs_codec/utils/decode_utils.py (1)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-27)

🔇 Additional comments (4)

src/qs_codec/utils/decode_utils.py (4)
7-8: Clarify percent-encoded dot semantics in module docstring

The bullet claims “never splits on percent‑encoded dots,” but the default decode path later normalizes “%2E” → “.”, which will be split at top level when allow_dots=True. Recommend updating the wording to reflect: only literal “.” at depth 0 are split; encoded “%2E” will be split iff the upstream decoder normalized them.

Apply this doc tweak:
- - Top‑level dot splitting uses a character‑scanner that handles degenerate cases (leading '.' starts a bracket segment; '.[' is skipped; double dots preserve the first; trailing '.' is preserved) and never splits on percent‑encoded dots.
+ - Top‑level dot splitting uses a character‑scanner that handles degenerate cases (leading '.' starts a bracket segment; '.[' is skipped; double dots preserve the first; trailing '.' is preserved). It splits only on literal '.' at depth 0. If the upstream decoder normalized '%2E' to '.', those will be split; if a custom decoder leaves '%2E' encoded, they will not split.
32-47: dot_to_bracket_top_level: tighten language about when decoding happens

Good API addition. Minor doc refinement: the opening line suggests this function itself runs “after percent‑decoding.” To avoid implying it decodes, prefer “after upstream decoding.”
[ suggest_nitpick ]
- """Convert top-level dot segments into bracket groups *after* percent-decoding.
+ """Convert top-level dot segments into bracket groups (assuming upstream decoding has already run).
Also, the Notes already clarify the default path vs custom decoder path—nice. No functional concerns here.

208-212: Returning the original key when max_depth <= 0 matches qs semantics

Good change. This preserves the original token when depth-based splitting is disabled.

36-45: Behavior vs PR summary: encoded dot splitting

PR summary states “ensures percent‑encoded dots are never split at the top level,” while the doc here (correctly) notes default decoding turns %2E into '.' which then will split. Please confirm the intended behavior. If the invariant “never split encoded dots” must hold for the default decoder, decode() would need to preserve %2E in keys again (or pass DecodeKind.KEY and branch), which contradicts the unification goal. Otherwise, update PR text/docs accordingly.

… dot sequences

…d unterminated bracket handling

…be removed in future

…encoded dots

… depth enforcement

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (1)

tests/unit/decode_test.py (1)

1554-1561: Resolved: duplicate dict key (F601) workaround looks good
This replaces the problematic dead-branch assertion with a no-op truthiness assertion to force invocation without constructing duplicate keys. Matches prior guidance.

🧹 Nitpick comments (7)

src/qs_codec/utils/decode_utils.py (4)
80-106: Edge-case coverage: add tests for leading double dot and dot-before-bracket at start

You already cover a..b, a.[b], and trailing dots. Consider adding:

"..a=x" with allow_dots=True → {"..a": "x"} or {".": {"a": "x"}} depending on intended policy; your current logic preserves the first dot and turns the second into a bracket, yielding ".[a]" which ultimately becomes {".": {"a": "x"}}.

".[a]=x" with allow_dots=True → {"a": "x"} (you skip the dot before '['; worth an explicit test).

I can add parametrized tests mirroring these if you want.

118-124: Comment nit: “preserve percent sequences verbatim” is accurate but implementation is byte-wise

The comment says percent sequences are preserved “verbatim,” which is true; however, the code appends characters byte-by-byte without special handling for '%' sequences. Consider clarifying the comment to “no special handling for percent sequences here; characters are appended as-is.”
-                # also preserve percent sequences verbatim at top level;
-                # we don't split on '%2E' here
+                # No special handling for percent sequences here; characters are appended as-is.
+                # We never split on '%2E' at this stage.
190-196: Latin-1 fast-path is fine; consider a tiny micro-optimization

The HEX2_PATTERN.sub(lambda ...) is straightforward. If you care about micro-opts, prebind int and chr locally to avoid global lookups in the hot loop, but this is purely optional.
-            return cls.HEX2_PATTERN.sub(lambda m: chr(int(m.group(1), 16)), s)
+            _int, _chr = int, chr
+            return cls.HEX2_PATTERN.sub(lambda m: _chr(_int(m.group(1), 16)), s)
241-276: Remainder handling and strict-depth logic look correct; add one extra malformed-key test

unterminated=True avoids false positives for depth overflow.

The synthetic remainder via "[" + key[open_idx:] + "]" yields the intended [[...]].

Consider adding a test for a malformed sequence that includes ‘]’ inside the token captured by the scanner-generated group (e.g., “a.b]c” with allow_dots=True) to ensure downstream consumers tolerate odd literals within the synthetic segment.
tests/unit/decode_test.py (3)
1530-1533: Trailing dot behavior: confirm intentional drop vs literal retention

The expectation that “user.email.” with allow_dots=True yields {"user": {"email": "x"}} implies the trailing '.' is intentionally ignored. This matches the scanner’s behavior; consider adding one counterpart asserting that with allow_dots=False, the trailing '.' is retained as a literal key (“user.email.”), to document both modes.

1563-1622: Guardrail and parity tests are strong; consider minor parameterization to reduce duplication

The “additional parity” suite is thorough. You could reduce repetition by parameterizing case variants (%2E vs %2e) and option tuples, but this is optional.

1456-1461: Clarify intent with bracket-internal %2E decoding comments

Tests already assert that %2E inside brackets decodes to '.' independent of decode_dot_in_keys. Consider a brief comment on the option being top-level only, to avoid readers assuming it affects bracket segments.
-        # Note: key-decoder percent-decodes inside brackets, so %2E → "."
+        # Note: percent-decoding inside bracket segments yields '.' regardless of decode_dot_in_keys,
+        # which only influences top-level splitting semantics.
Also applies to: 1470-1474, 1475-1484

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 5887b7a and dba5961.

📒 Files selected for processing (3)

src/qs_codec/enums/decode_kind.py (1 hunks)
src/qs_codec/utils/decode_utils.py (7 hunks)
tests/unit/decode_test.py (1 hunks)

✅ Files skipped from review due to trivial changes (1)

src/qs_codec/enums/decode_kind.py

🧰 Additional context used

🧬 Code graph analysis (2)

src/qs_codec/utils/decode_utils.py (1)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-28)

tests/unit/decode_test.py (5)

src/qs_codec/models/decode_options.py (2)

DecodeOptions (20-293)

decode (267-281)

src/qs_codec/utils/decode_utils.py (3)

decode (159-198)

DecodeUtils (18-278)

split_key_into_segments (201-278)

src/qs_codec/decode.py (1)

decode (31-101)

src/qs_codec/enums/charset.py (1)

Charset (25-36)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-28)

🔇 Additional comments (8)

src/qs_codec/utils/decode_utils.py (5)

32-124: Top-level dot scanner is correct, linear, and handles degenerate cases well

The stateful scanner correctly:

respects depth (splits only at depth 0),

preserves dots inside brackets,

skips ".[" (a.[b] → a[b]),

preserves the first of a double dot (a..b → a.[b]),

preserves trailing dots.

The implementation is linear-time and avoids regex backtracking. Nice work.

163-178: Unified key/value decoding: docstring and signature match behavior

Accepting kind but ignoring it (with pylint suppression) is consistent with the updated design and the DecodeKind docs. Clear and future-proof.

210-216: split_key_into_segments doc clarifies dot-first policy and Kotlin/qs parity

The updated doc precisely communicates the top-level dot conversion and the synthetic segment behavior. Good alignment with tests.

224-228: depth=0 early return correctly bypasses dot conversion

Returning [original_key] before calling dot_to_bracket_top_level ensures “depth=0 means no splitting whatsoever,” which your tests assert. Good.

1-8: Sanity check complete: no lingering references to DOT_TO_BRACKET
I ran a repository-wide search for DOT_TO_BRACKET (including import statements) and found no matches. All references have been removed.

tests/unit/decode_test.py (3)

1435-1512: Encoded-dot behavior tests align with pre-decoding + top-level splitting

These cases establish:

%2E splits at top level when allow_dots=True regardless of decode_dot_in_keys.

Inconsistent options raise (decode_dot_in_keys=True while allow_dots=False).

Inside brackets, %2E decodes to '.' but does not trigger additional splits.

This matches the new contract. Well done.

1513-1545: Compositional cases (bracket + encoded dot, encoded dot before bracket) are accurate and valuable

Great coverage for “a[b]%2Ec=x”, “a%2E[b]=x”, and case variants. This guards against regressions in the scanner’s lookahead logic.

1624-1649: Remainder segmentation tests validate the synthetic ‘[[...]]’ contract

These assert the nuanced behaviors (depth slicing for dot and bracket inputs, strict vs non-strict, unterminated group). Excellent.

…rcent sequence handling

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (4)

src/qs_codec/utils/decode_utils.py (4)
7-7: Clarify wording around percent-encoded dots to avoid ambiguity.

“never splits on percent-encoded dots” can be misread given the default decoder fully percent-decodes first. Suggest clarifying that only literal '.' characters at depth 0 are split, while literal percent sequences like '%2E' are never treated as split points.

Apply this doc tweak:
- - Top‑level dot splitting uses a character‑scanner that handles degenerate cases (leading '.' starts a bracket segment; '.[' is skipped; double dots preserve the first; trailing '.' is preserved) and never splits on percent‑encoded dots.
+ - Top‑level dot splitting uses a character‑scanner that handles degenerate cases (leading '.' starts a bracket segment; '.[' is skipped; double dots preserve the first; trailing '.' is preserved) and never treats literal percent‑encoded sequences (e.g., '%2E') as split points; only actual '.' characters at depth 0 are split.
94-116: Reduce duplication in token scanning logic.

The “scan until next '.' or '[' (and now ']')” block appears twice (leading '.' case and normal split). Extract a small local helper to improve readability and reduce maintenance overhead.

Example refactor:
+        def _scan_token(start_idx: int) -> int:
+            j = start_idx
+            while j < n and s[j] not in ".[]":
+                j += 1
+            return j
@@
-                        j = start
-                        while j < n and s[j] != "." and s[j] != "[":
-                            j += 1
+                        j = _scan_token(start)
@@
-                        j = start
-                        while j < n and s[j] != "." and s[j] != "[":
-                            j += 1
+                        j = _scan_token(start)
195-197: Optional: Consider unquote_to_bytes for the Latin‑1 path.

For long strings this regex+lambda is fine, but urllib.parse.unquote_to_bytes followed by .decode('latin-1') is a readable, reasonably fast alternative that naturally ignores %uXXXX.

Illustrative change:
-            _int, _chr = int, chr
-            return cls.HEX2_PATTERN.sub(lambda m: _chr(_int(m.group(1), 16)), s)
+            from urllib.parse import unquote_to_bytes
+            return unquote_to_bytes(s).decode('latin-1')
Note: Both behaviors leave %uXXXX untouched. Keep the current approach if benchmarks favor it in your workloads.

242-279: Good treatment of unterminated groups and strict-depth overflow.

Using unterminated to suppress strict overflow exceptions is subtle but correct.

Remainder wrapping into a single synthetic segment ensures downstream folding can proceed deterministically.

Nit: consider renaming variables to reduce cognitive load:

level -> group_balance

depth -> segments_emitted

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between dba5961 and b0e17c5.

📒 Files selected for processing (1)

src/qs_codec/utils/decode_utils.py (7 hunks)

🧰 Additional context used

🧬 Code graph analysis (1)

src/qs_codec/utils/decode_utils.py (1)

src/qs_codec/enums/decode_kind.py (1)

DecodeKind (13-28)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Codacy Static Code Analysis

🔇 Additional comments (3)

src/qs_codec/utils/decode_utils.py (3)

32-62: API addition looks solid and the docstring is thorough.

The new top-level dot scanner API and its behavior notes are clear and align with qs semantics. Good call documenting degenerate cases and custom-decoder behavior.

211-217: Key-splitting spec is clear and matches qs/Kotlin parity.

The top-level dot conversion plus balanced-bracket segmentation and the unterminated-bracket behavior are well-specified in the docstring.

225-229: Correct handling for max_depth <= 0.

Returning the original key as a single segment is the right qs-aligned behavior. Good defensive early-exit.

…verrun on closing brackets

… percent-decoded dots in keys

…' at depth 0, not percent-encoded sequences

techouse added 10 commits August 23, 2025 17:53

♻️ refactor DecodeOptions to support legacy decoders and add unified …

2b999a8

…decode methods

🐛 fix top-level dot splitting in keys to preserve encoded dots and ha…

316a61b

…ndle degenerate cases

🐛 normalize percent-encoded dots in bracketed keys when decode_dot_in…

6c8b384

…_keys is enabled

✅ add tests for DecodeOptions dot-in-keys and custom decoder behaviors

2e0e688

✅ add CSharp parity tests for encoded dot behavior in DecodeOptions

8ed48ab

✅ add tests for decoder precedence over legacy_decoder and non-string…

9a3ce18

… decoder results in DecodeOptions

🐛 handle leading dot in keys by converting to bracket segment in dot_…

a4dbc60

…to_bracket_top_level

✅ add tests for dot encoding and decoding parity across DecodeOptions…

8fee561

… configurations

💡 update docstring to clarify handling of degenerate cases and unterm…

b4cb14a

…inated brackets in top-level dot splitting

💡 clarify docstrings for list_limit, comma-splitting, and percent-enc…

d0dd3c7

…oded dot handling in decode logic

techouse self-assigned this Aug 23, 2025

techouse added the bug Something isn't working label Aug 23, 2025

♻️ update type annotations in decode_options_test for decoder and leg…

f04d9c6

…acy_decoder signatures

coderabbitai Bot reviewed Aug 23, 2025

View reviewed changes

Comment thread tests/unit/decode_options_test.py Outdated

Comment thread tests/unit/decode_options_test.py

Comment thread tests/unit/decode_options_test.py

Comment thread tests/unit/decode_test.py

techouse added 6 commits August 23, 2025 19:06

✅ revise decode test to avoid duplicate dict key assertion and ensure…

2cf3612

… decoder invocation for dot-encoded and bracketed keys

💡 update comment to clarify top-level dot splitting behavior in decod…

9b62792

…e logic

💡 add comment to clarify handling of typing.Literal annotations for k…

d416d9d

…ind parameter in decode logic

💡 clarify comment on comma-split list logic and list_limit enforcemen…

b8e88e3

…t in decode function

💡 add comment to clarify normalization of percent-encoded brackets in…

089cfa4

… query string prior to splitting

💡 add comment to clarify conservative heuristic for list length enfor…

9beee76

…cement in decode logic

coderabbitai Bot reviewed Aug 23, 2025

View reviewed changes

Comment thread tests/unit/decode_test.py

techouse added 3 commits August 23, 2025 20:59

💡 update comment to clarify percent-decoding behavior and handling of…

51b3c75

… encoded dots in dot_to_bracket_top_level

🐛 fix strict_depth enforcement to avoid raising on unterminated brack…

1b8e41e

…et groups in decode logic

💡 clarify docstring on decode_key to note coercion of non-string deco…

5887b7a

…der outputs via str()

coderabbitai Bot reviewed Aug 23, 2025

View reviewed changes

techouse added 2 commits August 23, 2025 21:58

🐛 fix dot-to-bracket decoding to preserve leading dots in consecutive…

77e8342

… dot sequences

💡 add examples to docstring for decode_key to illustrate max_depth an…

43c2df8

…d unterminated bracket handling

techouse added 3 commits August 23, 2025 22:04

💡 document that 'kind' parameter in decode_scalar is ignored and may …

875abe2

…be removed in future

💡 clarify KEY docstring to note default decoder behavior for percent-…

1d9627f

…encoded dots

✅ add tests for split_key_into_segments remainder handling and strict…

dba5961

… depth enforcement

coderabbitai Bot reviewed Aug 23, 2025

View reviewed changes

🐛 fix percent-decoding to handle dot in keys and clarify top-level pe…

b0e17c5

…rcent sequence handling

coderabbitai Bot reviewed Aug 23, 2025

View reviewed changes

Comment thread src/qs_codec/utils/decode_utils.py

Comment thread src/qs_codec/utils/decode_utils.py

techouse added 3 commits August 23, 2025 23:36

🐛 handle ambiguous '.]' in key decoding and prevent bracket segment o…

877c7eb

…verrun on closing brackets

💡 update decode_kind docstring to clarify scalar decoder behavior for…

680bfa5

… percent-decoded dots in keys

💡 clarify docstring to specify dot splitting only occurs on actual '.…

a710b6d

…' at depth 0, not percent-encoded sequences

techouse merged commit bca7bcc into main Aug 23, 2025
14 of 17 checks passed

techouse deleted the fix/decode-dot-in-keys branch August 23, 2025 23:16

coderabbitai Bot mentioned this pull request Mar 4, 2026

⚡ optimize decoder #44

Merged

Uh oh!

Conversation

techouse commented Aug 23, 2025 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Aug 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Poem

Chat

Support

CodeRabbit Commands (Invoked using PR/Issue comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Status, Documentation and Community

Uh oh!

codecov Bot commented Aug 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

codacy-production Bot commented Aug 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage summary from Codacy

See diff coverage on Codacy

See your quality gate settings Change summary preferences

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

techouse commented Aug 23, 2025 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Aug 23, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

codecov Bot commented Aug 23, 2025 •

edited

Loading

codacy-production Bot commented Aug 23, 2025 •

edited

Loading