Add LEB128 functions for Nat and Int #1

f0i · 2025-07-30T09:50:49Z

Add functions for LEB128 encoding and decoding of Nat and Int values.
Add doc comments to LEB128 functions
Add additional keywords to mops.toml

Copilot

Pull Request Overview

This PR adds LEB128 encoding and decoding functions for Nat and Int types, extending the existing functionality that only supported Nat64 and Int64. It also improves documentation by adding doc comments to all LEB128 functions and updates package metadata with additional keywords.

Implements new LEB128 functions for Nat and Int types (both encoding and decoding)
Adds comprehensive documentation comments to all LEB128 functions
Updates package keywords to better reflect the library's functionality

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
src/lib.mo	Adds new LEB128 functions for `Nat`/`Int` types and doc comments for all LEB128 functions
tests/ByteUtils.Test.mo	Adds test coverage for the new `Nat` and `Int` LEB128 functions
tests/Sorted.Test.mo	Removes unused imports to clean up test file
mops.toml	Updates keywords array with more descriptive encoding-related terms

tomijaga · 2025-07-30T11:35:29Z

Hey @f0i, thanks for the CR.
The implementation looks good. I have just one comment before I merge.

Could you add a few more test cases for Nat values greater than 2^64?
You could either add a few hard-coded values or use the fuzz library to generate some.
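A randomized round-trip check of that kind could look like the Python sketch below. It is not the library's Motoko code: `uleb128_encode` and `uleb128_decode` are hypothetical reference helpers written only for illustration, and the seed and value range are arbitrary assumptions.

```python
import random


def uleb128_encode(value: int) -> bytes:
    """Hypothetical reference encoder: unsigned LEB128, 7 payload bits per byte."""
    if value < 0:
        raise ValueError("ULEB128 cannot encode negative values")
    out = bytearray()
    while True:
        byte = value & 0x7F          # low 7 bits of the remaining value
        value >>= 7
        if value:
            out.append(byte | 0x80)  # continuation bit: more bytes follow
        else:
            out.append(byte)         # last byte has the high bit clear
            return bytes(out)


def uleb128_decode(data: bytes) -> int:
    """Hypothetical reference decoder matching uleb128_encode."""
    result = 0
    for index, byte in enumerate(data):
        result |= (byte & 0x7F) << (7 * index)
        if not byte & 0x80:
            return result
    raise ValueError("truncated ULEB128 input")


# Round-trip random values that are all at least 2**64 (arbitrary fixed seed).
rng = random.Random(0)
for _ in range(1000):
    n = 2 ** 64 + rng.getrandbits(128)
    assert uleb128_decode(uleb128_encode(n)) == n
```

A round trip only shows that the encoder and decoder agree with each other, so a few hard-coded byte arrays from an independent encoder are still worth having alongside it.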

tomijaga · 2025-07-30T11:40:38Z

The benchmark workflows are failing for an unrelated reason: mops uses the latest moc version, which can be incompatible with the base package at times. I plan to update the workflow to use the version specified in the mops.toml file.

f0i · 2025-07-31T16:13:01Z

I added some test cases.
Here is the Python script I used to generate the expected byte arrays without relying on my own implementation:

#!/usr/bin/env python3

# Requires the third-party `leb128` package (pip install leb128).
import leb128

def testULEB128(name, value):
    # Unsigned LEB128: use the package's unsigned encoder.
    print(name, value, [f'0x{byte:02x}' for byte in leb128.u.encode(value)])

def testSLEB128(name, value):
    # Signed LEB128: use the package's signed encoder.
    print(name, value, [f'0x{byte:02x}' for byte in leb128.i.encode(value)])

print("ULEB128")
testULEB128("2 ** 64", 2 ** 64)
testULEB128("2 ** 65", 2 ** 65)
testULEB128("2 ** 70", 2 ** 70)
testULEB128("2 ** 64 + 1", 2 ** 64 + 1)
testULEB128("123456789012345678901234567890", 123456789012345678901234567890)
print("")
print("SLEB128")
testSLEB128("2 ** 64", 2 ** 64)
testSLEB128("2 ** 65", 2 ** 65)
testSLEB128("2 ** 70", 2 ** 70)
testSLEB128("2 ** 64 + 1", 2 ** 64 + 1)
testSLEB128("123456789012345678901234567890", 123456789012345678901234567890)
testSLEB128("-1 * (2 ** 64)", -1*(2 ** 64))
testSLEB128("-1 * (2 ** 65)", -1*(2 ** 65))
testSLEB128("-1 * (2 ** 70)", -1*(2 ** 70))
testSLEB128("-1 * (2 ** 64 + 1)", -1*(2 ** 64 + 1))
testSLEB128("-123456789012345678901234567890", -123456789012345678901234567890)

Output of that script:

ULEB128
2 ** 64 18446744073709551616 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x02']
2 ** 65 36893488147419103232 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x04']
2 ** 70 1180591620717411303424 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x01']
2 ** 64 + 1 18446744073709551617 ['0x81', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x02']
123456789012345678901234567890 123456789012345678901234567890 ['0xd2', '0x95', '0xfc', '0xf1', '0xe4', '0x9d', '0xf8', '0xb9', '0xc3', '0xed', '0xbf', '0xc8', '0xee', '0x31']

SLEB128
2 ** 64 18446744073709551616 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x02']
2 ** 65 36893488147419103232 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x04']
2 ** 70 1180591620717411303424 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x01']
2 ** 64 + 1 18446744073709551617 ['0x81', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x02']
123456789012345678901234567890 123456789012345678901234567890 ['0xd2', '0x95', '0xfc', '0xf1', '0xe4', '0x9d', '0xf8', '0xb9', '0xc3', '0xed', '0xbf', '0xc8', '0xee', '0x31']
-1 * (2 ** 64) -18446744073709551616 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x7e']
-1 * (2 ** 65) -36893488147419103232 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x7c']
-1 * (2 ** 70) -1180591620717411303424 ['0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x80', '0x7f']
-1 * (2 ** 64 + 1) -18446744073709551617 ['0xff', '0xff', '0xff', '0xff', '0xff', '0xff', '0xff', '0xff', '0xff', '0x7d']
-123456789012345678901234567890 -123456789012345678901234567890 ['0xae', '0xea', '0x83', '0x8e', '0x9b', '0xe2', '0x87', '0xc6', '0xbc', '0x92', '0xc0', '0xb7', '0x91', '0x4e']
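To see where the negative rows come from, here is a minimal pure-Python sketch of signed LEB128 that is checked against a few of the rows above. It is not the code from this PR or from the `leb128` package; `sleb128_encode` and `sleb128_decode` are hypothetical helpers that follow the usual two's-complement formulation of the format.

```python
def sleb128_encode(value: int) -> bytes:
    """Hypothetical reference encoder: signed LEB128 for arbitrary-size ints."""
    out = bytearray()
    while True:
        byte = value & 0x7F   # low 7 bits in two's complement
        value >>= 7           # Python's >> is an arithmetic (sign-preserving) shift
        sign_bit_set = bool(byte & 0x40)
        # Stop once the rest is pure sign extension that agrees with bit 6.
        if (value == 0 and not sign_bit_set) or (value == -1 and sign_bit_set):
            out.append(byte)
            return bytes(out)
        out.append(byte | 0x80)  # continuation bit: more bytes follow


def sleb128_decode(data: bytes) -> int:
    """Hypothetical reference decoder matching sleb128_encode."""
    result = 0
    shift = 0
    for byte in data:
        result |= (byte & 0x7F) << shift
        shift += 7
        if not byte & 0x80:
            if byte & 0x40:           # sign bit of the final byte
                result -= 1 << shift  # sign-extend
            return result
    raise ValueError("truncated SLEB128 input")


# Cross-check against rows from the script output above.
assert sleb128_encode(2 ** 64) == bytes([0x80] * 9 + [0x02])
assert sleb128_encode(-(2 ** 64)) == bytes([0x80] * 9 + [0x7E])
assert sleb128_encode(-(2 ** 70)) == bytes([0x80] * 10 + [0x7F])
assert sleb128_encode(-(2 ** 64 + 1)) == bytes([0xFF] * 9 + [0x7D])
assert sleb128_decode(sleb128_encode(-(2 ** 65))) == -(2 ** 65)
```

The positive rows are identical in both sections because bit 6 of the final byte happens to be clear for each of those values, so the signed form needs no extra padding byte.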

tomijaga · 2025-08-01T20:59:22Z

Thanks for adding these test cases. I'll merge and update the mops version.

tomijaga · 2025-08-01T21:21:47Z

@f0i v0.1.0 is published on mops: https://mops.one/byte-utils@0.1.0

f0i added 4 commits July 30, 2025 09:01

Add ULEB128 and SLEB128 encoding for Nat and Int values

3fbbd82

Add keywords for discoverability

dc543c8

Add doc comments to leb128 functions

a7e4375

Add note to LEB128 functions that can trap

6d9620b

tomijaga requested a review from Copilot July 30, 2025 11:22

Copilot AI reviewed Jul 30, 2025

NatLabs deleted a comment from Copilot AI Jul 30, 2025

Add test cases for LEB128 encoding of values larger than 2**64

b68627a

tomijaga merged commit dd58c06 into NatLabs:main Aug 1, 2025
1 of 2 checks passed
