Create a geo_diff_10 configuration by aneubeck · Pull Request #96 · github/rust-gems

aneubeck · 2026-02-11T13:13:36Z

This configuration increases precision compared to geo_diff_7 without consuming as much memory as geo_diff_13.

Copilot

Pull request overview

This pull request adds a new geo_diff_10 configuration to the geo_filters crate, providing an intermediate precision option between the existing geo_diff_7 and geo_diff_13 configurations. The configuration uses b=10 with a relative error standard deviation of ~0.04, offering a balanced trade-off between memory usage and accuracy.

Changes:

Added GeoDiffConfig10 and GeoDiffCount10 types with b=10, using u32 bucket type, 896 bytes, and 64 MSB
Added test coverage for the estimation lookup table for the new configuration
Updated evaluation tooling to include geo_diff_10 in accuracy measurements and plot generation
Added documentation and generated accuracy plot for the new configuration
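
As a rough illustration of the pattern summarized above (a configuration type plus a count type alias parameterized by it), here is a minimal, self-contained sketch. The names `SketchConfig`, `SketchConfig10`, `SketchCount`, and `describe` are hypothetical stand-ins, not the crate's actual definitions, which are not shown on this page. Only the values (b=10, u32 buckets, 896 bytes, 64 MSB) and the alias shape are taken from the summary and the diff.

```rust
use std::marker::PhantomData;

/// Hypothetical stand-in for a filter configuration; not the crate's real trait.
trait SketchConfig {
    /// The `b` parameter from the PR summary.
    const B: usize;
    /// Bucket integer type.
    type Bucket;
    /// The "bytes" parameter from the PR summary.
    const BYTES: usize;
    /// The "MSB" parameter from the PR summary.
    const MSB: usize;
}

/// Values copied from the PR summary: b=10, u32 buckets, 896 bytes, 64 MSB.
struct SketchConfig10;

impl SketchConfig for SketchConfig10 {
    const B: usize = 10;
    type Bucket = u32;
    const BYTES: usize = 896;
    const MSB: usize = 64;
}

/// Hypothetical count type, generic over a configuration.
struct SketchCount<'a, C: SketchConfig> {
    _marker: PhantomData<&'a C>,
}

impl<'a, C: SketchConfig> SketchCount<'a, C> {
    fn new() -> Self {
        SketchCount { _marker: PhantomData }
    }
}

/// Mirrors the alias shape visible in the diff for GeoDiffCount7.
type SketchCount10<'a> = SketchCount<'a, SketchConfig10>;

/// Formats the parameters of any configuration for inspection.
fn describe<C: SketchConfig>() -> String {
    format!(
        "b={}, bucket={} bytes, bytes={}, msb={}",
        C::B,
        std::mem::size_of::<C::Bucket>(),
        C::BYTES,
        C::MSB
    )
}
```

With this layout, a new precision level only needs a new configuration type and a one-line alias; the generic count type itself is unchanged.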

Reviewed changes

Copilot reviewed 5 out of 12 changed files in this pull request and generated no comments.

File	Description
crates/geo_filters/src/diff_count/config.rs	Adds GeoDiffConfig10 type definition and test_estimation_lut_10 test
crates/geo_filters/src/diff_count.rs	Exports GeoDiffConfig10 and defines GeoDiffCount10 type alias
crates/geo_filters/scripts/generate-accuracy-plots	Updates script to include geo_diff_10 in accuracy plot generation
crates/geo_filters/evaluation/accuracy.rs	Adds geo_diff_10 to evaluation CLI and simulation parsers
crates/geo_filters/evaluation/accuracy.md	Adds reference to geo_diff_10 accuracy plot
crates/geo_filters/evaluation/accuracy/hll_8.png	Generated accuracy plot (binary file)

itsibitzi

Neat!

itsibitzi · 2026-02-11T13:38:32Z

crates/geo_filters/scripts/generate-accuracy-plots

+    -o accuracy.csv -n 10000 -m 5000000 geo_diff_{7,10,13} geo_distinct_{7,13} hll_{8,14} "$@"

 evaluation/plot-accuracy.r accuracy.csv

 rm -f "$plots_dir"/*

 idx=0
-for c in geo_diff_{7,13} geo_distinct_{7,13} hll_{8,14}; do
+for c in geo_diff_{7,10,13} geo_distinct_{7,13} hll_{8,14}; do


Do we want to include this configuration in the geo distinct plots here too?

mmm. I don't need those yet...
We could do it in a separate run (rerunning all these evaluations burned quite a lot of my CPU for quite some time).

itsibitzi · 2026-02-11T13:39:46Z

crates/geo_filters/src/diff_count.rs

 /// Diff count filter with a relative error standard deviation of ~0.125.
 pub type GeoDiffCount7<'a> = GeoDiffCount<'a, GeoDiffConfig7>;

+/// Diff count filter with a relative error standard deviation of ~0.04.


Do you know if Copilot read the evals to get the relative error?

I don't know. But you can see from the graphs that it is slightly above 0.04.
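
For readers who want to check such a figure against raw simulation output rather than reading it off a plot, here is a minimal sketch of one way to compute a relative error standard deviation. It assumes the metric is the population standard deviation of (estimate - true count) / true count over repeated runs; whether the crate's evaluation tooling uses exactly this definition is an assumption, and `relative_error_std_dev` is a hypothetical helper, not part of geo_filters.

```rust
/// Population standard deviation of the relative error
/// `(estimate - true_count) / true_count` over a set of estimates.
/// Returns `None` when there are no estimates or the true count is zero.
fn relative_error_std_dev(estimates: &[f64], true_count: f64) -> Option<f64> {
    if estimates.is_empty() || true_count == 0.0 {
        return None;
    }
    let n = estimates.len() as f64;
    // Relative error of each individual estimate.
    let rel: Vec<f64> = estimates
        .iter()
        .map(|e| (e - true_count) / true_count)
        .collect();
    // Mean relative error (the bias); subtracted so that only the spread is measured.
    let mean = rel.iter().sum::<f64>() / n;
    let variance = rel.iter().map(|r| (r - mean).powi(2)).sum::<f64>() / n;
    Some(variance.sqrt())
}
```

For example, two estimates of 96 and 104 for a true count of 100 have relative errors of -0.04 and +0.04, so the function returns approximately 0.04.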

Create a geo_diff_10 configuration

7d52718

Copilot AI review requested due to automatic review settings February 11, 2026 13:13

aneubeck requested a review from a team as a code owner February 11, 2026 13:13

Copilot started reviewing on behalf of aneubeck February 11, 2026 13:14

Copilot AI reviewed Feb 11, 2026

itsibitzi approved these changes Feb 11, 2026

aneubeck enabled auto-merge February 11, 2026 15:48

aneubeck disabled auto-merge February 11, 2026 16:02

aneubeck merged commit 49c1f0d into main Feb 11, 2026
13 checks passed

aneubeck deleted the aneubeck/geofilter10 branch February 11, 2026 16:02
