PyTorch Search Option Performance Benchmark¶

Generated: 2026-02-22T05:54:36.060215+00:00

Snapshot note: values depend on repository state, cache state, and hardware. Re-run with the same command before release/perf decisions.

Scope¶

Measures cgrep search latency and payload size for practical option/scenario pairs.
Includes indexed keyword paths and scan-mode (--no-index, --regex) paths.
Success means expected marker evidence appears in structured JSON results.

Case	Scenario	Options	Success	P50 latency (ms)	P95 latency (ms)	P50 payload tokens	P50 results
`default_autograd`	autograd evaluate_function	`(default)`	100.0%	19.10	19.50	838	16
`path_scoped_autograd`	autograd evaluate_function	`(default)`	100.0%	15.85	16.38	323	5
`type_cpp_parser`	python arg parser	`--type cpp`	100.0%	17.49	18.28	998	20
`glob_cpp_cuda`	cuda graph	`--glob *.cpp`	100.0%	17.22	17.68	335	5
`context_addmm`	addmm call path	`-C 2`	100.0%	17.03	17.83	210	2
`budget_tight_dispatch`	dispatch key set	`-B tight`	100.0%	16.28	16.68	1000	20
`budget_full_dispatch`	dispatch key set	`-B full`	100.0%	15.75	15.87	1025	20
`profile_fast_dispatch`	dispatch key set	`-P fast`	100.0%	16.04	16.60	1006	20
`payload_compact_dispatch`	dispatch key set	`--path-alias --dedupe-context --suppress-boilerplate`	100.0%	16.02	17.19	981	20
`fuzzy_tensor_iterator`	tensor iterator symbol lookup	`--fuzzy`	100.0%	21.28	22.49	1002	20
`scan_no_index_autograd`	autograd evaluate_function	`--no-index`	100.0%	19.10	20.12	320	5
`scan_regex_addmm`	addmm regex search	`--regex --no-index`	100.0%	46.69	66.01	1161	20

Root search -> scoped search latency change: -17.0% (default_autograd vs path_scoped_autograd).
Indexed scoped -> --no-index scan latency change: 20.5%.
Scan -> regex scan latency change: 144.5%.
-B tight -> -B full payload token change: 2.5%.
-B tight -> -P fast latency change: -1.5%.

python3 scripts/benchmark_search_option_performance.py --repo /path/to/pytorch --cgrep-bin /path/to/cgrep