A proxy pacing scorecard should measure field completeness, retry pressure, market consistency, response time, and replay quality before a team changes crawler speed. It is useful for authorized public data collection queues, especially when successful responses still produce weak records.
The scorecard starts with required fields
The target user is a data engineering team responsible for crawler reliability. Before changing proxy pacing, define the required fields for each page type: title, price, availability, result URL, snippet, region cue, timestamp, or another task-specific field.
A queue is not healthy just because it returns a status code. If required fields are missing, the record may be unusable for price monitoring, SERP monitoring, or public catalog analysis.
Five signals should be scored together
| Signal | What it shows |
|---|---|
| Field completeness | Whether the public record can support analysis |
| Retry pressure | Whether pacing is stressing a queue |
| Market consistency | Whether the sample matches the intended region |
These signals should be grouped by proxy lane, market, page type, and collection window. A single global average hides the queue that actually needs attention.

Score changes should trigger small replays
When the score drops, replay a small public sample with lower concurrency and the same field rules. If completeness recovers, proxy pacing was likely too aggressive for that page class.
If completeness does not recover, inspect parser rules, page variants, and market cues before changing proxy capacity. The scorecard is a diagnostic tool, not a replacement for evidence.
Capacity decisions need a threshold
Teams should set a threshold for acceptable field completeness and market consistency. Capacity expansion should wait until replay shows the queue is healthy but lacks enough lane coverage for the required volume.
This keeps cost decisions tied to measurable crawler reliability instead of isolated retry spikes.
FAQ
What should a proxy pacing scorecard measure first?
It should measure field completeness first, because a successful response is still weak if required public fields are missing.
When should proxy pacing be reduced?
Reduce pacing when retries, response time, or field loss rise within a specific lane, market, or page type.
