Token Count Mismatch
PastePrompt token counts are estimates. They can differ from the final count shown by ChatGPT, Claude, Gemini, Codex, Cursor, or provider APIs.
Common causes
- Different tokenizer implementations.
- Chat message wrappers.
- Attachment handling.
- System prompts added by the external tool.
- Formatting changes after copy or export.
- Model-specific context accounting.
What to do
- Leave margin below the target model limit.
- Remove low-value files.
- Split the review into smaller bundles.
- Keep instructions concise.
- Use the external tool's final count as authoritative for that tool.
Practical margins
For large reviews, do not aim for the exact advertised context limit. Leave room for:
- the LLM provider's system prompt and message wrappers;
- follow-up instructions;
- model-generated reasoning and answer text;
- provider-side formatting or attachment conversion.
If the provider rejects a bundle, split by module, workflow, or question:
- Start with Git metadata, file map, and prompt instructions.
- Add only the files needed for the first question.
- Export a second bundle for dependencies, tests, or follow-up evidence.
- Keep the same output format so comparisons remain easier.
When estimates differ by a lot
Large differences usually mean the external tool is counting something PastePrompt cannot see, such as chat wrappers, hidden system context, attachments, or provider-specific normalization. Treat PastePrompt counts as planning estimates and the provider count as authoritative for that provider.