Back Professions
Back Dating
Back Writing Tools
Back Programming Tools
Back AI Chat
Back AI Image
Back AI Video
How-to guide

How to Use Claude for Long Documents

200K tokens of context is a lot. Here's how to actually use it.

8 min read Beginner difficulty 8 steps

Last updated

Claude Sonnet 4's 200K-token context window fits roughly 150,000 words - a typical novel, a thick research paper, a 300-page contract. But "can fit" isn't the same as "will use well."

The difference shows up in how you ask. Treat Claude like a search box ("summarise this") and you get search-box results - Shallow, generic, occasionally wrong. Treat it like a junior analyst who has genuinely read the whole document and can be ordered to show their work, and you get something no other workflow gives you: 300 pages of institutional knowledge on tap, with citations.

This guide covers the techniques that make the second mode reliable: load verification, structural questioning, forced citations, and the known failure modes of long-context models.

Step-by-step guide

Upload the document

On Claude Sonnet 4 on AskAI.free, drag-and-drop your PDF, doc or text file into the chat. Multimodal Claude reads PDFs natively - Including images and tables - But text-based PDFs work best.

Size sanity check before you start: a dense page runs 500-700 tokens, so 200K comfortably holds about 300 pages. If your document is bigger, split it now rather than discovering truncation later. And prefer the original digital PDF over a scan whenever you have the choice - OCR introduces exactly the kind of small numeric errors that matter in contracts and financial reports.

Ask Claude to verify it actually read the document

Don't assume Claude read everything just because it accepted the file. Always start with:

"Confirm you can see this document. What's its title, length, and overall structure (sections, chapters)?"

If Claude can't summarise the structure, retry the upload. Don't ask substantive questions until you've confirmed the document loaded.

Add one probe for the ending: "What does the final section cover?" A model that only received the first two-thirds of a file will answer the structure question plausibly from what it saw - The ending probe is what actually catches truncation. Expected response when everything's fine: specific section names and an accurate description of the last pages, not generic phrasing like "the document concludes by summarising its themes."

Use targeted, structural questions

Long-context models work best when your question maps to specific sections of the document. Bad question:

"Summarise this 300-page contract."

Better questions:

  • "What does Section 8 say about termination?"
  • "Find every place that mentions data retention. Quote the relevant clauses."
  • "List all the financial obligations of party A, with the page numbers."

If you don't know the document's structure yet, make that the first question: "Give me an annotated table of contents - For each section, one line on what it covers." Now you have a map, and every later question can point at territory. The 'find every place that mentions X' pattern is the long-context superpower - It's the task humans do worst on 300 pages and Claude does best.

Force citations

Always ask Claude to cite which page or section a claim comes from:

"For every answer, quote the relevant sentence and give the page or section number it appears in."

This forces Claude to actually retrieve from the document instead of guessing. It also lets you verify quickly.

Say it once at the start of the conversation and it applies to everything after - Effectively a system prompt for the session. The failure this prevents is subtle: without forced citations, Claude blends what the document says with what documents like it usually say. The blend sounds authoritative and is occasionally wrong precisely where your contract deviates from the standard template - Which is usually the part you care about.

Iterate and follow up

Claude keeps the whole document in context for the entire conversation, so you can ask follow-ups indefinitely. Use this for deep dives:

  • "In the section you just quoted, what does 'reasonable efforts' typically mean in contract law?"
  • "Compare the termination clauses to standard SaaS contracts."
  • "Which clause is most unfavourable to me as the buyer?"

Notice the shift in that list: the first question retrieves, the later ones ask for judgment grounded in the retrieval. That two-beat rhythm - Quote it, then interpret it - Is the most productive way to work through a long document. One caution: in very long conversations, periodically re-anchor with "quote the clause again before analysing" so drift doesn't creep in.

Be aware of the 'lost in the middle' effect

Even with 200K context, Claude (and every other long-context model) pays slightly less attention to information in the middle of a long document. Beginning and end get more weight.

If you're working with a critical document, ask explicitly: "Have you actually read the middle sections? Quote one sentence from page 100." If Claude can't, you may need to chunk the document.

For high-stakes work, a belt-and-braces pass: ask the same critical question twice, once against the full document and once with just the relevant 20 pages pasted in. Matching answers are strong confirmation; diverging answers tell you the full-document pass got fuzzy and the section-level answer is the one to trust.

Compare multiple documents in one context

200K isn't just for one big file - It's for several related ones. Upload last year's contract and this year's renewal and ask:

"These are the 2025 and 2026 versions of the same agreement. List every substantive change, quote both versions of each changed clause, and flag which changes favour the vendor."

This is tedious, error-prone work for a human and close to free for Claude. Other variants: a research paper plus its three main rebuttals, your policy document against the new regulation it must comply with, or four supplier proposals against your requirements list. Keep it under 4-5 documents per conversation; beyond that, attribution starts slipping and you should name documents explicitly in every question.

Worked example: due diligence on a 280-page filing

An analyst needs the risk picture from a 280-page annual report by tomorrow. Upload to Claude Sonnet 4 (fits, barely - The token math said ~190K). Verification probe passes: Claude correctly describes the final section's litigation disclosures.

The session: annotated table of contents first. Then "quote every passage about debt covenants, with page numbers" - Six passages surface, including one buried in a footnote on page 211. Then judgment questions: "which disclosed risk changed most from the language in the previous filing?" (the prior year's report is added to the same chat for comparison). Claude flags a new going-concern qualifier; the analyst verifies the quoted pages by hand, confirms, and that finding leads the memo.

Three hours of reading became 40 minutes of directed interrogation - And every claim in the memo carries a page number a sceptical partner can check.

Claude Sonnet 4 is the strongest long-document model that's broadly available. For documents over 200K tokens (roughly 500+ pages of dense text), Gemini 2.5 Pro's 2M-token context is the only option - But Claude is more accurate within its limit. See Claude vs Gemini for the head-to-head.

Troubleshooting quick hits: vague answers about late sections - Truncation, re-upload or split. Confident claims with no quotes - You forgot to force citations. Slow responses - Normal; 200K tokens of attention costs seconds, not milliseconds. Numbers that don't match the PDF - Scanned source and OCR noise; verify against the original.

Both models are available on AskAI.free Pro with a 7-day free trial.

Related tools and guides

Try the techniques above on AskAI.free - Your first question is free.

Start a free chat →

FAQ

How long is 200K tokens?

Roughly 150,000 words, or about 300 pages of dense single-spaced text - A 600-page book in typical paperback formatting. The conversion that matters: one dense page is 500-700 tokens, so divide 200,000 by your page density. Remember the window holds the whole conversation, not just the file: a 280-page upload plus fifty long Q&A; exchanges can eventually push early content out. Paste a sample page into our token counter for an exact read on your document's density.

Is Claude better than ChatGPT for long documents?

Yes, and it's one of the clearest gaps between them. Claude Sonnet 4's 200K window is bigger than ChatGPT 4o's 128K, and in our testing Claude retrieves mid-document details more reliably and fabricates less when asked about sections near its attention limits. ChatGPT remains fine for documents under ~100 pages. For truly massive inputs, Gemini 2.5 Pro's 2M window beats both on capacity while trailing Claude on retrieval precision within range.

Can Claude process scanned PDFs?

Yes - Claude has built-in OCR and handles clean scans well. Quality tracks scan quality: crisp 300-dpi office scans are near-perfect, while skewed phone photos, faxed pages and dense tables produce character-level errors. The dangerous failure is numeric: a 6 read as an 8 in a contract figure. For anything where exact numbers matter from a scanned source, run dedicated OCR first or verify every quoted figure against the page image yourself.

Does Claude remember my document between conversations?

No. Each conversation is self-contained: close the chat and the document is gone from context, which is also the privacy-friendly behaviour you want. Keep one long-running conversation per document while you're working on it rather than re-uploading daily. If you need persistent shared context across many sessions, that's what Anthropic's Projects feature on claude.ai addresses; on AskAI.free, the practical equivalent is keeping the document conversation open and bookmarked.

What file types work besides PDF?

Plain text, markdown, Word documents and common code files all work, and plain text is actually the most reliable - No extraction layer to introduce errors. Spreadsheets are the weak spot: CSV is fine for modest tables, but complex multi-sheet Excel files with formulas lose structure in extraction. For data-heavy analysis, export the relevant sheet to CSV first. If a file fails to upload, the universal fallback is copy-pasting the text directly into the chat; it's identical to Claude once it's in context.

Other guides