Skip to main content

AI Document Analysis

Automatically classify and analyze documents with AI-powered intelligence.


What is AI Document Analysis?

AI Document Analysis uses artificial intelligence to understand your documents instantly:

  • Auto-Classification: Categorize documents (contract, invoice, legal, personal, etc.)
  • Smart Summaries: Generate human-readable summaries in seconds
  • Entity Extraction: Identify names, dates, amounts, and organizations
  • Key Points: Extract the most important information
  • Confidence Scoring: Know how confident the AI is about its analysis

Why Use It?

Save Time:

  • Analyze 100 documents in minutes (not hours)
  • No manual reading required for initial classification
  • Instant summaries for quick review

Improve Organization:

  • Auto-tag documents by type
  • Extract metadata for search
  • Group related documents automatically

Find Information Faster:

  • Search by extracted entities
  • Filter by document category
  • Locate key information in seconds

Document Classification

How It Works

Upload a document and AI automatically classifies it:

Supported Categories:

  • Contract: Employment contracts, service agreements, NDAs
  • Invoice: Bills, receipts, payment records
  • Legal: Court documents, legal notices, filings
  • Report: Business reports, research papers, analyses
  • Correspondence: Letters, memos, emails
  • Financial: Bank statements, tax documents, financial reports
  • Personal: Personal letters, diaries, notes
  • Other: Uncategorizable documents

Example Analysis:

{
"category": "contract",
"confidence": 0.95,
"summary": "Employment agreement between TechCorp and Jane Smith for Senior Software Engineer position, starting January 15, 2024.",
"keyPoints": [
"Position: Senior Software Engineer",
"Salary: $150,000 annually",
"Start Date: January 15, 2024",
"Benefits: Health insurance, 401(k), stock options",
"At-will employment"
],
"entities": {
"names": ["TechCorp", "Jane Smith"],
"dates": ["2024-01-15"],
"amounts": ["$150,000"]
}
}

Using Classification

Automatic Classification:

  1. Enable SettingsAI PreferencesAuto-Classification
  2. Upload document to vault
  3. AI classifies automatically in background
  4. View results in document details

Manual Classification:

  1. Go to Documents → Select document
  2. Click Analyze with AI button
  3. Review classification results
  4. Accept or modify classification

Bulk Classification:

  1. Go to Documents → Select multiple documents
  2. Click Bulk ActionsClassify with AI
  3. AI processes all selected documents
  4. Review results summary

Confidence Scores

Understanding Confidence:

  • 0.95-1.00: Extremely confident (very likely correct)
  • 0.85-0.94: High confidence (usually correct)
  • 0.70-0.84: Medium confidence (review recommended)
  • Below 0.70: Low confidence (manual review required)

What Affects Confidence:

  • Document clarity and quality
  • Standard vs. custom document formats
  • Language complexity
  • Document length and structure

Low Confidence? Tips:

  • Upload higher quality scans
  • Use OCR-processed PDFs (searchable text)
  • Provide context (add document title/description first)
  • Split combined documents (multiple contracts in one file)

Deep Document Analysis

Advanced Analysis Features

Beyond Classification:

  • Detailed content analysis
  • Relationship mapping between entities
  • Chronological event extraction
  • Risk and compliance flagging
  • Similar document detection

Use Cases:

  • Legal Review: Identify key clauses, obligations, deadlines
  • Contract Management: Extract terms, renewal dates, parties
  • Financial Analysis: Extract transactions, amounts, account numbers
  • Research: Summarize papers, extract key findings, citations

Using Deep Analysis

Analyze a Document:

  1. Go to Documents → Select document
  2. Click AI AnalysisDeep Analysis
  3. Wait for processing (10-30 seconds)
  4. Review detailed analysis results

Analysis Output:

{
"summary": {
"brief": "One-paragraph overview",
"detailed": "Multi-paragraph analysis with context"
},
"entities": {
"people": ["Jane Smith (Employee)", "John Doe (CEO)"],
"organizations": ["TechCorp Inc.", "Acme Legal Services"],
"locations": ["San Francisco, CA", "Remote"],
"dates": [
{"date": "2024-01-15", "context": "Employment start date"},
{"date": "2025-01-15", "context": "First review date"}
],
"amounts": [
{"amount": "$150,000", "context": "Annual salary"},
{"amount": "$5,000", "context": "Signing bonus"}
]
},
"keyTerms": {
"obligations": [
"Confidentiality agreement",
"Non-compete clause (12 months)",
"Intellectual property assignment"
],
"rights": [
"Health insurance coverage",
"Stock options (10,000 shares, 4-year vesting)",
"Unlimited PTO policy"
],
"deadlines": [
{"date": "2024-01-10", "task": "Complete background check"},
{"date": "2025-01-15", "task": "Performance review"}
]
},
"riskFactors": [
"Non-compete clause may be unenforceable in California",
"No specified severance terms"
],
"relatedDocuments": [
"Employment Offer Letter - Jane Smith.pdf",
"TechCorp Employee Handbook 2024.pdf"
]
}

Entity Extraction

What Are Entities?

Entities are important pieces of information extracted from documents:

Supported Entity Types:

  • Names: People, organizations, products
  • Dates: Specific dates, date ranges, deadlines
  • Amounts: Money, percentages, quantities
  • Locations: Addresses, cities, countries, regions
  • Contact Information: Emails, phone numbers, websites
  • Identifiers: Account numbers, case numbers, reference IDs

Using Extracted Entities

Search by Entity:

Search: "Jane Smith"
Results: All documents mentioning Jane Smith

Search: "amount:>$100,000"
Results: Documents with amounts over $100,000

Search: "date:2024-01"
Results: Documents with dates in January 2024

Entity Timeline:

  • View all extracted dates chronologically
  • See which documents contain specific dates
  • Create timelines across multiple documents

Entity Relationships:

  • Map connections between people and organizations
  • Track document flow between parties
  • Visualize entity networks

Key Points Extraction

What Are Key Points?

Key Points are the most important takeaways from a document:

Examples by Document Type:

Contract:

  • Parties involved
  • Key obligations and rights
  • Payment terms and amounts
  • Important dates and deadlines
  • Termination conditions

Invoice:

  • Invoice number and date
  • Billed items and quantities
  • Total amount and payment terms
  • Billing party and recipient
  • Due date

Legal Document:

  • Case or matter number
  • Parties involved
  • Key claims or defenses
  • Important rulings or decisions
  • Next action dates

Report:

  • Main findings or conclusions
  • Key statistics and data points
  • Recommendations
  • Executive summary points
  • Methodology highlights

Using Key Points

Quick Review:

  1. Open document in viewer
  2. View Key Points panel (right sidebar)
  3. Review 3-10 bullet points
  4. Click any point to jump to source in document

Export Key Points:

  • Markdown: For notes and wikis
  • PDF: Professional summary document
  • CSV: Spreadsheet-compatible format
  • JSON: Programmatic access

Share Key Points:

  • Share summary without sharing full document
  • Create case briefing from key points
  • Generate reports from extracted points

Document Summaries

Summary Types

Brief Summary (2-3 sentences):

  • High-level overview
  • Main purpose or topic
  • Primary parties or entities

Example:

"Employment agreement between TechCorp and Jane Smith for Senior Software Engineer role. Outlines salary ($150K), benefits, and standard employment terms. Effective January 15, 2024."

Detailed Summary (1-2 paragraphs):

  • Comprehensive overview
  • Context and background
  • Key details and terms
  • Important conditions or clauses

Example:

"This employment agreement establishes the terms of employment between TechCorp Inc. and Jane Smith for the position of Senior Software Engineer, effective January 15, 2024. The agreement specifies an annual salary of $150,000, comprehensive benefits including health insurance and stock options (10,000 shares with 4-year vesting), and unlimited PTO. The position is at-will employment, with standard confidentiality, non-compete (12 months), and intellectual property assignment clauses. The agreement includes provisions for performance reviews, professional development stipend ($2,000 annually), and remote work flexibility."

Using Summaries

View Summary:

  • Document details page → AI Analysis tab
  • Document list view → Hover for quick summary
  • Search results → Summary preview

Customize Summaries:

  • Length: Brief, Standard, Detailed
  • Focus: General, Legal, Financial, Technical
  • Audience: Executive, Legal, Technical, General

Export Summaries:

  • Generate PDF with summary + document
  • Export summaries for multiple documents
  • Create executive briefing from summaries

Practical Examples

Scenario: You uploaded 50 mixed legal documents to your vault.

Steps:

  1. Select all 50 documents
  2. Bulk ActionsClassify with AI
  3. AI categorizes: 20 contracts, 15 invoices, 10 legal filings, 5 correspondence
  4. Auto-tag documents based on classification
  5. Review confidence scores, manually adjust low-confidence classifications
  6. Search and filter by category

Time Saved: 50 documents × 5 minutes each = 250 minutes manual → 10 minutes with AI


Example 2: Contract Analysis

Scenario: Reviewing employment contract before signing.

Steps:

  1. Upload contract PDF
  2. Analyze with AIDeep Analysis
  3. Review key points:
    • Salary and compensation
    • Benefits and perks
    • Obligations and restrictions
    • Termination conditions
  4. Check risk factors highlighted by AI
  5. Extract important dates and deadlines
  6. Export analysis to share with attorney

Insight: AI flags non-compete clause that may be unenforceable in your state.


Example 3: Invoice Processing

Scenario: Process monthly invoices from multiple vendors.

Steps:

  1. Upload 30 invoices
  2. AI auto-classifies as "invoice"
  3. Extract key information:
    • Invoice numbers
    • Due dates
    • Amounts
    • Vendor names
  4. Export to CSV
  5. Import to accounting software

Time Saved: 30 invoices × 3 minutes each = 90 minutes manual → 5 minutes with AI


Best Practices

When to Use AI Analysis

✅ Good Use Cases:

  • High volume of similar documents
  • Initial document review and categorization
  • Extracting structured data from unstructured documents
  • Creating summaries for quick reference
  • Finding specific information across many documents

❌ Not Recommended For:

  • Final legal review (always use human attorney)
  • Highly sensitive decisions based solely on AI
  • Documents in poor quality or non-standard formats
  • Languages other than English (accuracy may vary)

Improving Accuracy

Document Quality:

  • Upload high-resolution scans (300+ DPI)
  • Use searchable PDFs (OCR processed)
  • Avoid handwritten documents (low accuracy)
  • Ensure text is machine-readable

Provide Context:

  • Add document title and description
  • Tag documents with relevant metadata
  • Group related documents in folders
  • Use consistent naming conventions

Review Results:

  • Always check confidence scores
  • Verify extracted entities
  • Confirm key points accuracy
  • Report errors to improve AI

Privacy Considerations

Data Processing:

  • AI analyzes document content (requires reading)
  • Content sent to AI provider (Google Gemini)
  • No data retained by AI provider
  • All processing encrypted end-to-end

Sensitive Documents:

  • Review terms before analyzing confidential documents
  • Disable AI for highly sensitive vaults
  • Use manual classification for top-secret materials
  • Audit AI operations via SettingsAI Logs

Troubleshooting

Low Confidence Scores

Problem: AI returns low confidence (<0.70) for classification.

Solutions:

  • Improve scan quality: Re-scan at higher resolution
  • Use OCR: Convert image PDFs to searchable text PDFs
  • Provide context: Add title and description manually first
  • Manual review: Classify manually when AI is uncertain

Incorrect Classification

Problem: AI misclassifies document (e.g., invoice classified as contract).

Solutions:

  • Review extracted text: Ensure OCR read document correctly
  • Check document format: Standard formats work better
  • Report error: SettingsReport AI Issue (helps improve AI)
  • Manual override: Change classification manually

Missing Entities

Problem: AI doesn't extract expected entities (names, dates, amounts).

Solutions:

  • Check text readability: Ensure document text is machine-readable
  • Formatting issues: Entities in tables/images may not extract well
  • Language support: Entities in non-English text may be missed
  • Manual addition: Add entities manually via Edit Metadata

Slow Processing

Problem: AI analysis takes longer than expected.

Solutions:

  • Document size: Large documents (100+ pages) take longer
  • Queue position: Multiple requests may queue
  • API limits: Rate limits may slow processing
  • Check status: SettingsAI Operations to monitor progress

FAQ

How accurate is AI classification?

85-95% accuracy for standard document types (contracts, invoices, legal documents). Accuracy varies based on document quality, format, and language.

Can AI read handwritten documents?

Limited support. AI can read clear handwriting but accuracy is 60-70%. For best results, use typed/printed documents or OCR-processed scans.

Does AI work with images?

Yes, but images must contain readable text. AI extracts text via OCR, then analyzes. Photos of documents work if text is clear and legible.

What file formats are supported?

  • PDF: Best support (native and scanned)
  • Word: .docx, .doc
  • Images: .jpg, .png, .tiff (OCR processed)
  • Text: .txt, .md
  • Other: .rtf, .odt (limited)

Can I customize AI categories?

Not yet. AI uses predefined categories. Coming Soon: Custom categories for Enterprise customers.

How do I disable AI for specific vaults?

SettingsVaults → Select vault → AI SettingsDisable AI Analysis

Documents in this vault will not be analyzed by AI.

Can I re-analyze a document?

Yes. Documents → Select document → AI AnalysisRe-analyze

Useful if AI analysis failed or you uploaded a new version.

Are AI results auditable?

Yes. All AI operations are logged:

  • SettingsAI Logs shows all AI operations
  • Export logs for compliance (JSON, CSV)
  • Enterprise: Integrate logs with SIEM

API Access

Classify Document via API

curl -X POST https://api.torvus security.com/v1/ai/classify \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"documentId": "doc_abc123",
"vaultId": "vault_xyz789"
}'

Response:

{
"category": "contract",
"confidence": 0.95,
"summary": "Employment agreement between TechCorp and Jane Smith...",
"keyPoints": ["Salary: $150K", "Start date: Jan 15, 2024"],
"entities": {
"names": ["TechCorp", "Jane Smith"],
"dates": ["2024-01-15"],
"amounts": ["$150,000"]
},
"metadata": {
"provider": "gemini",
"model": "gemini-2.0-flash-exp",
"processingTime": "1.2s"
}
}

Getting Help

Support Channels:

Resources:


Last Updated: October 15, 2025