Platform Capabilities

Built for Defensibility.

Every feature designed with lit-support professionals in mind. Precision search, automated privilege detection, and court-ready audit trails.

Search Intelligence

Multi-modal retrieval for maximum recall

Hybrid Search

DEFAULT

BM25 lexical + semantic vectors combined via Reciprocal Rank Fusion. Catches both exact terms and conceptual matches.

PrecisionTBD%
Latency100-300ms
Best ForStandard document discovery

RAG Fusion

ADVANCED

Multi-query expansion generates 4 search variants, runs parallel hybrid searches, then fuses results across all queries.

PrecisionTBD%
Latency300-600ms
Best ForComplex legal queries

Full Pipeline

PRECISION

Complete pipeline with LLM reranking. Gemini 3 Flash scores and reasons about each candidate for maximum precision.

PrecisionTBD%
Latency600-1200ms
Best ForEvidence packets, smoking guns

Accuracy validation in progress. We're running comprehensive benchmarks on the Enron corpus to provide verified precision metrics. Check back soon for validated accuracy claims.

AI Triage Engine

Gemini 3-powered document classification

Four-Way Classification

Every document receives a decisive classification with confidence scoring and mandatory source citations.

KEEPHigh relevance, clear value for review
REVIEWUncertain relevance, requires human judgment
DROPIrrelevant, system files, duplicates
INSUFFICIENTNot enough content to classify

Citation Enforcement

// Every AI decision requires source citation
decision: "KEEP"
confidence: 0.94
excerpts: [
"The merger discussions with Enron..."
offset: 1247-1298
]
No citations = forced REVIEW status

Document Processing

Forensic-grade extraction pipeline

Container Extraction

ZIP, PST, MSG with full recursive extraction. Zip bomb protection via compression ratio analysis.

ZIPPSTMSGRAR

Document Processing

Native text extraction from PDFs, Office documents, and 100+ file types with metadata preservation.

PDFDOCXXLSXPPTX

Tiered OCR

Tesseract baseline with Gemini 'hard lane' for difficult pages. Automatic hardness scoring promotes bad OCR to LLM.

Scanned PDFsImagesFaxes

Email Threading

Header-aware chunking preserves conversation context. Attachments extracted and processed recursively.

MSGEMLMBOX

Privilege Guard

Automated attorney-client detection

Catch privilege risks before they ever hit an external platform. Pattern-based detection combined with entity recognition flags potential attorney-client communications for human review.

Attorney name detection
Law firm domain matching
Legal advice language patterns
Work product indicators
Litigation hold keywords
Custom privilege lexicons

Review Queue

Email_Thread_492.msgHIGH
Merger_Draft_v3.docxMEDIUM
Board_Notes.pdfHIGH

Forensic Defensibility

Court-ready audit infrastructure

Content-Addressed Storage

SHA256 deduplication ensures identical files are stored once. Full provenance chain from source to artifact.

Immutable Audit Logs

Append-only, cryptographically chained logs. Every action timestamped and attributed for court-ready defensibility.

Citation Enforcement

Every AI decision MUST cite source text. No citations = forced human review. Zero tolerance for hallucinations.

Model Versioning

Every AI output tracked with model ID, version, prompt hash, and token counts. Full reproducibility.

Export to Your Stack

Generate load files compatible with every major review platform. Your data, your workflow.

RELATIVITY
EVERLAW
DISCO
NUIX
CSV