Logikol
Privacy Vault · Document Intelligence

Data infrastructure that knows how to PROTECT.

One platform that detects and tokenizes sensitive data before it ever reaches a model — and turns any document into structured, LLM-ready output. Sensitive data never leaves your tenancy in plaintext. Your AI tooling never stops working.

100%
Field-level extraction accuracy on benchmark set
Zero
Plaintext that ever leaves your tenancy
BYOK
Customer-held encryption keys
1 call
Single sync API. No polling.
The problem

Their team won't find out for six months. By then the SSN, the policy number, the K-1 income figure — whatever it is — is in a chat log on a server somewhere, indexed into another company's training corpus, and sitting in three places no one can reach to delete it from.

GDPR
Up to 4% of global revenue
HIPAA
$50K – $1.5M per violation tier
GLBA / IRS Pub 1075
License & contract loss

Specific penalties vary by statute, jurisdiction, and circumstances. A single incident can trigger multiple regulators at once.

One platform · Two pillars

Vault, then extract. Or extract, then vault. Either way — your secrets stay yours.

Logikol pairs a privacy-first data vault with a high-precision document extraction engine. Build AI features and data pipelines on top of either — or both — without ever shipping plaintext PII to a model provider.

Use them independently — or chain them. Document Intelligence parses, the Vault tokenizes, and your AI never sees a name.
How it works

Plaintext goes in.
Surrogates go out.

Every document, record, or message that touches Logikol passes through a four-stage gateway before reaching any downstream consumer. Reversible only inside your tenancy, with your keys.

Source
docs · forms · streams · tickets
Logikol Gateway
detect → tokenize → encrypt → log
Any consumer
LLM · analytics · partner · warehouse
01 · Detect
02 · Tokenize
03 · Encrypt + split
04 · Sign + log

Six detection layers run in parallel. Format-preserving tokens look real to your downstream tooling. The encrypted originals sit behind customer-held keys in split storage, separate from the surrogates.
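As a rough illustration of the fourth stage (sign + log), the Python sketch below writes audit entries that are each signed and chained to the previous one, so that editing or dropping an entry becomes detectable. HMAC-SHA256, the chaining scheme, the entry layout, and the names `sign_entry` and `verify_log` are all assumptions made for this example; the page does not describe how Logikol actually signs its audit trail.

```python
import hashlib
import hmac
import json


def sign_entry(key: bytes, prev_sig: str, event: dict) -> dict:
    """Sign one audit event, binding it to the previous entry's signature."""
    # Canonical JSON so the same event always serializes to the same bytes.
    payload = json.dumps(event, sort_keys=True, separators=(",", ":"))
    sig = hmac.new(key, (prev_sig + payload).encode(), hashlib.sha256).hexdigest()
    return {"event": event, "prev": prev_sig, "sig": sig}


def verify_log(key: bytes, entries: list) -> bool:
    """Recompute every signature in order; any edit or gap breaks the chain."""
    prev = ""
    for entry in entries:
        expected = sign_entry(key, prev, entry["event"])["sig"]
        if entry["prev"] != prev or not hmac.compare_digest(expected, entry["sig"]):
            return False
        prev = entry["sig"]
    return True


# Hypothetical usage: the key and event fields are placeholders.
key = b"demo-signing-key"
log, prev = [], ""
for event in [
    {"stage": "tokenize", "token": "[ssn_91c]"},
    {"stage": "encrypt", "token": "[ssn_91c]"},
]:
    entry = sign_entry(key, prev, event)
    log.append(entry)
    prev = entry["sig"]

print(verify_log(key, log))
```

Note that the log entries in this sketch carry only surrogates, never the original values, which matches the page's claim that prompts and completions are logged with surrogates only.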

Pillar I · Privacy Vault

Same task. Same answer.
Different exposure.

Your team uses ChatGPT, Claude, or your in-house model exactly the way they do today. The Vault sits in front. Sensitive fields are tokenized inline. The model sees structure but no secrets — and answers the same question.

direct to LLM
RAW
You
Summarize this customer record and flag risk: Maria Lopez, 412-55-9180, account 8801-4421-3092, balance $48,200.
Model
Customer Maria Lopez shows a healthy balance trend. No fraud signals on the most recent 72 transactions. Risk: low.
✗ Name + SSN + account number stored in provider logs · cannot be recalled
via Logikol Gateway
VAULTED
You
Summarize this customer record and flag risk: [name_a3f], [ssn_91c], account [acct_4de], balance $48,200.
Model
Customer [name_a3f] shows a healthy balance trend. No fraud signals on the most recent 72 transactions. Risk: low.
✓ No PII left your tenancy · prompt + completion logged with surrogates only
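To make the RAW versus VAULTED comparison above concrete, here is a minimal Python sketch that swaps detected values for bracketed surrogates before a prompt leaves the tenancy and restores them in the model's reply. `ToyVault`, `vault_prompt`, the random six-character suffix, and the in-memory mapping are assumptions for illustration only; they are not Logikol's implementation, and a real vault would keep the mapping encrypted under customer-held keys. Detection is taken as given here: the `findings` list stands in for whatever the detection layers return.

```python
import re
import secrets

TOKEN_RE = re.compile(r"\[[a-z]+_[0-9a-f]{6}\]")


class ToyVault:
    """In-memory surrogate store. Hypothetical, for illustration only."""

    def __init__(self):
        self._token_for = {}  # (kind, value) -> surrogate
        self._value_for = {}  # surrogate -> original value

    def tokenize(self, kind, value):
        """Return a stable surrogate for this value, creating one if needed."""
        key = (kind, value)
        if key not in self._token_for:
            token = f"[{kind}_{secrets.token_hex(3)}]"
            while token in self._value_for:  # avoid rare suffix collisions
                token = f"[{kind}_{secrets.token_hex(3)}]"
            self._token_for[key] = token
            self._value_for[token] = value
        return self._token_for[key]

    def detokenize(self, text):
        """Swap every known surrogate in the text back to its original."""
        return TOKEN_RE.sub(
            lambda m: self._value_for.get(m.group(0), m.group(0)), text
        )


def vault_prompt(vault, prompt, findings):
    """Replace each detected (kind, value) pair with its surrogate."""
    for kind, value in findings:
        prompt = prompt.replace(value, vault.tokenize(kind, value))
    return prompt


vault = ToyVault()
raw = ("Summarize this customer record and flag risk: Maria Lopez, "
       "412-55-9180, account 8801-4421-3092, balance $48,200.")
findings = [("name", "Maria Lopez"), ("ssn", "412-55-9180"),
            ("acct", "8801-4421-3092")]

safe = vault_prompt(vault, raw, findings)
print(safe)                    # what the model provider would see
print(vault.detokenize(safe))  # restored inside the tenancy
```

Random suffixes are used rather than a hash of the value because a short unsalted hash of an SSN can be brute-forced, which would defeat the point of the surrogate.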
Detection

Six layers. One document. Nothing slips through.

No single detector finds everything. Run together, they leave nothing for an LLM to leak — across structured fields, free text, scanned images, and inferred identifiers.

Patterns
regex · checksums · Luhn · ABA
Schema
column types · field tags
NER
names · orgs · locations
Vision
stamps · signatures · faces
Context
neighborhood inference
Custom rules
your domain dictionary
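The Patterns layer lists regex, checksums, Luhn, and ABA. The Python sketch below shows why checksums matter there: a regex alone flags any long digit run, while the Luhn check (payment cards) and the ABA routing-number check filter out most false positives. The two checksum algorithms are the standard public ones; the function names and the candidate regex are assumptions for this example, not Logikol's detector.

```python
import re

# Candidate card numbers: 13 to 19 digits, optionally separated by spaces or hyphens.
CARD_RE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_valid(number: str) -> bool:
    """Luhn check: double every second digit from the right, sum, mod 10."""
    digits = [int(c) for c in number if c.isdigit()]
    if len(digits) < 2:
        return False
    total = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def aba_valid(routing: str) -> bool:
    """ABA routing check: digit weights 3, 7, 1 repeating, sum divisible by 10."""
    if not re.fullmatch(r"\d{9}", routing):
        return False
    d = [int(c) for c in routing]
    total = (3 * (d[0] + d[3] + d[6])
             + 7 * (d[1] + d[4] + d[7])
             + (d[2] + d[5] + d[8]))
    return total % 10 == 0


def find_card_numbers(text: str) -> list:
    """Regex proposes candidates; the Luhn check disposes of false positives."""
    return [m.group(0) for m in CARD_RE.finditer(text) if luhn_valid(m.group(0))]


text = "Card 4111 1111 1111 1111 on file, ref 1234 5678 9012 3456."
print(find_card_numbers(text))
print(aba_valid("021000021"))
```

In this example only the first number survives the Luhn filter; the reference number matches the pattern but fails the checksum.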
Encryption · BYOK

Your data, your keys. Yours to revoke.

Documents and tokenized mirrors live in split storage. Originals are encrypted with keys you hold in your KMS, not ours. Revoke access at any time and even Logikol can't open the locker.

Your KMS
AWS KMS · GCP KMS · HSM
● live
Logikol Gateway
decrypt-on-demand
● live
Encrypted vault
ciphertext at rest · zero plaintext
● live
Active. Decryption only happens with a freshly fetched key from your KMS.
Pillar II · Document Intelligence

Every field. Every number.
Every checkbox. Right.

A purpose-built extraction engine — not a general-purpose LLM squinting at a PDF. Layout-aware computer vision plus vision-language correction, designed to win on the documents general models break on: scanned, stamped, multi-column, handwritten, rotated, half-readable.

100%
Field-level accuracy on benchmark set
<3s
Median latency per page
11×
Lower cost than the leading LLM-only OCR
1 call
Single sync API. No polling.
POST /v1/extract
sync · 1 call
curl -X POST https://api.logikol.com/v1/extract \
  -H "Authorization: Bearer $LOGIKOL_KEY" \
  -F "file=@statement.pdf" \
  -F 'options={
        "output": "markdown",
        "schema": "auto",
        "tokenize_pii": true
      }'
200 OK · markdown + structured fields
LLM-ready
{
  "markdown": "# Account Statement\n\n**Holder:** [name_a3f]\n...",
  "fields": {
    "account_number": { "value": "[acct_4de]", "confidence": 0.99 },
    "balance":        { "value": 48200.00, "confidence": 1.00 },
    "statement_date": { "value": "2026-04-30", "confidence": 1.00 }
  },
  "tokens": { "[name_a3f]": "vault_ref://...", "[acct_4de]": "vault_ref://..." },
  "pages": 4,
  "ms": 1840
}
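One way a client might consume the response above is to use the per-field confidence scores to decide which values go straight into a pipeline and which go to a reviewer queue. The Python sketch below works on a dict shaped like the sample `fields` object; the `REVIEW_THRESHOLD` value and the `split_by_confidence` helper are assumptions for illustration and are not part of the Logikol API.

```python
REVIEW_THRESHOLD = 0.995  # hypothetical cut-off; tune per document type


def split_by_confidence(response: dict, threshold: float = REVIEW_THRESHOLD):
    """Split extracted fields into auto-accepted values and ones needing review."""
    accepted, needs_review = {}, {}
    for name, field in response["fields"].items():
        target = accepted if field["confidence"] >= threshold else needs_review
        target[name] = field["value"]
    return accepted, needs_review


# Same shape as the "fields" object in the sample response.
response = {
    "fields": {
        "account_number": {"value": "[acct_4de]", "confidence": 0.99},
        "balance": {"value": 48200.00, "confidence": 1.00},
        "statement_date": {"value": "2026-04-30", "confidence": 1.00},
    }
}

accepted, needs_review = split_by_confidence(response)
print("accepted:", accepted)
print("needs review:", needs_review)
```

Because `tokenize_pii` was set in the request, the account number reaches the review queue as a surrogate, so a reviewer without re-identification rights still never sees the original value.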
Why we win

On the documents general models actually fail on.

Naïve LLM OCR
  • Hallucinated values on multi-column tables
  • Drops checkboxes, stamps, signatures
  • Re-orders rows on rotated scans
  • Token cost balloons with page count
  • No field-level confidence scores
Logikol Document Intelligence
  • Layout-grounded reading order, every page
  • Vision pass catches stamps, marks, handwriting
  • Field-level confidence on every value
  • Flat per-page price — no token surprises
  • Built-in PII tokenization (works with the Vault)
Use cases

Six places this lands the day you turn it on.

Logikol is industry-agnostic data infrastructure. Wherever sensitive data crosses a trust boundary — into a model, a partner, an analyst, an agent — the Vault and Document Intelligence handle the boundary, so you don't have to.

Safe LLM ingestion

Pipe customer records, support tickets, and documents into any model — internal or third-party — without ever shipping a name, ID, or account number in plaintext.

Bulk document intake

Turn statements, claims, contracts, applications, or invoices into clean structured data. Field-level confidence, reviewer queues, no hand correction.

Regulated PII redaction

Detect and tokenize sensitive identifiers across structured rows and unstructured text — for analytics, partner sharing, support exports, anything that crosses a trust boundary.

Agent + workflow guardrails

Sit Logikol in front of multi-step agents. Inputs and outputs both pass through tokenization, with policy enforcement at each call.

Warehouse de-identification

Backfill or stream tokenized columns into Snowflake, Redshift, or BigQuery — analysts query surrogates, only authorized roles re-identify.

Vendor + partner data sharing

Share documents and records with offshore teams, processors, or third-party tools. They see structure and surrogates. Originals stay in your tenancy.

Deployment

Run it on our infra. Or yours.

Three deployment models. Same platform, same APIs, same SLAs — you choose where the compute and the data live. Sensitive data stays inside whichever perimeter you already trust.

Logikol Cloud

Managed multi-tenant SaaS.

We run it. Spin up in minutes, hit the API, keep your KMS keys in your account. The default for teams that want to ship before the security review finishes.

  • Hosted in our cloud, regional residency options
  • Customer-held keys (BYOK) by default
  • SLA-backed uptime
Most common for regulated buyers

Dedicated VPC

Single-tenant, isolated.

Your own dedicated Logikol stack inside our cloud. No shared compute, no shared data plane. The right answer for regulated workloads with isolation requirements.

  • Single-tenant infrastructure end-to-end
  • Private networking · VPC peering
  • Customer-controlled key rotation

Self-hosted

Runs entirely in your infra.

Deploy Logikol inside your own AWS, GCP, Azure, or on-prem environment. We ship the platform; your team operates it. Data never leaves your perimeter, period.

  • Helm charts · Terraform · air-gapped supported
  • Your VPC, your KMS, your audit pipeline
  • Zero data egress to Logikol

Same platform, same API contract across all three. Switch later without rewriting integrations — workloads can even straddle deployment modes (e.g. self-hosted Vault + Logikol Cloud Document Intelligence).

How we compare

Logikol vs. stitching it together yourself.

The alternatives aren't bad — they're just incomplete. A privacy vault alone can't parse your documents. A document extractor alone leaks PII to whatever model you point it at. And building the whole thing in-house is a year-long project that ages out the moment a regulator updates a checklist.

Capability · Logikol · Vault-only vendor · Extract-only vendor · DIY in-house
Privacy Vault — tokenize, detokenize, BYOK
Document Intelligence — layout-aware extraction
One API contract for both pillars
Inline tokenization on every extraction
Customer-held keys (BYOK) end-to-end
Self-hosted / on-prem deployment
Signed audit trail across both pillars
Time-to-first-call · Hours · Days · Days · Quarters
Cost relative to vault-only + extract-only stack · 1x · 1.4x+ · 1.6x+ · Unknown
Legend: Full · Partial / requires bolt-ons · Not supported
FAQ

Buyer questions, answered without dancing around them.

Get started

Stop pasting plaintext. Start shipping AI.

Bring your hardest document, your strictest data class, and your existing model setup. We'll show you Logikol running against your stack in a 30-minute call.

hello@logikol.com