Privacy Vault · Document Intelligence

Data infrastructure that knows how to PROTECT.

One platform that detects and tokenizes sensitive data before it ever reaches a model — and turns any document into structured, LLM-ready output. Sensitive data never leaves your tenancy in plaintext. Your AI tooling never stops working.

Request a demo · See the architecture

100%

Field-level extraction accuracy on benchmark set

Zero

Plaintext that ever leaves your tenancy

BYOK

Customer-held encryption keys

1 call

Single sync API. No polling.

The problem

Their team won't find out for six months. By then the SSN, the policy number, the K-1 income figure, or whatever it was, is sitting in a chat log on a server somewhere, indexed into another company's training corpus, and copied to three places no one can reach to delete it.

GDPR

Up to 4% of global revenue

HIPAA

$50K – $1.5M per violation tier

GLBA / IRS Pub 1075

License & contract loss

Specific penalties vary by statute, jurisdiction, and circumstances. A single incident can trigger multiple regulators at once.

One platform · Two pillars

Vault, then extract. Or extract, then vault. Either way — your secrets stay yours.

Logikol pairs a privacy-first data vault with a high-precision document extraction engine. Build AI features and data pipelines on top of either — or both — without ever shipping plaintext PII to a model provider.

Pillar I · Privacy Vault

Sensitive data, isolated.
Models see only surrogates.

Detect PII as it enters your stack. Tokenize each field into a format-preserving surrogate. Encrypt the original under keys you control. Plaintext never leaves your tenancy — not to OpenAI, not to your analytics warehouse, not to your offshore partner.

Six-layer detection across structured + unstructured inputs
Bring-your-own-key (BYOK) — revoke access in one click
Format-preserving tokens that look real to downstream tools
Signed audit trail for every authorization and access event

Explore the Vault

Pillar II · Document Intelligence

Any document.
Structured, in one call.

A computer-vision + vision-language pipeline that turns scanned PDFs, images, and messy real-world documents into clean, field-level structured output. Markdown-ready for LLMs. JSON-ready for your warehouse. Sub-second per page.

Field-level extraction with full confidence scoring
Layout-aware: tables, multi-column, handwritten, stamped, rotated
Single sync API call — no polling, no multi-step workflow
Markdown + JSON outputs ready for downstream pipelines

Explore Document Intelligence

Use them independently — or chain them. Document Intelligence parses, the Vault tokenizes, and your AI never sees a name.

How it works

Plaintext goes in.
Surrogates go out.

Every document, record, or message that touches Logikol passes through a four-stage gateway before reaching any downstream consumer. Reversible only inside your tenancy, with your keys.

Source

docs · forms · streams · tickets

Logikol Gateway

detect → tokenize → encrypt → log

Any consumer

LLM · analytics · partner · warehouse

01 · Detect

02 · Tokenize

03 · Encrypt + split

04 · Sign + log

Six detection layers run in parallel. Format-preserving tokens look real to your downstream tooling. Original ciphertext sits behind customer-held keys — split-storage, separate from the surrogates.
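One way to picture the "Sign + log" stage is a hash-chained, HMAC-signed event log. The sketch below is illustrative only and is not Logikol's implementation: the function names, the event fields, and the demo signing key are all assumptions, and a production system would fetch its signing key from a KMS rather than hard-code it.

```python
import hashlib
import hmac
import json


def _sign(key, event, prev_sig):
    """HMAC-SHA256 over the event plus the previous entry's signature."""
    payload = json.dumps({"event": event, "prev": prev_sig}, sort_keys=True)
    return hmac.new(key, payload.encode("utf-8"), hashlib.sha256).hexdigest()


def append_event(log, key, event):
    """Append an event, chaining it to the previous entry's signature."""
    prev_sig = log[-1]["sig"] if log else ""
    log.append({"event": event, "prev": prev_sig, "sig": _sign(key, event, prev_sig)})


def verify_log(log, key):
    """Return True only if every signature and every chain link checks out."""
    prev_sig = ""
    for entry in log:
        if entry["prev"] != prev_sig:
            return False
        expected = _sign(key, entry["event"], entry["prev"])
        if not hmac.compare_digest(expected, entry["sig"]):
            return False
        prev_sig = entry["sig"]
    return True


signing_key = b"demo-signing-key"  # hypothetical; a real key would live in a KMS
audit_log = []
append_event(audit_log, signing_key, {"actor": "svc-llm", "action": "tokenize", "field": "ssn"})
append_event(audit_log, signing_key, {"actor": "analyst-7", "action": "detokenize", "field": "ssn"})
print(verify_log(audit_log, signing_key))
```

Because each signature covers the previous one, editing or deleting an earlier entry invalidates every entry after it, which is what makes such a log tamper-evident.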

Pillar I · Privacy Vault

Same task. Same answer.
Different exposure.

Your team uses ChatGPT, Claude, or your in-house model exactly the way they do today. The Vault sits in front. Sensitive fields are tokenized inline. The model sees structure but no secrets — and answers the same question.

direct to LLM

RAW

You

Summarize this customer record and flag risk: Maria Lopez, 412-55-9180, account 8801-4421-3092, balance $48,200.

Model

Customer Maria Lopez shows a healthy balance trend. No fraud signals on the most recent 72 transactions. Risk: low.

✗ Name + SSN + account number stored in provider logs · cannot be recalled

via Logikol Gateway

VAULTED

You

Summarize this customer record and flag risk: [name_a3f], [ssn_91c], account [acct_4de], balance $48,200.

Model

Customer [name_a3f] shows a healthy balance trend. No fraud signals on the most recent 72 transactions. Risk: low.

✓ No PII left your tenancy · prompt + completion logged with surrogates only
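The before-and-after exchange above can be approximated in a few lines. This is a minimal sketch under stated assumptions, not the Vault's API: the two regex patterns, the `tokenize` and `detokenize` helpers, and the in-memory `vault` dictionary are hypothetical, and the surrogate IDs are simple counters rather than the IDs shown on this page. Names such as "Maria Lopez" need an NER layer, which regexes alone cannot provide, so the sample input omits the name.

```python
import re

# Hypothetical patterns for illustration; real detection would combine several layers.
PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "acct": re.compile(r"\b\d{4}-\d{4}-\d{4}\b"),
}


def tokenize(text, vault):
    """Swap each match for a bracketed surrogate and record the original in `vault`."""
    for kind, pattern in PATTERNS.items():
        def swap(match, kind=kind):
            original = match.group(0)
            # Reuse the existing surrogate when the same value appears again.
            for token, value in vault.items():
                if value == original:
                    return token
            token = f"[{kind}_{len(vault):03x}]"
            vault[token] = original
            return token
        text = pattern.sub(swap, text)
    return text


def detokenize(text, vault):
    """Restore originals; meant to run only inside the trusted boundary."""
    for token, original in vault.items():
        text = text.replace(token, original)
    return text


vault = {}
prompt = "Flag risk: SSN 412-55-9180, account 8801-4421-3092, balance $48,200."
safe_prompt = tokenize(prompt, vault)
print(safe_prompt)
```

Only `safe_prompt` would be sent to the model. The completion comes back containing the same surrogates and can be passed through `detokenize` for users who are authorized to see the originals.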

Detection

Six layers. One document. Nothing slips through.

No single detector finds everything. Run together, they leave nothing for an LLM to leak — across structured fields, free text, scanned images, and inferred identifiers.

Patterns

regex · checksums · Luhn · ABA

Schema

column types · field tags

NER

names · orgs · locations

Vision

stamps · signatures · faces

Context

neighborhood inference

Custom rules

your domain dictionary
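The Patterns layer names two standard checksums, Luhn (payment card numbers) and ABA (US bank routing numbers). Both algorithms are public, and a checksum pass is a common way to cut false positives after a regex match. The sketch below shows the two checks in isolation; the function names are illustrative and say nothing about how Logikol wires them in.

```python
def luhn_valid(number: str) -> bool:
    """Luhn check: double every second digit from the right, sum, test mod 10."""
    digits = [int(c) for c in number if c.isdigit()]
    if len(digits) < 2:
        return False
    total = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def aba_valid(routing: str) -> bool:
    """ABA routing check: weights 3, 7, 1 repeated over nine digits, test mod 10."""
    if len(routing) != 9 or not routing.isdigit():
        return False
    weights = [3, 7, 1, 3, 7, 1, 3, 7, 1]
    return sum(int(c) * w for c, w in zip(routing, weights)) % 10 == 0


print(luhn_valid("4111 1111 1111 1111"))  # a widely used test card number
print(aba_valid("021000021"))
```

A sixteen-digit string that fails Luhn is probably an order ID or a tracking number rather than a card number, so the detector can skip it instead of tokenizing it.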

Encryption · BYOK

Your data, your keys. Yours to revoke.

Documents and tokenized mirrors live in split storage. Originals are encrypted with keys you hold in your KMS, not ours. Revoke access at any time and even Logikol can't open the locker.

Your KMS

AWS KMS · GCP KMS · HSM

● live

Logikol Gateway

decrypt-on-demand

● live

Encrypted vault

ciphertext at rest · zero plaintext

● live

Active. Decryption only happens with a freshly fetched key from your KMS.

Pillar II · Document Intelligence

Every field. Every number.
Every checkbox. Right.

A purpose-built extraction engine — not a general-purpose LLM squinting at a PDF. Layout-aware computer vision plus vision-language correction, designed to win on the documents general models break on: scanned, stamped, multi-column, handwritten, rotated, half-readable.

100%

Field-level accuracy on benchmark set

<3s

Median latency per page

11×

Lower cost than the leading LLM-only OCR

1 call

Single sync API. No polling.

POST /v1/extract

sync · 1 call

curl -X POST https://api.logikol.com/v1/extract \
  -H "Authorization: Bearer $LOGIKOL_KEY" \
  -F "file=@statement.pdf" \
  -F 'options={
        "output": "markdown",
        "schema": "auto",
        "tokenize_pii": true
      }'

200 OK · markdown + structured fields

LLM-ready

{
  "markdown": "# Account Statement\n\n**Holder:** [name_a3f]\n...",
  "fields": {
    "account_number": { "value": "[acct_4de]", "confidence": 0.99 },
    "balance":        { "value": 48200.00, "confidence": 1.00 },
    "statement_date": { "value": "2026-04-30", "confidence": 1.00 }
  },
  "tokens": { "[name_a3f]": "vault_ref://...", "[acct_4de]": "vault_ref://..." },
  "pages": 4,
  "ms": 1840
}
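A consumer of this response might use the per-field confidence scores to decide what goes straight to the warehouse and what goes to a reviewer queue. The sketch below parses the sample response above, keeping its elided values as-is. The `split_by_confidence` helper and the 0.995 threshold are assumptions chosen for illustration, not documented Logikol behavior.

```python
import json

# The sample response shown above; elided values ("...") are kept exactly as given.
response_body = r'''
{
  "markdown": "# Account Statement\n\n**Holder:** [name_a3f]\n...",
  "fields": {
    "account_number": { "value": "[acct_4de]", "confidence": 0.99 },
    "balance":        { "value": 48200.00, "confidence": 1.00 },
    "statement_date": { "value": "2026-04-30", "confidence": 1.00 }
  },
  "tokens": { "[name_a3f]": "vault_ref://...", "[acct_4de]": "vault_ref://..." },
  "pages": 4,
  "ms": 1840
}
'''

REVIEW_THRESHOLD = 0.995  # hypothetical cut-off; tune per document type


def split_by_confidence(fields, threshold):
    """Separate fields that can be accepted from those needing human review."""
    accepted, review = {}, {}
    for name, field in fields.items():
        target = accepted if field["confidence"] >= threshold else review
        target[name] = field["value"]
    return accepted, review


data = json.loads(response_body)
accepted, needs_review = split_by_confidence(data["fields"], REVIEW_THRESHOLD)
print("accepted:", accepted)
print("needs review:", needs_review)
```

Note that the values remain surrogates such as `[acct_4de]` throughout, so a routing step like this never handles the original account number.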

Why we win

On the documents general models actually fail on.

Naïve LLM OCR

Hallucinated values on multi-column tables
Drops checkboxes, stamps, signatures
Re-orders rows on rotated scans
Token cost balloons with page count
No field-level confidence scores

Logikol Document Intelligence

Layout-grounded reading order, every page
Vision pass catches stamps, marks, handwriting
Field-level confidence on every value
Flat per-page price — no token surprises
Built-in PII tokenization (works with the Vault)

Use cases

Six places this lands the day you turn it on.

Logikol is industry-agnostic data infrastructure. Wherever sensitive data crosses a trust boundary — into a model, a partner, an analyst, an agent — the Vault and Document Intelligence handle the boundary, so you don't have to.

Safe LLM ingestion

Pipe customer records, support tickets, and documents into any model — internal or third-party — without ever shipping a name, ID, or account number in plaintext.

Bulk document intake

Turn statements, claims, contracts, applications, or invoices into clean structured data. Field-level confidence, reviewer queues, no hand correction.

Regulated PII redaction

Detect and tokenize sensitive identifiers across structured rows and unstructured text — for analytics, partner sharing, support exports, anything that crosses a trust boundary.

Agent + workflow guardrails

Sit Logikol in front of multi-step agents. Inputs and outputs both pass through tokenization, with policy enforcement at each call.

Warehouse de-identification

Backfill or stream tokenized columns into Snowflake, Redshift, or BigQuery — analysts query surrogates, only authorized roles re-identify.

Vendor + partner data sharing

Share documents and records with offshore teams, processors, or third-party tools. They see structure and surrogates. Originals stay in your tenancy.

Deployment

Run it on our infra. Or yours.

Three deployment models. Same platform, same APIs, same SLAs — you choose where the compute and the data live. Sensitive data stays inside whichever perimeter you already trust.

Logikol Cloud

Managed multi-tenant SaaS.

We run it. Spin up in minutes, hit the API, keep your KMS keys in your account. The default for teams that want to ship before the security review finishes.

Hosted in our cloud, regional residency options
Customer-held keys (BYOK) by default
SLA-backed uptime

Most common for regulated buyers

Dedicated VPC

Single-tenant, isolated.

Your own dedicated Logikol stack inside our cloud. No shared compute, no shared data plane. The right answer for regulated workloads with isolation requirements.

Single-tenant infrastructure end-to-end
Private networking · VPC peering
Customer-controlled key rotation

Self-hosted

Runs entirely in your infra.

Deploy Logikol inside your own AWS, GCP, Azure, or on-prem environment. We ship the platform; your team operates it. Data never leaves your perimeter, period.

Helm charts · Terraform · air-gapped supported
Your VPC, your KMS, your audit pipeline
Zero data egress to Logikol

Same platform, same API contract across all three. Switch later without rewriting integrations — workloads can even straddle deployment modes (e.g. self-hosted Vault + Logikol Cloud Document Intelligence).

How we compare

Logikol vs. stitching it together yourself.

The alternatives aren't bad — they're just incomplete. A privacy vault alone can't parse your documents. A document extractor alone leaks PII to whatever model you point it at. And building the whole thing in-house is a year-long project that ages out the moment a regulator updates a checklist.

Capability	Logikol	Vault-only vendor	Extract-only vendor	DIY in-house
Privacy Vault — tokenize, detokenize, BYOK
Document Intelligence — layout-aware extraction
One API contract for both pillars
Inline tokenization on every extraction
Customer-held keys (BYOK) end-to-end
Self-hosted / on-prem deployment
Signed audit trail across both pillars
Time-to-first-call	Hours	Days	Days	Quarters
Cost relative to vault-only + extract-only stack	1x	1.4x+	1.6x+	Unknown

Full · Partial / requires bolt-ons · Not supported

FAQ

Buyer questions, answered without dancing around them.

Is the Privacy Vault required to use Document Intelligence, or vice versa?

What happens to a token if Logikol disappears?

How is detection accuracy measured? What about false positives?

Can I use Logikol with a model that I host myself?

How does pricing actually work?

How long does an integration usually take?

Can I extract sensitive documents without ever sending the file to your cloud?

Do you train models on customer data?

Get started

Stop pasting plaintext. Start shipping AI.

Bring your hardest document, your strictest data class, and your existing model setup. We'll show you Logikol running against your stack in a 30-minute call.

Request a demo · Read the security brief

hello@logikol.com