IDA · Information Driven Automation

Documents in.
Governed metadata out.

A small team of AI agents that classify your documents, extract structured metadata, and stitch it to your business glossary.

Your governance team does the thinking. IDA does the typing.

How It Works

Five agents. One pipeline. Your glossary, populated.

01

Collect

Drop in SDLC documents — PDFs, Word, Excel, CSV. No connectors required.

02

Classify

The Librarian agent reads each document, tags type and SDLC stage, and scores trustworthiness.

03

Extract

Specialised agents populate inventories — systems, pipelines, logical and physical data elements.

04

Stitch

An embeddings model proposes matches between business glossary terms and database columns.

05

Govern

Your team reviews and approves matches. The verified glossary becomes the source of truth.

Inventories

Four inventories. One source of truth.

Everything IDA extracts lands in the right place — systems and pipelines on one side, the data elements they carry on the other.

Inventories

Asset management

993 items

Click any inventory to manage

Tools

What you do with the inventories.

Inventories are the raw material. The tools are how you turn them into a verified glossary, a lineage graph, and the trust signals your governance team actually needs.

Tools

Data utilities

3 available

Tools draw on the inventory data above

Capabilities

Built for the messy reality of enterprise data.

01

Librarian Agent

Automatically classifies documents by SDLC stage, assigns trust scores, and routes to specialist extractors.

02

Extraction Agents

Domain-expert AI agents that understand technical specs, requirements docs, and data dictionaries.

03

Semantic Stitching

Sentence transformers find matches between business terms and physical database elements automatically.

04

Global Glossary

Deduplicated single source of truth with full lineage back to source documents.

05

Lineage Graphs

Interactive visualization connecting systems, pipelines, and data elements end-to-end.

06

Human Oversight

Review, approve, and audit every AI decision. You stay in control.

Two Modes

Run it in bulk. Or one document at a time.

Use bulk for scale. Use chat for the documents you don't want to leave to a job queue.

Bulk

Bulk processing

Automated at scale.

Drop in hundreds of documents and let the pipeline run. Classification, extraction, stitching — all happen in the background. Approve in batches when you're ready.

  • Hundreds of documents in one run
  • Background processing queue
  • Batch approval workflows
Chat

Chat-driven

One document, total control.

Walk through a single complex document with the agent. Ask follow-up questions, refine extractions in real time, and commit only what you've reviewed.

  • Deep-dive on complex documents
  • Ask follow-up questions
  • Fine-tune before committing

Or combine both. Bulk for the long tail, chat for the documents that matter.

The Numbers

A different league.

What we estimate based on benchmark data of typical mid-tier banks vs. an IDA pilot. Your mileage will vary.

Faster than scanners
96%Lower implementation cost
£0Connector licensing fees
ManualScannersIDA
Time to BAU5 years2 years6 months
Cost to BAU£52M£31M£1.25M
Accuracy50%85%80%
The Pilot

Three months. Your documents. Real metadata.

A focused engagement to prove IDA on your real estate. We bring the agents, you bring the documents, and at the end you have a populated glossary you can keep.

Numbers below are from a recent pilot with a UK bank.

600Documents processed
240Systems inventoried
2,000+Logical data elements extracted
2,400+Physical data elements mapped
Under the Hood

Modern stack. Boringly reliable.

Frontend

  • React
  • Next.js
  • Tailwind

Backend

  • Lambda
  • API Gateway
  • AppSync

Storage

  • S3
  • DynamoDB
  • Cognito

AI/ML

  • Claude
  • Transformers
  • MiniLM
See It Live

See it on your documents.

We'll run IDA over a few of your real files in a 30-minute call. No setup, no commitment.

Book a demo