AI Workflow Readiness Checklist
The checklist our team runs through before any AI system goes to production. Scored: ✅ Done (2pts) · ⚠️ Partial (1pt) · ❌ Missing (0pts) · N/A (excluded).
Want ShipSmith to run this automatically?
Connect your repo and we'll score every AI workflow against these 115 controls — no manual work required.
How we conduct the assessment
Pre-work call
30 min
We align on scope, identify which dimensions apply to your situation, and share a preparation checklist.
Document review
2–3 days
You share relevant artefacts. We review asynchronously and pre-score what we can from documentation alone.
Guided review session
2–3 hrs
We walk through each dimension together. You answer questions, we score in real time, flag gaps, and capture context.
Scored report
2 days after
You receive a full scored report with dimension breakdown, prioritised recommendations ranked by ROI and effort.
What to prepare
Data Foundation
Data source inventory, pipeline docs, data quality reports, sample schema, lineage documentation
Model & Architecture
Model cards, prompt catalog, model registry records, benchmarking results, customisation decision log
Evaluation & QA
Eval dataset samples, automated eval pipeline, fairness test results, red-team exercise report
Observability & Monitoring
Observability dashboards, trace samples, alert configurations, compliance audit trail
Resilience & Production Engineering
Architecture diagram, load test results, rollback runbook, canary deployment config, CT pipeline docs
Security & Compliance
Access control policy, compliance certifications, incident response plan, RAI review records
Cost Management
Cost dashboards, token budget configs, model routing policy, cost model at scale
Adoption & Change Management
Org chart (AI team), runbook, adoption metrics, change management plan, Responsible AI scorecard
AI Governance & Risk Management
Org AI policy document, impact assessment records, named risk owner, vendor AI contracts, incident log
Grounded in established standards
Our checklist is not proprietary — it synthesises six established industry frameworks into a single actionable assessment. Each item is traceable back to a specific control or best practice in at least one of these sources.
NIST AI RMF
2023Four core functions: GOVERN, MAP, MEASURE, MANAGE. The gold standard for AI risk management as an organisational discipline.
AWS GenAI Lens
2025Best practices for LLM/RAG/agent systems — excessive agency controls, prompt catalog governance, multi-agent tracing, bias drift monitoring.
AWS ML Lens
2025Model registry gating, data poisoning protection, environment parity, and feature attribution drift monitoring.
Microsoft RAI
2022Fairness evaluation (Fairlearn), model cards, error cohort analysis, content safety layers, and Discover-Protect-Govern deployment framework.
Google MLOps
2021Three maturity levels (0→2) covering training-serving skew, feature stores, continuous training pipelines, and ML pipeline component testing.
OWASP LLM Top 10
2023The ten most critical security risks for LLM applications — including prompt injection (#01), excessive agency (#08), and supply chain vulnerabilities (#05).
Data Foundation
Model & Architecture
Evaluation & QA
Observability & Monitoring
Resilience & Production Engineering
Security & Compliance
Cost Management
Adoption & Change Management
AI Governance & Risk Management
Skip the manual checklist.
Let ShipSmith score your workflows automatically.
Connect your repo and our AI agent discovers every AI workflow and scores it against these 115 controls — in under 10 minutes. Free for your first workflow.