Skip to main content

Step 5 — Apply AI Recommendations

After a simulation run, select Get AI Recommendations to analyse the current architecture and simulation results. The engine returns a prioritised, categorised set of findings.

Recommendation categories

CategoryWhat it addresses
add serviceMissing services that would materially improve performance, resilience, or cost
architectureStructural changes — async decoupling, circuit breakers, retry patterns, fan-out
modify configConfiguration adjustments on existing services — concurrency, caching, timeouts, memory
scalingAuto-scaling configuration gaps

Severity interpretation

⚠️ WARNING — address before deployment. These findings represent real failure risks or significant cost inefficiencies at the simulated load. Examples: Lambda throttling at target RPS, API Gateway as a single point of failure, missing CloudFront for a high-read API.

ℹ️ INFO — improvement opportunities that are not blocking but should be considered for production workloads. Examples: Lambda provisioned concurrency, DynamoDB on-demand mode, SQS visibility timeout tuning.

Work through WARNING items first. A common initial pattern for a Lambda-based API architecture is:

  1. Add CloudFront (WARNING) — reduces API Gateway load and latency at edge
  2. Enable Lambda Provisioned Concurrency (INFO) — eliminates cold starts under spike load
  3. Introduce Circuit Breaker (WARNING) — prevents Lambda cascade failures when downstream services degrade
  4. Implement SQS async processing (INFO) — decouples write path to protect downstream databases
  5. Configure Auto Scaling (INFO) — ensures concurrency limits grow with load

Applying recommendations

Each recommendation can be expanded to read the full rationale before accepting. When accepted:

  • New services are added to the canvas automatically
  • New connections are wired according to the recommended topology
  • Configuration changes are applied to the relevant node
Re-simulate after each change

After applying recommendations, re-run the simulation before accepting the next batch. This confirms the change had the expected effect on latency, cost, and health — and may reveal new recommendations that were not visible in the previous state.

Recommendation best practices

  • Do not accept all recommendations in a single batch. Apply WARNING items, re-simulate, then review INFO items in the new state.
  • Read the rationale before accepting. Recommendations are contextual to your architecture — understanding why a change is being proposed helps you assess whether the trade-off is appropriate for your use case.
  • Use Refresh Recommendations after significant canvas changes to ensure the engine is analysing the current state.
  • Dismiss irrelevant recommendations explicitly. This keeps your recommendation history accurate and ensures future sets are relevant.
  • API Gateway throttling that persists after other fixes requires a specific response — see Pattern 6.