AI Features

Cost-saving playbook

Concrete settings that drop the credit cost of a test without sacrificing reliability. Memory on, the cheapest segmentation mode that still works, and a checklist for auditing existing tests.

If your credit usage is climbing faster than your test count, you're probably leaving free wins on the table. This page is a checklist of concrete settings — every recommendation traces back to a specific toggle or mode documented elsewhere in the manual.

The three high-leverage levers

1. Leave Position Memory on for stable taps

Default is on; if you've turned it off, ask why. The toggle's own description spells out the right rule: "Only enable for elements with stable, fixed positions (e.g., navigation buttons)." Most taps in a typical test qualify.

See Position Memory.

2. Leave Assert Memory and List Bounds Memory on

Both default on. The downside of leaving them on for a non-stable target is essentially nil — a memory miss costs the same as having had memory off.

See Assert Memory & List Bounds Memory.

3. Pick the cheapest segmentation mode that still works

The AI Segmentation picker only appears in Tap and Scroll editors. Three modes, from cheapest to most expensive:

  • None — on-device detection only.
  • Auto — reuses a cached result when the screen still matches, refreshes when it doesn't.
  • Force Fresh — full resolution on every run.

Try None first. If on-device detection finds your target, you spend nothing on segmentation for that step. If None misses, switch to Auto. Reserve Force Fresh for debugging — it's the most expensive mode.

See AI Segmentation modes.

Audit checklist for existing tests

Run through any test that's costing more than you'd expect:

  • Are all stable Tap steps' Position Memory toggles on?

  • Are all Visual Assert steps' Assert Memory toggles on (unless the screen genuinely changes between runs)?

  • Are all Scroll steps' List Bounds Memory toggles on?

  • Are any Tap or Scroll steps stuck on Force Fresh when None or Auto would work?

  • Are any Tap steps on Auto when None would work just as well?

  • Is Run Count set higher than necessary on any Run Test step?

  • Is self-healing set to Full Test when Per Step would do?

What about per-test cost analytics?

Per-test cost is available in both the Mavster macOS app and the web app's project analytics view. Use it to identify the highest-cost tests first — focusing the playbook on the top quartile of spend usually returns most of the savings.