Methodology

How HAM scores tip control.

Every attempt is a short video of four progressive shapes. The platform turns it into four numbers — and a percentile that tells a trainee whether they're at trainee, competent, or expert level.

The platform

A trainee records an attempt, uploads it, and gets a validated AI report within minutes. The video below is a real Shape 1 attempt on the platform.

Anonymised platform footage from a HAM-AI training session at UZ Gent. This attempt scored in the Competent tier — a typical mid-range result that maps to the P33–P67 band described below.

Inside the AI report

For comparison, here is the same Shape 1 task scored in the Expert tier. Same shape, same metrics — different result.

HAM platform attempt page: endoscope feed and AI analysis results, Expert tier example
1
2
3
4
5
  1. 1Endoscope feed, masked to scope FOV
  2. 2Validated badge — attempt accepted by AI
  3. 3Overall percentile + tier
  4. 4Speed and Stability tiles with z-scores
  5. 5Download for review

Trainee workflow: record → upload → AI scores within minutes → validate or contest. All attempts are stored against the trainee profile so improvement over time is visible.

What HAM measures

Four metrics, each derived directly from the recorded video. Together they form the percentile and tier.

Speed (mm/s)

Distance the tip travels per unit active time. Reported in two flavours so trainees see the difference between raw movement and movement with intent.

  • Test Speed — raw distance / total time.
  • Intention Speed — distance / active time, with idle frames stripped.
Example: Test Speed 0.84 mm/s · Intention Speed 1.06 mm/s

Stability (%)

Share of frames where tip jitter falls below the expert threshold. Decomposed into three bands so trainees can see where stability breaks down.

Unstable
5%
Moderate
7%
Stable
88%
Example: 92% stable overall, with the distribution above.

Time on line (%)

Fraction of active time the tip lies within the stencil's blue track. A pure accuracy metric — were you where you were supposed to be?

77%

Duration

Total Test Time vs Active Intention Time. The gap between them reveals hesitation patterns invisible to the naked eye.

Total Test Time
2:10
Active Intention Time
1:43
HAM AI Analysis Results detail — expanded metrics
The expanded Detailed Metrics view from the platform, showing the same four metrics broken down for review.

How the AI works

No black box. Four stages, each with a defined input and output.

1

Frame extraction

Video sampled at 30 fps and masked to the endoscope field of view. Vendor overlays are ignored.

2

Tip detection

A custom computer-vision model trained on ham-specific imagery locates the scope tip in every frame.

3

Line tracking

The blue stencil is segmented; the distance from the tip to the nearest line is computed per frame.

4

Aggregation and z-scoring

Per-frame data is aggregated into the four metrics, then z-scored against a reference cohort of experienced endoscopists.

Benchmarking and tiers

Each metric is z-scored against a reference cohort of experienced endoscopists. The z-score becomes a percentile, and the percentile lands the attempt in one of three tiers.

  • Trainee — below the 33rd percentile.
  • Competent — between the 33rd and 67th percentile.
  • Expert — above the 67th percentile.

Tiers are computed per shape, because shape difficulty varies. A trainee can be Expert on Shape 1 and Competent on Shape 4 — and that's the data the platform surfaces.

TraineeCompetentExpertP33P67

Validated against real endoscopists

HAM stratifies trainees from interventional specialists. AI scoring shows comparable performance to human raters. Three peer-reviewed studies — and a featured talk at ESGE Days 2026 — back the method.

Read the research →

Try it on your next attempt.

Open Platform →