ISM-2090, ASD Information Security Manual (ISM)

Rate Limiting for AI Model Inference Queries

Limit how often AI inference queries can be run, to prevent overuse and keep the system responsive.


Plain language

Rate limiting means setting limits on how often AI systems are allowed to process requests. This is important because without limits, the system could become overloaded, slow down, or even crash, causing disruptions and potentially leading to mistakes in important tasks.

Framework

ASD Information Security Manual (ISM)

Control effect

Preventative

Classifications

NC, OS, P, S, TS

ISM last updated

Nov 2025

Control Stack last updated

19 Mar 2026

E8 maturity levels

N/A

Official control statement

Rate limiting is applied to inference queries for artificial intelligence models.

Why it matters

Without rate limiting, AI inference APIs can be abused, driving up compute costs and causing service degradation or denial of service for legitimate users.


Operational notes

Monitor inference request rates and tune limits per client and per model. Log and review HTTP 429 (Too Many Requests) events to detect abuse, and adjust thresholds so that legitimate use is not blocked.
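
As an illustration only, the notes above could be sketched as a token-bucket limiter keyed by client and model. The ISM does not prescribe an algorithm; the token bucket is one common choice, and every name here (`TokenBucket`, `InferenceRateLimiter`, the client and model identifiers, the limit values) is a hypothetical assumption rather than part of the control.

```python
import logging
import time
from collections import Counter

log = logging.getLogger("inference.ratelimit")


class TokenBucket:
    """Allows bursts up to `capacity` and a sustained `refill_per_sec` rate."""

    def __init__(self, capacity, refill_per_sec, now):
        self.capacity = float(capacity)
        self.refill_per_sec = float(refill_per_sec)
        self.tokens = float(capacity)  # start with a full bucket
        self.updated = now

    def allow(self, now):
        # Add the tokens earned since the last call, capped at capacity.
        elapsed = max(0.0, now - self.updated)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_sec)
        self.updated = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


class InferenceRateLimiter:
    """Hypothetical per-(client, model) limiter that tallies 429 rejections."""

    def __init__(self, limits, default, clock=time.monotonic):
        self.limits = limits      # {(client_id, model): (capacity, refill_per_sec)}
        self.default = default    # (capacity, refill_per_sec) for unlisted pairs
        self.clock = clock        # injectable so the limiter can be tested
        self.buckets = {}
        self.rejections = Counter()  # reviewed when tuning thresholds

    def check(self, client_id, model):
        """Return the HTTP status the inference endpoint should respond with."""
        key = (client_id, model)
        now = self.clock()
        bucket = self.buckets.get(key)
        if bucket is None:
            capacity, rate = self.limits.get(key, self.default)
            bucket = self.buckets[key] = TokenBucket(capacity, rate, now)
        if bucket.allow(now):
            return 200
        self.rejections[key] += 1
        log.warning("429 rate limit exceeded client=%s model=%s", client_id, model)
        return 429
```

In this sketch, `rejections` gives a simple per-client, per-model count of 429 events. A pair that is rejected constantly may indicate abuse, while occasional rejections of a known legitimate client may suggest that its limit is set too low.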

Mapping detail

Mapping

Direction

Controls