Find the insights and best practices about our product.
Arena Models

The Arena Models endpoint returns all AI models discovered and tested by Cranium across your tenant. Use this endpoint as a model catalog when correlating model identifiers across endpoints, and as a reference for model metadata when integrating Arena data with downstream systems.

List Arena Models

Returns a paginated list of models cataloged in the Arena.

Request:

GET /api/public/arena/models

Authorization:

Bearer token. See Authentication & Generating Credentials.


Query parameters:

Response record:

json

{
"modelId": "guid",
"modelName": "llama-3-8b-instruct",
"modelFamilyId": "guid",
"modelFamilyName": "Llama 3",
"modelSource": "HuggingFace | Internal",
"modelType": "LLM | DL | TradML",
"referenceUrl": "https://huggingface.co/...",
"modelStatus": "string",
"severity": "Critical | High | Medium | Low | None",
"averageAttackSuccessRatio": 0.42,
"attackCategoriesVulnerableCount": 3,
"weaknessCount": 5,
"knownRisksCount": 2,
"automatedTestCount": 10,
"manualTestCount": 1,
"aiSystemCount": 2,
"penTestCompletedAt": "2026-03-15T00:00:00Z | null",
"discoveredAt": "2026-01-05T00:00:00Z",
"updatedAt": "2026-04-10T00:00:00Z"
}

Field Definitions

  • modelId: Unique identifier for the model. Stable across updates. Use this value to correlate with modelId on the Arena Model Vulnerabilities endpoint.
  • modelName: Human-readable name of the model as displayed in the Arena.
  • modelFamilyId: Identifier of the model family. Models that share a base architecture (for example, multiple Llama 3 variants) share a modelFamilyId.
  • modelFamilyName: Display name of the model family.
  • modelSource: Origin of the model. HuggingFace for models cataloged in HuggingFace Hub, Internal for organization-specific models.
  • modelType: AI type classification. LLM for large language models, DL for deep learning models, TradML for traditional machine learning models.
  • referenceUrl: URL to the model's reference page (for example, the HuggingFace model card).
  • modelStatus: Current status of the model. One of Untested, Queued, InTesting, Completed, Failed, NotTestable.
  • severity: Highest severity classification across the model's tested attack categories. One of Critical, High, Medium, Low, None.
  • averageAttackSuccessRatio: Average ratio of successful attacks across all tested attack categories. Expressed as a decimal between 0 and 1. A value of 0.42 indicates that 42 percent of attack attempts succeeded on average.
  • attackCategoriesVulnerableCount: Number of attack categories in which the model was found vulnerable.
  • weaknessCount: Number of identified weaknesses associated with the model.
  • knownRisksCount: Number of known risks documented for the model.
  • automatedTestCount: Number of automated penetration tests run against the model.
  • manualTestCount: Number of manual penetration tests run against the model.
  • aiSystemCount: Number of AI Systems in your tenant that include this model.
  • penTestCompletedAt: Timestamp at which the most recent penetration test completed. Null when no test has been run. ISO 8601 format.
  • discoveredAt: Timestamp at which the model was first cataloged in the Arena. ISO 8601 format.
  • updatedAt: Timestamp at which the record was last updated. ISO 8601 format. This is the field the sync cursor tracks.
Did this answer your question?