Supported foundation models in watsonx.ai

Last updated: Jul 03, 2025

You can work with third-party and IBM foundation models in IBM watsonx.ai.You can use foundation models that are provided by IBM and are ready to use immediately, or deploy foundation models on-demand to use exclusively for your organization.

How to choose a model

To review factors that can help you to choose a model, such as supported tasks and languages, see Choosing a model and Foundation model benchmarks.

Attention:

Model availability varies by data center location. For details, see Regional availability of foundation models. Deploy on demand models are only available on IBM Cloud.

Foundation models by deployment method

Depending on the deployment method, you can use foundation models on multitenant hardware directly or deploy models on dedicated hardware for use by your organization. To learn more about the various ways you can use to deploy models, see Foundation model deployment methods.

Table 1. Foundation models by deployment method
Provider	Provided with watsonx.ai (Pay per token)	Deploy on demand (Pay by the hour)
IBM	• granite-3-3-8b-instruct • granite-13b-instruct-v2 (Deprecated) • granite-8b-japanese (Deprecated) • granite-3-8b-base • granite-3-2b-instruct • granite-3-8b-instruct • granite-3-2-8b-instruct • granite-guardian-3-2b • granite-guardian-3-8b • granite-3b-code-instruct (Deprecated) • granite-8b-code-instruct • granite-20b-code-instruct (Deprecated) • granite-34b-code-instruct (Deprecated) • granite-vision-3-2-2b	• granite-3-3-8b-instruct • granite-3-1-8b-base • granite-3-3-2b-instruct • granite-3-2-8b-instruct • granite-7b-lab • granite-8b-japanese • granite-13b-chat-v2 • granite-13b-instruct-v2 (Deprecated) • granite-20b-multilingual • granite-3b-code-instruct • granite-8b-code-instruct • granite-20b-code-instruct • granite-34b-code-instruct • granite-20b-code-base-schema-linking • granite-20b-code-base-sql-gen
Google	• flan-t5-xl-3b (Deprecated) • flan-t5-xxl-11b (Deprecated) • flan-ul2-20b (Deprecated)	• flan-t5-xl-3b (Deprecated) • flan-t5-xxl-11b (Deprecated) • flan-ul2-20b (Deprecated)
Meta	• llama-4-maverick-17b-128e-instruct-fp8 • llama-4-scout-17b-16e-instruct (Deprecated) • llama-3-3-70b-instruct • llama-3-2-1b-instruct • llama-3-2-3b-instruct • llama-3-2-11b-vision-instruct • llama-3-2-90b-vision-instruct • llama-guard-3-11b-vision-instruct • llama-2-13b-chat (Deprecated)	• llama-3-1-70b • llama-3-2-11b-vision-instruct • llama-3-3-70b-instruct • llama-3-3-70b-instruct-hf • llama-3-1-70b-instruct • llama-2-13b-chat • llama-2-70b-chat • llama-3-8b-instruct • llama-3-70b-instruct • llama-3-1-8b • llama-3-1-8b-instruct
Mistral AI	• mistral-small-3-1-24b-instruct-2503 • mistral-large • mistral-medium-2505 • mistral-small-24b-instruct-2501 (Deprecated) • mixtral-8x7b-instruct-v01 (Deprecated) • pixtral-12b	• mistral-large-instruct-2407 • mistral-large-instruct-2411 • mistral-nemo-instruct-2407 • mixtral-8x7b-base • mixtral-8x7b-instruct-v01
BigScience		• mt0-xxl-13b
Code Llama		• codellama-34b-instruct-hf
DeepSeek AI		• deepseek-r1-distill-llama-8b • deepseek-r1-distill-llama-70b
ELYZA, Inc	• elyza-japanese-llama-2-7b-instruct
Inception	• jais-13b-chat
SDAIA	• allam-1-13b-instruct	• allam-1-13b-instruct
Unified Transcription and Translation for Extended Reality (UTTER) project		• eurollm-1-7b-instruct • eurollm-9b-instruct
LumiOpen		• poro-34b-chat

Provided foundation models that are ready to use

A collection of open source and IBM foundation models are deployed in IBM watsonx.ai. You can prompt these foundation models in the Prompt Lab or programmatically.

For details on metering for foundation model inference in watsonx.ai, see Billing rates for inferencing foundation models. For more information about the IBM watsonx.ai service description with various cloud providers, see:

IBM foundation models

The following table lists the supported IBM foundation models that IBM provides for inferencing.

You can also access some IBM foundation models from third-party repositories, such as Hugging Face. IBM foundation models that you obtain from a third-party repository are not indemnified by IBM. Only IBM foundation models that you access from watsonx.ai are indemnified by IBM. For more information about contractual protections related to IBM indemnification, see the IBM Client Relationship Agreement.

Table 2a. IBM foundation models provided with watsonx.ai for inferencing
Model name	API model ID	Input price (USD/1,000 tokens)	Output price (USD/1,000 tokens)	Context window (input + output tokens)	More information
granite-3-3-8b-instruct	`ibm/granite-3-3-8b-instruct`	$0.0002	$0.0002	131,072	• Model card • Website
granite-13b-instruct-v2	`ibm/granite-13b-instruct-v2`	$0.0006	$0.0006	8,192	• Model card • Website • Research paper Note: This foundation model can be prompt tuned.
granite-8b-japanese	`ibm/granite-8b-japanese`	$0.0006	$0.0006	4,096	• Model card • Website • Research paper
granite-3-2b-instruct	`ibm/granite-3-2b-instruct`	$0.0001	$0.0001	131,072	• Model card • Website • Research paper
granite-3-8b-instruct	`ibm/granite-3-8b-instruct`	$0.0002	$0.0002	131,072	• Model card • Website • Research paper
granite-3-2-8b-instruct	`ibm/granite-3-2-8b-instruct`	$0.0002	$0.0002	131,072	• Model card • Website • Research paper
granite-guardian-3-2b	`ibm/granite-guardian-3-2b`	$0.0001	$0.0001	131,072	• Model card • Website
granite-guardian-3-8b	`ibm/granite-guardian-3-8b`	$0.0002	$0.0002	131,072	• Model card • Website
granite-3b-code-instruct	`ibm/granite-3b-code-instruct`	$0.0006	$0.0006	128,000	• Model card • Website • Research paper
granite-8b-code-instruct	`ibm/granite-8b-code-instruct`	$0.0006	$0.0006	128,000	• Model card • Website • Research paper
granite-20b-code-instruct	`ibm/granite-20b-code-instruct`	$0.0006	$0.0006	8,192	• Model card • Website • Research paper
granite-34b-code-instruct	`ibm/granite-34b-code-instruct`	$0.0006	$0.0006	8,192	• Model card • Website • Research paper
granite-vision-3-2-2b	`ibm/granite-vision-3-2-2b`	$0.0001	$0.0001	131,072	• Model card • Website • Research paper

Table 2b. IBM foundation models provided with watsonx.ai for forecasting future values
Model name	API model ID	Input price (USD/1,000 data points)	Output price (USD/1,000 data points)	Context length Min data points	More information
granite-ttm-512-96-r2	`ibm/granite-ttm-512-96-r2`	$0.00013	$0.00038	512	• Model card • Website • Research paper
granite-ttm-1024-96-r2	`ibm/granite-ttm-1024-96-r2`	$0.00013	$0.00038	1,024	• Model card • Website • Research paper
granite-ttm-1536-96-r2	`ibm/granite-ttm-1536-96-r2`	$0.00013	$0.00038	1,536	• Model card • Website • Research paper

Third-party foundation models

The following table lists the supported third-party foundation models that are provided with watsonx.ai.

Table 3. Third-party foundation models provided with watsonx.ai
Model name	API model ID	Provider	Input price (USD/1,000 tokens)	Output price (USD/1,000 tokens)	Context window (input + output tokens)	More information
allam-1-13b-instruct	`sdaia/allam-1-13b-instruct`	National Center for Artificial Intelligence and Saudi Authority for Data and Artificial Intelligence	$0.0018	$0.0018	4,096	• Model card
elyza-japanese-llama-2-7b-instruct	`elyza/elyza-japanese-llama-2-7b-instruct`	ELYZA, Inc	$0.0018	$0.0018	4,096	• Model card • Blog on note.com
flan-t5-xl-3b	`google/flan-t5-xl`	Google	$0.0006	$0.0006	4,096	• Model card • Research paper Note: This foundation model can be prompt tuned.
flan-t5-xxl-11b	`google/flan-t5-xxl`	Google	$0.0018	$0.0018	4,096	• Model card • Research paper
flan-ul2-20b	`google/flan-ul2`	Google	$0.0050	$0.0050	4,096	• Model card • UL2 research paper • Flan research paper
jais-13b-chat	`core42/jais-13b-chat`	Inception, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), and Cerebras Systems	$0.0018	$0.0018	2,048	• Model card • Research paper
llama-4-maverick-17b-128e-instruct-fp8	`meta-llama/llama-4-maverick-17b-128e-instruct-fp`	Meta	$0.00035	$0.0014	131,072	• Model card • Meta AI blog
llama-4-scout-17b-16e-instruct	`meta-llama/llama-4-scout-17b-16e-instruct`	Meta	No cost during preview	No cost during preview	131,072	• Model card • Meta AI blog
llama-3-3-70b-instruct	`meta-llama/llama-3-3-70b-instruct`	Meta	$0.00071	$0.00071	131,072	• Model card • Meta AI blog
llama-3-2-1b-instruct	`meta-llama/llama-3-2-1b-instruct`	Meta	$0.0001	$0.0001	131,072	• Model card • Meta AI blog • Research paper
llama-3-2-3b-instruct	`meta-llama/llama-3-2-3b-instruct`	Meta	$0.00015	$0.00015	131,072	• Model card • Meta AI blog • Research paper
llama-3-2-11b-vision-instruct	`meta-llama/llama-3-2-11b-vision-instruct`	Meta	$0.00035	$0.00035	131,072	• Model card • Meta AI blog • Research paper
llama-3-2-90b-vision-instruct	`meta-llama/llama-3-2-90b-vision-instruct`	Meta	$0.0020	$0.0020	131,072	• Model card • Meta AI blog • Research paper
llama-guard-3-11b-vision	`meta-llama/llama-guard-3-11b-vision`	Meta	$0.00035	$0.00035	131,072	• Model card • Meta AI blog • Research paper
llama-3-405b-instruct	`meta-llama/llama-3-405b-instruct`	Meta	$0.0050	$0.016	16,384	• Model card • Meta AI blog
llama-2-13b-chat	`meta-llama/llama-2-13b-chat`	Meta	$0.0006	$0.0006	4,096	• Model card • Research paper
mistral-large	`mistralai/mistral-large`	Mistral AI	$0.003	$0.01	131,072	• Model card • Blog post for Mistral Large 2
mistral-medium-2505	`mistralai/mistral-medium-2505`	Mistral AI	$0.003	$0.010	131,072	• Model card • Blog post for Mistral Medium 3
mistral-small-3-1-24b-instruct-2503	`mistralai/mistral-small-3-1-24b-instruct-2503`	Mistral AI	$0.0001	$0.0003	131,072	• Model card • Blog post for Mistral 3.1
mistral-small-24b-instruct-2501	`mistralai/mistral-small-24b-instruct-2501`	Mistral AI	$0.00035	$0.00035	32,768	• Model card • Blog post for Mistral Small 3
mixtral-8x7b-instruct-v01	`mistralai/mixtral-8x7b-instruct-v01`	Mistral AI	$0.0006	$0.0006	32,768	• Model card • Research paper
mt0-xxl-13b	`bigscience/mt0-xxl`	BigScience	$0.0018	$0.0018	4,096	• Model card • Research paper
pixtral-12b	`mistralai/pixtral-12b`	Mistral AI	$0.00035	$0.00035	128,000	• Model card • Blog post for Pixtral 12B

Deploy on demand foundation models

You can work with a foundation model from a set of IBM-curated models to deploy for the exclusive use of your organization.

IBM deploy on demand foundation models

The following table lists the IBM foundation models that you can deploy on demand.

Some IBM foundation models are also available from third-party repositories, such as Hugging Face. IBM foundation models that you obtain from a third-party repository are not indemnified by IBM. Only IBM foundation models that you access from watsonx.ai are indemnified by IBM. For more information about contractual protections related to IBM indemnification, see the IBM Client Relationship Agreement.

Table 4. IBM foundation models available to deploy on demand in watsonx.ai
Model name	Price per hour in USD	Model hosting category	Context window (input + output tokens)
granite-3-3-8b-instruct	$5.22	Small	131,072
granite-3-3-2b-instruct	$5.22	Small	131,072
granite-3-2-8b-instruct	$5.22	Small	131,072
granite-3-1-8b-base	$5.22	Small	131,072
granite-8b-japanese	$5.22	Small	4,096
granite-20b-multilingual	$5.22	Small	8,192
granite-13b-chat-v2	$5.22	Small	8,192
granite-13b-instruct-v2	$5.22	Small	8,192
granite-3b-code-instruct	$5.22	Small	128,000
granite-8b-code-instruct	$5.22	Small	128,000
granite-20b-code-instruct	$5.22	Small	8,192
granite-34b-code-instruct	$5.22	Small	8,192
granite-20b-code-base-schema-linking	$5.22	Small	8,192
granite-20b-code-base-sql-gen	$5.22	Small	8,192
granite-3-8b-base	$5.22	Small	4,096

Third-party deploy on demand foundation models

The following table lists the third-party foundation models that you can deploy on demand.

Table 5. Third-party foundation models available to deploy on demand in watsonx.ai
Model name	Provider	Price per hour in USD	Model hosting category	Context window (input + output tokens)
allam-1-13b-instruct	National Center for Artificial Intelligence and Saudi Authority for Data and Artificial Intelligence	$5.22	Small	4,096
codellama-34b-instruct-hf	Code Llama	$10.40	Medium	16,384
deepseek-r1-distill-llama-8b	DeepSeek AI	$5.22	Small	131,072
deepseek-r1-distill-llama-70b	DeepSeek AI	$20.85	Large	131,072
eurollm-1-7b-instruct	Utter project	$5.22	Small	4,096
eurollm-9b-instruct	Utter project	$5.22	Small	4,096
flan-t5-xl-3b	Google	$5.22	Small	4,096
flan-t5-xxl-11b	Google	$5.22	Small	4,096
flan-ul2-20b	Google	$5.22	Small	4,096
llama-2-13b-chat	Meta	$5.22	Small	4,096
llama-2-70b-chat	Meta	$20.85	Large	4,096
llama-3-8b-instruct	Meta	$5.22	Small	8,192
llama-3-70b-instruct	Meta	$20.85	Large	8,192
llama-3-1-8b	Meta	$5.22	Small	131,072
llama-3-1-70b	Meta	$20.85	Large	131,072
llama-3-1-8b-instruct	Meta	$5.22	Small	131,072
llama-3-1-70b-instruct	Meta	$20.85	Large	131,072
llama-3-2-11b-vision-instruct	Meta	$5.22	Small	131,072
llama-3-3-70b-instruct	Meta	$10.40	Medium	131,072
llama-3-3-70b-instruct-hf	Meta	$20.85	Large	131,072
mixtral-8x7b-base	Mistral AI	$10.40	Medium	32,768
mixtral-8x7b-instruct-v01	Mistral AI	$10.40	Medium	32,768
mistral-large-instruct-2407	Mistral AI	$55.15 (See note.)	Large	131,072
mistral-large-instruct-2411	Mistral AI	$55.15 (See note.)	Large	131,072
mistral-nemo-instruct-2407	Mistral AI	$5.22	Small	131,072
mt0-xxl-13b	BigScience	$5.22	Small	4,096
poro-34b-chat	LumiOpen	$10.40	Medium	2,048

Note:

There is an hourly access fee associated with hosting the mistral-large-instruct-2411 and mistral-large-instruct-2407 foundation models from Mistral AI for dedicated use. The total price for hosting these deploy on demand foundation models is the sum of the access price plus the hosting price.

Hosting: $20.85 + Access: $34.30 = Total: $55.15 USD per hour

Learn more

IBM foundation models
Third-party foundation models
For more information about the foundation models that IBM provides for embedding and reranking text, see Supported encoder models.
For a list of which models are provided in each regional data center, see Regional availability of foundation models.
For details about foundation model pricing, see Billing details for generative AI assets.
For information about pricing and rate limiting, see watsonx.ai Runtime plans.

Parent topic: Gen AI solutions

Was the topic helpful?

0/1000