This version is still in development and is not considered stable yet. For the latest snapshot version, please use Spring AI 1.0.0-SNAPSHOT!
More info can be found here
Property | Description | Default |
---|---|---|
spring.ai.watsonx.ai.base-url | The URL to connect to | |
spring.ai.watsonx.ai.stream-endpoint | The streaming endpoint | generation/stream?version=2023-05-29 |
spring.ai.watsonx.ai.text-endpoint | The text endpoint | generation/text?version=2023-05-29 |
spring.ai.watsonx.ai.project-id | The project ID | - |
spring.ai.watsonx.ai.iam-token | The IBM Cloud account IAM token | - |
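
As a rough illustration of the connection properties, a minimal `application.properties` fragment might look like the following. This is a sketch under assumptions: the base URL shown is an example regional endpoint rather than a documented default, and `WATSONX_PROJECT_ID` and `WATSONX_IAM_TOKEN` are hypothetical environment variable names resolved through Spring's `${...}` placeholders.

```properties
# Assumed example endpoint; replace with the URL of your own watsonx.ai instance.
spring.ai.watsonx.ai.base-url=https://us-south.ml.cloud.ibm.com

# Endpoint paths, written out with the default values from the table.
spring.ai.watsonx.ai.stream-endpoint=generation/stream?version=2023-05-29
spring.ai.watsonx.ai.text-endpoint=generation/text?version=2023-05-29

# These two properties have no default and must be supplied.
# The environment variable names are hypothetical.
spring.ai.watsonx.ai.project-id=${WATSONX_PROJECT_ID}
spring.ai.watsonx.ai.iam-token=${WATSONX_IAM_TOKEN}
```

Reading the project ID and IAM token from the environment keeps credentials out of version control.
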
Property | Description | Default |
---|---|---|
spring.ai.watsonx.ai.chat.enabled | Enable the Watsonx.AI chat model. | true |
spring.ai.watsonx.ai.chat.options.temperature | The temperature of the model. Increasing the temperature will make the model answer more creatively. | 0.7 |
spring.ai.watsonx.ai.chat.options.top-p | Works together with top-k. A higher value (e.g., 0.95) will lead to more diverse text, while a lower value (e.g., 0.2) will generate more focused and conservative text. | 1.0 |
spring.ai.watsonx.ai.chat.options.top-k | Reduces the probability of generating nonsense. A higher value (e.g., 100) will give more diverse answers, while a lower value (e.g., 10) will be more conservative. | 50 |
spring.ai.watsonx.ai.chat.options.decoding-method | The decoding method, that is, the process the model uses to choose the tokens in the generated output. | greedy |
spring.ai.watsonx.ai.chat.options.max-new-tokens | Sets the maximum number of tokens the LLM may generate. | 20 |
spring.ai.watsonx.ai.chat.options.min-new-tokens | Sets the minimum number of tokens the LLM must generate. | 0 |
spring.ai.watsonx.ai.chat.options.stop-sequences | Sets the sequences at which the LLM should stop. For example, with ["\n\n\n"], generation terminates when the LLM produces three consecutive line breaks. Stop sequences are ignored until the number of tokens specified by min-new-tokens has been generated. | - |
spring.ai.watsonx.ai.chat.options.repetition-penalty | Sets how strongly to penalize repetitions. A higher value (e.g., 1.8) will penalize repetitions more strongly, while a lower value (e.g., 1.1) will be more lenient. | 1.0 |
spring.ai.watsonx.ai.chat.options.random-seed | To produce repeatable results, set the same random seed value every time. | randomly generated |
spring.ai.watsonx.ai.chat.options.model | The identifier of the LLM to use. | google/flan-ul2 |
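
To show how these options fit together, here is a hedged sketch of an `application.properties` fragment that overrides a few of the defaults. The chosen values are illustrative only, `END` is a hypothetical stop marker, and the single-value form for `stop-sequences` assumes the property binds to a list in the usual Spring Boot way.

```properties
# Keep the chat model enabled (this is already the default).
spring.ai.watsonx.ai.chat.enabled=true

# Model identifier; this is the documented default, shown explicitly.
spring.ai.watsonx.ai.chat.options.model=google/flan-ul2

# Lower temperature than the 0.7 default for more conservative answers.
spring.ai.watsonx.ai.chat.options.temperature=0.2

# Allow longer completions than the default limit of 20 new tokens.
spring.ai.watsonx.ai.chat.options.min-new-tokens=10
spring.ai.watsonx.ai.chat.options.max-new-tokens=200

# Hypothetical stop marker; it is ignored until min-new-tokens have been generated.
spring.ai.watsonx.ai.chat.options.stop-sequences=END

# Mild repetition penalty and a fixed seed for repeatable results.
spring.ai.watsonx.ai.chat.options.repetition-penalty=1.1
spring.ai.watsonx.ai.chat.options.random-seed=42
```

Any option left out of the file keeps the default listed in the table.
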
In addition to the model-specific WatsonxAiChatOptions.java, you can use a portable ChatOptions instance, created with ChatOptionsBuilder#builder().
For more information, see watsonx-parameters-info.