Install Alauda Build of KServe
Alauda Build of KServe is a cloud-native component built on KServe for serving generative AI models. As an extension of the Alauda AI ecosystem, it specifically optimizes for Large Language Models (LLMs), offering essential features such as inference orchestration, streaming responses, and resource-based auto-scaling for generative workloads.
TOC
PrerequisitesRequired DependenciesOptional DependenciesInstallation NotesDownloading Cluster PluginUploading the Cluster PluginInstalling Alauda Build of KServeEnvoy Gateway ConfigurationEnvoy AI Gateway ConfigurationKServe Gateway ConfigurationGIE(gateway-api-inference-extension) ConfigurationAlauda AI IntegrationUpgrading Alauda Build of KServePrerequisites
Before installing Alauda Build of KServe, you need to ensure the following dependencies are installed:
Required Dependencies
Alauda build of Envoy Gateway is natively integrated into ACP 4.2. For environments running earlier versions (including ACP 4.0 and 4.1), please contact Customer Support for compatibility and installation guidance.
Optional Dependencies
Installation Notes
- Required Dependencies: All three required dependencies must be installed before installing Alauda Build of KServe.
- GIE Integration: If you want to use GIE, you can enable it during the installation process by selecting the "Integrated GIE" option in the Alauda Build of KServe UI.
- Alauda AI Integration: If you don't need KServe Predictive AI functionality and only want to use LLM Generative AI, you can disable the "Integrated With Alauda AI" option during installation.
Downloading Cluster Plugin
Alauda Build of KServe cluster plugin can be retrieved from Customer Portal.
Please contact Consumer Support for more information.
Uploading the Cluster Plugin
For more information on uploading the cluster plugin, please refer to Uploading Cluster Plugins
Installing Alauda Build of KServe
-
Go to the
Administrator->Marketplace->Cluster Pluginpage, switch to the target cluster, and then deploy theAlauda Build of KServeCluster plugin. -
In the deployment form, configure the following parameters as needed:
Envoy Gateway Configuration
Envoy AI Gateway Configuration
KServe Gateway Configuration
GIE(gateway-api-inference-extension) Configuration
Alauda AI Integration
-
Click Install to begin the installation process.
-
Verify result. You can see the status of "Installed" in the UI.
Upgrading Alauda Build of KServe
- Upload the new version for package of Alauda Build of KServe plugin to ACP.
- Go to the
Administrator->Clusters->Target Cluster->Functional Componentspage, then click theUpgradebutton, and you will see theAlauda Build of KServecan be upgraded.