@@ -5,6 +5,7 @@ description: "This guide includes steps and guidance for deploying a large langu
authors: ["Akamai"]
contributors: ["Akamai"]
published: 2025-03-25
modified: 2025-04-17
keywords: ['ai','ai inference','ai inferencing','llm','large language model','app platform','lke','linode kubernetes engine','llama 3','kserve','istio','knative']
license: '[CC BY-ND 4.0](https://creativecommons.org/licenses/by-nd/4.0)'
external_resources:
@@ -66,11 +67,14 @@ If you prefer to manually install an LLM and RAG Pipeline on LKE rather than usi

- Enrollment into the Akamai App Platform's [beta program](https://cloud.linode.com/betas).

- A provisioned and configured LKE cluster with App Platform enabled. We recommend an LKE cluster consisting of at least 3 RTX4000 Ada x1 Medium [GPU](https://techdocs.akamai.com/cloud-computing/docs/gpu-compute-instances) plans.
## Set Up Infrastructure

To learn more about provisioning an LKE cluster with App Platform, see our [Getting Started with App Platform for LKE](https://techdocs.akamai.com/cloud-computing/docs/getting-started-with-akamai-application-platform) guide.
### Provision an LKE Cluster

## Set Up Infrastructure
We recommend provisioning an LKE cluster with [App Platform](https://techdocs.akamai.com/cloud-computing/docs/application-platform) enabled and the following minimum requirements:

- 3 **8GB Dedicated CPUs** with [autoscaling](https://techdocs.akamai.com/cloud-computing/docs/manage-nodes-and-node-pools#autoscale-automatically-resize-node-pools) turned on
- A second node pool consisting of at least 2 **RTX4000 Ada x1 Medium [GPU](https://techdocs.akamai.com/cloud-computing/docs/gpu-compute-instances)** plans

Once your LKE cluster is provisioned and the App Platform web UI is available, complete the following steps to continue setting up your infrastructure.
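
The minimum requirements listed above can be expressed as a small check. This is a minimal sketch, not part of the guide or any Akamai tooling: the `NodePool` class, its field names, and the `missing_requirements` helper are hypothetical, and the plan strings are simply the labels used in the list above.

```python
from __future__ import annotations

from dataclasses import dataclass

# Plan labels copied from the guide's requirements list (not API plan IDs).
CPU_PLAN = "8GB Dedicated CPU"
GPU_PLAN = "RTX4000 Ada x1 Medium"


@dataclass(frozen=True)
class NodePool:
    """Hypothetical description of one LKE node pool."""
    plan: str
    count: int
    autoscale: bool = False


def missing_requirements(pools: list[NodePool]) -> list[str]:
    """Return a message for each recommended minimum the pools do not meet."""
    problems = []
    # At least 3 8GB Dedicated CPU nodes in a pool with autoscaling turned on.
    if not any(p.plan == CPU_PLAN and p.count >= 3 and p.autoscale for p in pools):
        problems.append("need a pool of at least 3 8GB Dedicated CPUs with autoscaling on")
    # A second pool of at least 2 RTX4000 Ada x1 Medium GPU nodes.
    if not any(p.plan == GPU_PLAN and p.count >= 2 for p in pools):
        problems.append("need a pool of at least 2 RTX4000 Ada x1 Medium GPU plans")
    return problems


if __name__ == "__main__":
    planned = [NodePool(CPU_PLAN, 3, autoscale=True), NodePool(GPU_PLAN, 2)]
    print(missing_requirements(planned) or "meets the recommended minimums")
```

An empty list means both recommended pools are present; each returned string names one missing pool.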

@@ -5,6 +5,7 @@ description: "This guide expands on a previously built LLM and AI inferencing ar
authors: ["Akamai"]
contributors: ["Akamai"]
published: 2025-03-25
modified: 2025-04-17
keywords: ['ai','ai inference','ai inferencing','llm','large language model','app platform','lke','linode kubernetes engine','rag pipeline','retrieval augmented generation','open webui','kubeflow']
license: '[CC BY-ND 4.0](https://creativecommons.org/licenses/by-nd/4.0)'
external_resources:
@@ -50,9 +51,13 @@ If you prefer a manual installation rather than one using App Platform for LKE,

## Prerequisites

- Complete the deployment in the [Deploy an LLM for AI Inferencing with App Platform for LKE](/docs/guides/deploy-llm-for-ai-inferencing-on-apl) guide. An LKE cluster consisting of at least 3 RTX4000 Ada x1 Medium [GPU](https://techdocs.akamai.com/cloud-computing/docs/gpu-compute-instances) nodes is recommended for AI inference workloads.
- Complete the deployment in the [Deploy an LLM for AI Inferencing with App Platform for LKE](/docs/guides/deploy-llm-for-ai-inferencing-on-apl) guide. Your LKE cluster should include the following minimum hardware requirements:

- [Python3](https://www.python.org/downloads/) and the [venv](https://docs.python.org/3/library/venv.html) Python module installed on your local machine.
- 3 **8GB Dedicated CPUs** with [autoscaling](https://techdocs.akamai.com/cloud-computing/docs/manage-nodes-and-node-pools#autoscale-automatically-resize-node-pools) turned on

- A second node pool consisting of at least 2 **RTX4000 Ada x1 Medium [GPU](https://techdocs.akamai.com/cloud-computing/docs/gpu-compute-instances)** plans

- [Python3](https://www.python.org/downloads/) and the [venv](https://docs.python.org/3/library/venv.html) Python module installed on your local machine
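
The local Python prerequisite above can be checked before continuing. This is a minimal sketch using only the standard library; the `local_python_ready` helper name is hypothetical, and it only confirms that the interpreter is Python 3 and that the `venv` module can be found, not that a virtual environment can actually be created on your system.

```python
import importlib.util
import sys


def local_python_ready() -> bool:
    """Return True when running under Python 3 and the venv module is importable."""
    has_python3 = sys.version_info.major == 3
    has_venv = importlib.util.find_spec("venv") is not None
    return has_python3 and has_venv


if __name__ == "__main__":
    if local_python_ready():
        print("Python 3 and venv are available")
    else:
        print("Install Python 3 and the venv module before continuing")
```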

## Set Up Infrastructure

4 changes: 0 additions & 4 deletions docs/marketplace-docs/guides/_index.md
@@ -106,17 +106,13 @@ See the [Marketplace](/docs/marketplace/) listing page for a full list of all Ma
- [Rocket.Chat](/docs/marketplace-docs/guides/rocketchat/)
- [Ruby on Rails](/docs/marketplace-docs/guides/ruby-on-rails/)
- [Saltcorn](/docs/marketplace-docs/guides/saltcorn/)
- [SeaTable](/docs/marketplace-docs/guides/seatable/)
- [Secure Your Server](/docs/marketplace-docs/guides/secure-your-server/)
- [Shadowsocks](/docs/marketplace-docs/guides/shadowsocks/)
- [Splunk](/docs/marketplace-docs/guides/splunk/)
- [Superinsight](/docs/marketplace-docs/guides/superinsight/)
- [Uptime Kuma](/docs/marketplace-docs/guides/uptime-kuma/)
- [UTunnel VPN](/docs/marketplace-docs/guides/utunnel/)
- [Valkey](/docs/marketplace-docs/guides/valkey/)
- [VictoriaMetrics Single](/docs/marketplace-docs/guides/victoriametrics-single/)
- [VS Code](/docs/marketplace-docs/guides/vscode/)
- [WarpSpeed VPN](/docs/marketplace-docs/guides/warpspeed/)
- [Wazuh](/docs/marketplace-docs/guides/wazuh/)
- [WireGuard](/docs/marketplace-docs/guides/wireguard/)
- [WooCommerce](/docs/marketplace-docs/guides/woocommerce/)
3 changes: 3 additions & 0 deletions docs/marketplace-docs/guides/seatable/index.md
@@ -11,6 +11,9 @@ license: '[CC BY-ND 4.0](https://creativecommons.org/licenses/by-nd/4.0)'
marketplace_app_id: 1177225
marketplace_app_name: "SeaTable"
---
{{< note type="warning" title="This app is no longer available for deployment" >}}
SeaTable has been removed from the App Marketplace and can no longer be deployed. This guide is retained for reference only.
{{< /note >}}

[SeaTable](https://seatable.io/) is a simple and flexible database management interface with native Python automation support. It is designed to mimic the user-friendly interfaces of common spreadsheet software (like Microsoft Excel and Google Sheets). SeaTable offers advanced data linking capabilities and allows for custom data organization and visualization.

3 changes: 3 additions & 0 deletions docs/marketplace-docs/guides/utunnel/index.md
@@ -15,6 +15,9 @@ license: '[CC BY-ND 4.0](https://creativecommons.org/licenses/by-nd/4.0)'
marketplace_app_id: 925530
marketplace_app_name: "UTunnel VPN"
---
{{< note type="warning" title="This app is no longer available for deployment" >}}
UTunnel VPN has been removed from the App Marketplace and can no longer be deployed. This guide is retained for reference only.
{{< /note >}}

[UTunnel VPN](https://www.utunnel.io/) lets you set up your own private VPN server quickly and easily; no technical expertise is required. It is well suited for small and medium businesses to set up easy and secure remote access for their employees, or for anyone who wants to keep their data private using their own VPN. UTunnel VPN supports multiple VPN protocols and comes with a server management console, secure 256-bit encryption, easy team management, single sign-on, 2-factor authentication, and an inbuilt firewall.

3 changes: 3 additions & 0 deletions docs/marketplace-docs/guides/victoriametrics-single/index.md
@@ -18,6 +18,9 @@ license: '[CC BY-ND 4.0](https://creativecommons.org/licenses/by-nd/4.0)'
marketplace_app_id: 954759
marketplace_app_name: "VictoriaMetrics"
---
{{< note type="warning" title="This app is no longer available for deployment" >}}
VictoriaMetrics has been removed from the App Marketplace and can no longer be deployed. This guide is retained for reference only.
{{< /note >}}

[VictoriaMetrics](https://victoriametrics.com/) is a free [open source time series database](https://en.wikipedia.org/wiki/Time_series_database) (TSDB) and monitoring solution that is designed to collect, store, and process real-time metrics. It supports the [Prometheus](https://en.wikipedia.org/wiki/Prometheus_(software)) pull model and various push protocols ([Graphite](https://en.wikipedia.org/wiki/Graphite_(software)), [InfluxDB](https://en.wikipedia.org/wiki/InfluxDB), OpenTSDB) for data ingestion. It is optimized for storage with high-latency IO, low IOPS, and time series with [high churn rate](https://docs.victoriametrics.com/FAQ.html#what-is-high-churn-rate). For reading the data and evaluating alerting rules, VictoriaMetrics supports the PromQL, [MetricsQL](https://docs.victoriametrics.com/MetricsQL.html), and Graphite query languages.

4 changes: 4 additions & 0 deletions docs/marketplace-docs/guides/warpspeed/index.md
@@ -16,6 +16,10 @@ marketplace_app_id: 923037
marketplace_app_name: "WarpSpeed"
---

{{< note type="warning" title="This app is no longer available for deployment" >}}
WarpSpeed has been removed from the App Marketplace and can no longer be deployed. This guide is retained for reference only.
{{< /note >}}

WarpSpeed makes it easy for developers to access cloud infrastructure via the powerful WireGuard® VPN protocol. It can also be used to enable remote workers to access the internet securely while on public WiFi.

## Deploying a Marketplace App