![]() |
| Getting Started with Nvidia Garak |
What Is Nvidia Garak?
Nvidia Garak (Generative AI Red-teaming and Assessment Kit) is an open-source LLM vulnerability scanner built by NVIDIA's AI Red Team.
![]() |
| Getting Started with Nvidia Garak |
What Is Nvidia Garak?
Nvidia Garak (Generative AI Red-teaming and Assessment Kit) is an open-source LLM vulnerability scanner built by NVIDIA's AI Red Team.
![]() |
| Adding GLIGuard to LiteLLM AI Gateway |
Guardrails for a Large language model (LLM) are rule based safety controls that validate the input and output of a model. They basically act like a gatekeeper between a user and a Large language model.
GLiGuard is an open-source, ultra-fast and very light weight AI guardrail that has only 300 million parameters. It is available on HuggingFace and can be easily integrated on any AI Gateway like LiteLLM.
![]() |
| Getting started with Nvidia AIPerf |
The new Nvidia AIPerf tool is an excellent free tool for LLM Performance testing. You can customise it as per your needs and is a massive upgrade to other tools especially if you use Nvidia GPUs.
|
| Reduce CPU spikes - AI Summarization |
Summarization aims to compress a lengthy source document into a concise format while retaining its core components and key ideas.
However, when you are hosting your own LLM, handling CPU spikes (in the absence of a GPU) can be your biggest concern.
![]() |
| GPU workloads on k3s |
K3s is a highly available, certified Kubernetes distribution designed for production workloads. It can also be used for AI workloads.
By default, k3s nodes do not recognize GPUs. In this article, we will enable k3s to work with a GPU.
![]() |
| EmbeddingGemma on NVIDIA Triton Server |
|
| AI Crawl Control From Cloudflare |
Cloudflare has recently announced a new feature called AI Crawl Control .