This guide walks through setting up an end-to-end conversational AI pipeline using NVIDIA NIMs on DigitalOcean GPU Droplets, including client setup instructions.
Ensure you have the following:
- DigitalOcean CLI: install `doctl`
- NGC API Key: generate one from NVIDIA NGC and install `ngc-cli`
- Anthropic API Key: generate one from Anthropic
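Before proceeding, it can help to sanity-check that the CLIs and keys above are actually in place. A minimal sketch; the environment variable names `NGC_API_KEY` and `ANTHROPIC_API_KEY` are assumptions for this check, not names the tools require:

```bash
# Sketch: verify the prerequisite CLIs and API keys are available.
# NGC_API_KEY / ANTHROPIC_API_KEY are assumed variable names, not required ones.
missing=""
for cmd in doctl ngc; do
  command -v "$cmd" >/dev/null 2>&1 || missing="$missing $cmd"
done
unset_keys=""
for var in NGC_API_KEY ANTHROPIC_API_KEY; do
  [ -n "${!var}" ] || unset_keys="$unset_keys $var"
done
echo "missing CLIs:${missing:- none}"
echo "unset keys:${unset_keys:- none}"
```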
- Create a GPU Droplet:
  Use `doctl` to spin up a GPU Droplet, replacing `<region>` and `<ssh-key-fingerprint>` with appropriate values:

  ```bash
  doctl compute droplet create ab-ai-ctk --region <tor1/ams3> --image gpu-h100x1-base --size gpu-h100x1-80gb --ssh-keys <ssh-key-fingerprint>
  ```
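Once the droplet is up, you will need its public IP for the client step later. One way to fetch it, assuming the droplet name `ab-ai-ctk` from the command above, guarded so the sketch degrades gracefully where `doctl` is not installed or authenticated:

```bash
# Sketch: look up the droplet's public IPv4 by name with doctl.
DROPLET_NAME="ab-ai-ctk"
PUBLIC_IP=""
if command -v doctl >/dev/null 2>&1; then
  PUBLIC_IP="$(doctl compute droplet list --format Name,PublicIPv4 --no-header \
    | awk -v n="$DROPLET_NAME" '$1 == n { print $2 }')"
fi
echo "droplet public IP: ${PUBLIC_IP:-<not found>}"
```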
- Run the NIM Services:

  ```bash
  ## initial setup
  ngc config set
  docker login nvcr.io   # at the prompts, enter:
  # Username: $oauthtoken
  # Password: <ngc_api_key>

  # Note: this cache directory is where models are downloaded inside the
  # container. If this volume is not mounted, the container does a fresh
  # download of the model every time it starts.
  mkdir ~/nim-cache
  export NIM_CACHE_PATH=~/nim-cache
  sudo chmod -R 777 $NIM_CACHE_PATH

  ## run the services
  cd server
  # rename .env.example to .env and add the values in the .env file
  mv .env.example .env
  # spin up the NIM services (ASR and TTS)
  docker-compose --env-file .env up
  ```
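After `docker-compose` reports the services as up, a quick way to confirm the ASR and TTS endpoints are listening. Ports 50051/50052 are the ones the client command below targets; `nc` (netcat) is assumed to be installed:

```bash
# Sketch: check that the ASR (50051) and TTS (50052) ports accept TCP
# connections. Run on the droplet itself (localhost).
reachable=""
for port in 50051 50052; do
  if command -v nc >/dev/null 2>&1 && nc -z -w 2 localhost "$port" 2>/dev/null; then
    reachable="$reachable $port"
    echo "port $port: open"
  else
    echo "port $port: not reachable yet"
  fi
done
```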
- Run the Speech2Speech Client:
  Install the dependencies on your client machine:

  ```bash
  pip3.13 install -r requirements.txt
  ```

  Use the following command to transcribe audio from your microphone, replacing `<public-ip>` with the public IP of your GPU Droplet:

  ```bash
  python3.13 src/main.py --asr-server <public-ip>:50051 --tts-server <public-ip>:50052 --language-code en-US --input-device 0 --output-device 1 --stream
  ```
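The `--input-device` and `--output-device` indices vary per machine. If the client's requirements include the `sounddevice` package (an assumption; any PortAudio-based device lister works equally well), you can enumerate devices and their indices with:

```bash
# Sketch: list audio devices to choose --input-device / --output-device
# indices. Assumes the sounddevice Python package is installed.
DEVICES="$(python3 -m sounddevice 2>/dev/null || true)"
if [ -n "$DEVICES" ]; then
  echo "$DEVICES"
else
  echo "sounddevice not installed; try: pip3.13 install sounddevice"
fi
```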