Create a Managed Inference Job (Parakeet)
Create a Parakeet Managed Inference Job from the CLI. Answer the prompts or pass flags, and CosmicAC deploys the speech-to-text model behind an OpenAI-compatible transcription endpoint.
Prerequisites
You need the following before you start:
- A running CosmicAC deployment. See Installation.
- The CosmicAC CLI installed and configured. See Install the CLI.
Steps
Create the job
Create the job interactively by answering prompts, or non-interactively by passing flags.
Start the interactive job setup:
cosmicac jobs create
Select Managed Inference (Parakeet) as the job type.
Set these fields:
- Job name — a name to identify the job.
- Tags — comma-separated labels for the job.
- Location — the region where the job runs.
- GPU type — the GPU to use. The CLI lists the GPUs available in your location.
- GPU count — the number of GPUs.
- Model — the Parakeet model to serve, nvidia/parakeet-tdt-0.6b-v3.
- Chunk duration — the audio chunk length in seconds.
- Chunk overlap — the overlap between chunks in seconds.
- Max file size (MB) — the maximum audio upload size.
- Endpoint name — a name for the endpoint, used in its URL path.
- Require Authorization header — whether callers must send an API key. See Create an API key.
The Job configuration reference describes each field and its CLI flag.
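To show how the chunk duration and chunk overlap settings relate, here is a minimal sketch that computes the time windows a long recording might be split into. This page does not describe how CosmicAC or Parakeet actually segments audio, so the function name `chunk_windows` and the sliding-window logic are illustrative assumptions only.

```python
from typing import List, Tuple


def chunk_windows(
    total_seconds: float, chunk_duration: float, chunk_overlap: float
) -> List[Tuple[float, float]]:
    """Return (start, end) windows in seconds covering the whole recording.

    Hypothetical helper: assumes each chunk starts
    (chunk_duration - chunk_overlap) seconds after the previous one.
    """
    if chunk_duration <= 0:
        raise ValueError("chunk_duration must be positive")
    if not 0 <= chunk_overlap < chunk_duration:
        raise ValueError("chunk_overlap must be >= 0 and < chunk_duration")

    total = float(total_seconds)
    step = chunk_duration - chunk_overlap
    windows: List[Tuple[float, float]] = []
    start = 0.0
    while start < total:
        # Clamp the final window to the end of the recording.
        end = min(start + chunk_duration, total)
        windows.append((start, end))
        if end >= total:
            break
        start += step
    return windows


if __name__ == "__main__":
    # A 70-second file with 30-second chunks and 5 seconds of overlap.
    for window in chunk_windows(70, 30, 5):
        print(window)
```

Under this assumption, a larger overlap gives the model more shared context at chunk boundaries but increases the number of chunks to process.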
Confirm the deployment
List your jobs to confirm CosmicAC created the job:
cosmicac jobs list
The job appears in the table with its ID, name, tags, and status. Wait for it to provision. The endpoint is ready to serve once its status is running.
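Once the job is running, a client can send audio to the endpoint. The sketch below builds, but does not send, a multipart request in the shape of the OpenAI audio transcription API. The base URL, the `/v1/audio/transcriptions` path suffix, and the `Bearer` authorization scheme are assumptions rather than confirmed CosmicAC behavior, and `build_transcription_request` is a hypothetical helper name. Check your endpoint's details before relying on them.

```python
import urllib.request
import uuid
from typing import Optional


def build_transcription_request(
    base_url: str,
    audio: bytes,
    filename: str,
    model: str,
    api_key: Optional[str] = None,
) -> urllib.request.Request:
    """Build a multipart/form-data POST with `model` and `file` fields."""
    boundary = uuid.uuid4().hex
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        b'Content-Disposition: form-data; name="model"\r\n\r\n',
        model.encode() + b"\r\n",
        f"--{boundary}\r\n".encode(),
        (
            'Content-Disposition: form-data; name="file"; '
            f'filename="{filename}"\r\n'
        ).encode(),
        b"Content-Type: application/octet-stream\r\n\r\n",
        audio + b"\r\n",
        f"--{boundary}--\r\n".encode(),
    ])
    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    if api_key:
        # Assumption: the API key is sent as a Bearer token.
        headers["Authorization"] = f"Bearer {api_key}"
    # Assumption: the endpoint exposes the OpenAI-style transcription path.
    url = base_url.rstrip("/") + "/v1/audio/transcriptions"
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


if __name__ == "__main__":
    # Placeholder URL, audio bytes, and key; replace with your own values.
    req = build_transcription_request(
        "https://cosmicac.example/my-endpoint",
        b"placeholder audio bytes",
        "sample.wav",
        "nvidia/parakeet-tdt-0.6b-v3",
        api_key="YOUR_API_KEY",
    )
    print(req.get_method(), req.full_url)
```

To send the request, pass it to `urllib.request.urlopen(req)` and read the response body. Omit `api_key` if the job was created without the Require Authorization header option.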