CosmicAC Logo

Create a Managed Inference Job (Parakeet)

Create a speech-to-text Parakeet Managed Inference Job with the CLI.

Create a Parakeet Managed Inference Job from the CLI. Answer the prompts or pass flags, and CosmicAC deploys the speech-to-text model behind an OpenAI-compatible transcription endpoint.

Prerequisites

You need the following before you start:

Steps

Create the job

Create the job interactively by answering prompts, or non-interactively by passing flags.

Start the interactive job setup:

cosmicac jobs create

Select Managed Inference (Parakeet) as the job type.

Set these fields:

  • Job name — a name to identify the job.
  • Tags — comma-separated labels for the job.
  • Location — the region where the job runs.
  • GPU type — the GPU to use. The CLI lists the GPUs available in your location.
  • GPU count — the number of GPUs.
  • Model — the Parakeet model to serve, nvidia/parakeet-tdt-0.6b-v3.
  • Chunk duration — the audio chunk length in seconds.
  • Chunk overlap — the overlap between chunks in seconds.
  • Max file size (MB) — the maximum audio upload size.
  • Endpoint name — a name for the endpoint, used in its URL path.
  • Require Authorization header — whether callers must send an API key. See Create an API key.

The Job configuration reference describes each field and its CLI flag.

Confirm the deployment

List your jobs to confirm CosmicAC created the job:

cosmicac jobs list

The job appears in the table with its ID, name, tags, and status. Wait for it to provision. The endpoint is ready to serve once its status is running.

Next steps

On this page