Audio

APIs for generating audios.

Upload Audio

This API uploads your audio to vimmerse for future use.

SecurityAPIKeyHeader
Request
Request Body schema: multipart/form-data
audio_file
string <binary> (Audio input file)

Input audio file.

audio_url
string (Audio input URL)
Default: ""

Input audio URL.

Responses
200

New Audio URL

400

Bad Request

402

Insufficient Credit

422

Validation Error

post/audio
Request samples
Response samples
application/json
{
  • "data": {
    }
}

Text 2 Speech

This api generates speech from text prompt.

  • X-Api-Key: Your customer API key.

Request Data

  • prompt: Text prompt of the speech.
  • voice: Voice of the speech. Available voices are "Aria", "Roger", "Sarah", "Laura", "Charlie", "George", "Callum", "River", "Liam", "Charlotte", "Alice", "Matilda", "Will", "Jessica", "Eric", "Chris", "Brian", "Daniel", "Lily" and "Bill". Default is "Rachel".
SecurityAPIKeyHeader
Request
Request Body schema: application/x-www-form-urlencoded
prompt
string (Prompt)
Default: ""

Prompt of the speech.

voice
string (Voice)
Default: "Rachel"

Voice of the speech. Available voices are "Aria", "Roger", "Sarah", "Laura", "Charlie", "George", "Callum", "River", "Liam", "Charlotte", "Alice", "Matilda", "Will", "Jessica", "Eric", "Chris", "Brian", "Daniel", "Lily" and "Bill". Default is "Rachel".

Responses
200

Audio URL

400

Bad Request

402

Insufficient Credit

422

Validation Error

post/audio/text-2-speech
Request samples
Response samples
application/json
{
  • "data": {
    }
}

Text 2 Sound Effect

This API generates sound effects from text prompt.

  • X-Api-Key: Your customer API key.

Request Data

  • prompt: Text prompt of the audio.
  • duration: Duration of the audio. Default value is 5 which means 5 second.
SecurityAPIKeyHeader
Request
Request Body schema: application/x-www-form-urlencoded
prompt
string (Prompt)
Default: ""

Prompt that describes the sound effect.

duration
integer (Duration)
Default: 5

Duration of the audio

Responses
200

Audio URL

400

Bad Request

402

Insufficient Credit

422

Validation Error

post/audio/text-2-sound-effect
Request samples
Response samples
application/json
{
  • "data": {
    }
}

Text 2 Music

This API generates 30-second music from text prompt.

  • X-Api-Key: Your customer API key.

Request Data

  • prompt: Text prompt of the audio.
SecurityAPIKeyHeader
Request
Request Body schema: application/x-www-form-urlencoded
prompt
string (Prompt)
Default: ""

Prompt that describes the music.

Responses
200

Audio URL

400

Bad Request

402

Insufficient Credit

422

Validation Error

post/audio/text-2-music
Request samples
Response samples
application/json
{
  • "data": {
    }
}