Alignments（字幕打轴）

正式环境

POST

https://api.302.ai/v1/audio/alignments

将音频转录为输入语言。
转录API接受您想要转录的音频文件作为输入，以及您希望的音频转录输出文件格式。我们目前支持多种输入和输出文件格式。

价格：0.002 PTC /分钟

请求参数

Header 参数

string

必需

示例值:

application/json

Authorization

string

可选

示例值:

Bearer {{YOUR_API_KEY}}

Body 参数multipart/form-data

file

必需

要转录的音频文件，采用以下格式之一：mp3、mp4、mpeg、mpga、m4a、wav 或 webm。

text

string

音频的转录结果

必需

model

string

必需

要使用的模型的 ID。whisper-v3 , whisper-v3-turbo

示例值:

whisper-v3-turbo

vad_model

string

可选

示例值:

silero

preprocessing

string

可选

none
dynamic
soft_dynamic
bass_dynamic

示例值:

none

response_format

string

可选

回复的格式，采用以下格式之一：srt、verbose_json、vtt

示例值:

verbose_json

alignment_model

string

可选

tdnn_ffn
mms_fa
gentle

示例值:

tdnn_ffn

示例代码

Shell

JavaScript

Java

Swift

PHP

Python

HTTP

Objective-C

Ruby

OCaml

Dart

curl --location --request POST 'https://api.302.ai/v1/audio/alignments' \
--header 'Accept: application/json' \
--header 'Authorization: Bearer ' \
--form 'file=@""' \
--form 'text=""' \
--form 'model="whisper-v3-turbo"' \
--form 'vad_model="silero"' \
--form 'preprocessing="none"' \
--form 'response_format="verbose_json"' \
--form 'alignment_model="tdnn_ffn"'

返回响应

🟢200OK

application/json

Body

text

string

必需

示例

{
  "text": "Imagine the wildest idea that you've ever had, and you're curious about how it might scale to something that's a 100, a 1,000 times bigger. This is a place where you can get to do that."
}

修改于 2024-12-11 17:49:25

Transcriptions（语音转文字）

WhisperX（语音转文字）