Skip to main content
Version: v1.2

Get Audio Transcription

Use this API to retrieve the transcription result specifically from the audio track of a video you have uploaded.

uploadupload

Prerequisites

  • You’re familiar with the concepts described on the Platform overview page.
  • You have uploaded a video via the Upload API and obtained its videoNo.
  • You have a valid memories.ai API key.
  • The video must include an audio track.

Host URL

  • https://api.memories.ai

Endpoint

GET /serve/api/v1/get_audio_transcription


Request Example

import requests

headers = {"Authorization": "<API_KEY>"}

params = {"video_no": "<VIDEO_ID>", "unique_id": "<UNIQUE_ID>"}

response = requests.get("https://api.memories.ai/serve/api/v1/get_audio_transcription", headers=headers, params=params)

print("Status:", response.status_code)
try:
print("Video Transcription Response:", response.json())
except Exception:
print("Response Text:", response.text)

Request Parameters

NameLocationTypeRequiredDescription
video_noquerystringYesUnique video number returned after upload
AuthorizationheaderstringYesYour API key for authentication
unique_idbodystringNodefault by default

Response Example

Status code 200

{
"code": "0000",
"msg": "success",
"data": {
"videoNo": "VI605961375402668032",
"transcriptions": [
{
"index": 0,
"content": " I'm going to get my emergency and report.",
"startTime": "0",
"endTime": "7"
},
{
"index": 1,
"content": " Mommy, you're going to go.",
"startTime": "7",
"endTime": "12"
},
{
"index": 2,
"content": " What's going on up there?",
"startTime": "12",
"endTime": "15"
},
{
"index": 3,
"content": " Yes.",
"startTime": "15",
"endTime": "20"
},
{
"index": 4,
"content": " Mommy, you're going to go.",
"startTime": "20",
"endTime": "30"
},
{
"index": 5,
"content": " Mommy, you're going to go.",
"startTime": "30",
"endTime": "35"
}
]
},
"success": true,
"failed": false
}

Response Structure

NameTypeDescription
codestringResponse status code
msgstringHuman-readable status message
dataobjectResponse data container
» videoNostringThe unique video ID
» transcriptionlist of objectsList of transcription segments from the audio track
»» startfloatSegment start time in seconds
»» endfloatSegment end time in seconds
»» textstringTranscribed speech content for that time segment

Notes

  • This endpoint returns only the transcription derived from the audio track.
  • Ensure the video has finished processing (status: PARSE) before calling this API.