Skip to main content
GET
/
v1
/
sessions
/
{id}
/
transcript
curl "https://api.kallglot.com/v1/sessions/sess_01HXYZ123456789/transcript?format=json" \
  -H "Authorization: Bearer sk_live_your_api_key"
{
  "object": "transcript",
  "session_id": "sess_01HXYZ123456789",
  "status": "complete",
  "language": {
    "source": "de",
    "target": "en"
  },
  "segments": [
    {
      "id": "seg_001",
      "speaker": "agent",
      "language": "en",
      "text": "Hello, thank you for calling. How can I help you today?",
      "start_time": 0.0,
      "end_time": 3.2,
      "confidence": 0.98
    },
    {
      "id": "seg_002",
      "speaker": "customer",
      "language": "de",
      "text": "Guten Tag, ich habe eine Frage zu meiner Bestellung.",
      "translation": "Good day, I have a question about my order.",
      "start_time": 3.5,
      "end_time": 6.8,
      "confidence": 0.96
    },
    {
      "id": "seg_003",
      "speaker": "agent",
      "language": "en",
      "text": "Of course, I'd be happy to help. What's your order number?",
      "start_time": 7.2,
      "end_time": 10.1,
      "confidence": 0.97
    },
    {
      "id": "seg_004",
      "speaker": "customer",
      "language": "de",
      "text": "Die Bestellnummer ist zwölf, drei, vier, fünf.",
      "translation": "The order number is twelve, three, four, five.",
      "start_time": 10.5,
      "end_time": 13.2,
      "confidence": 0.95
    }
  ],
  "duration": 245.3,
  "word_count": 512
}
Retrieve the complete transcript for a session. The transcript includes all spoken text with speaker identification, timestamps, and translations.

Path Parameters

id
string
required
The unique session identifier (e.g., sess_01HXYZ123456789).

Query Parameters

format
string
default:"json"
Output format:
  • json - Structured JSON with segments (default)
  • text - Plain text with speaker labels
  • srt - SRT subtitle format
  • vtt - WebVTT subtitle format
include_translations
boolean
default:true
Whether to include translated text for each segment.
speaker_labels
string
default:"role"
How to label speakers:
  • role - Use agent and customer labels
  • channel - Use channel_0 and channel_1 labels
  • name - Use actual names if available from metadata

Response (JSON format)

object
string
Always transcript.
session_id
string
The session this transcript belongs to.
status
string
Transcript status:
  • processing - Still being generated
  • complete - Fully available
  • failed - Generation failed
language
object
Language information.
segments
array
Array of transcript segments.
duration
number
Total transcript duration in seconds.
word_count
number
Total word count across all segments.
curl "https://api.kallglot.com/v1/sessions/sess_01HXYZ123456789/transcript?format=json" \
  -H "Authorization: Bearer sk_live_your_api_key"
{
  "object": "transcript",
  "session_id": "sess_01HXYZ123456789",
  "status": "complete",
  "language": {
    "source": "de",
    "target": "en"
  },
  "segments": [
    {
      "id": "seg_001",
      "speaker": "agent",
      "language": "en",
      "text": "Hello, thank you for calling. How can I help you today?",
      "start_time": 0.0,
      "end_time": 3.2,
      "confidence": 0.98
    },
    {
      "id": "seg_002",
      "speaker": "customer",
      "language": "de",
      "text": "Guten Tag, ich habe eine Frage zu meiner Bestellung.",
      "translation": "Good day, I have a question about my order.",
      "start_time": 3.5,
      "end_time": 6.8,
      "confidence": 0.96
    },
    {
      "id": "seg_003",
      "speaker": "agent",
      "language": "en",
      "text": "Of course, I'd be happy to help. What's your order number?",
      "start_time": 7.2,
      "end_time": 10.1,
      "confidence": 0.97
    },
    {
      "id": "seg_004",
      "speaker": "customer",
      "language": "de",
      "text": "Die Bestellnummer ist zwölf, drei, vier, fünf.",
      "translation": "The order number is twelve, three, four, five.",
      "start_time": 10.5,
      "end_time": 13.2,
      "confidence": 0.95
    }
  ],
  "duration": 245.3,
  "word_count": 512
}

Notes

Transcripts are typically available within a few seconds after the session ends. For long sessions, processing may take up to a minute.
Word-level timing data is available on Pro and Enterprise plans. Include ?include_words=true to get detailed word timing.