You are viewing a preview of this job. Log in or register to view more details about this job.

Freelance Opportunity Transcription Specialist Remote

Transcription Specialist

Uber AI Solutions is seeking detail-oriented transcription specialists to support a large-scale generative AI training program. In this engagement, you will transcribe and annotate audio files (Single & Multitrack) with accuracy, capturing utterance, stutter, and linguistic nuance exactly as spoken.

Supported Languages & Dialects

We are looking for freelancers in the following languages: 

  • Arabic: (ar-001 | ar-MSA), (ar-SA), (ar-AE | ar-UAE)
  • Bengali: (bn-BD | bn-IN)
  • Catalan: (ca-ES)
  • Chinese: (zh-CN | zh-Hans), (zh-Hant), (zh-HK), (zh-TW)
  • Croatian: (hr-HR)
  • Czech: (cs-CZ)
  • Danish: (da-DK)
  • Dutch: (nl-NL)
  • English: (en-US), (en-GB)
  • Estonian: (et-EE)
  • Finnish: (fi-FI)
  • French: (fr-FR), (fr-CA)
  • German: (de-DE), (de-CH)
  • Greek: (el-GR)
  • Hebrew: (he-IL)
  • Hindi: (hi-IN)
  • Hungarian: (hu-HU)
  • Indonesian: (id-ID)
  • Italian: (it-IT)
  • Japanese: (ja-JP)
  • Kannada: (kn-IN)
  • Korean: (ko-KR)
  • Lithuanian: (lt-LT)
  • Maithili: (mai-IN)
  • Malay: (ms-MY)
  • Malayalam: (ml-IN)
  • Norwegian: (no-NO)
  • Polish: (pl-PL)
  • Portuguese: (pt-PT), (pt-BR)
  • Romanian: (ro-RO)
  • Russian: (ru-RU)
  • Sinhala: (si-LK)
  • Slovak: (sk-SK)
  • Spanish: (es-ES), (es-US), (es-419 | es-LATAM), (es-MX)
  • Swedish: (sv-SE)
  • Tagalog/Filipino: (tl-PH)
  • Tamil: (ta-IN)
  • Telugu: (te-IN)
  • Thai: (th-TH)
  • Turkish: (tr-TR)
  • Ukrainian: (uk-UA)
  • Urdu: (ur-PK)
  • Vietnamese: (vi-VN)

What you’ll work on

  • Transcription: Transcribe audio with 98% accuracy, capturing every disfluency, filler word (um, uh), false start, and stutter exactly as heard.
  • Precision Timestamping: Align text segments to the audio waveform with millisecond precision (max gap <500ms).
  • Speaker Identification: Accurately identify and label speakers in multi-speaker audio files (2–8 interlocutors).
  • Tagging and Annotation: Apply correct tags for non-speech events—like (laughs) or (applause)—and unintelligible segments.

Skills and Qualifications

  • Native-level fluency: You must be a native speaker of the assigned language with a deep understanding of cultural nuances and regional accents.
  • Attention to Detail: You can distinguish between "clean" speech and "verbatim" speech (e.g., typing "I- I- I don't know" instead of "I don't know").
  • Tech Savvy: You are comfortable learning and navigating new web-based annotation tools.

Engagement Details

  • Location: Remote (Global)
  • Volume: Steady task flow available for high-quality contributors. (Note: Additional details around the project will be provided as they become available.).
  • Flexibility: Work on your own schedule, provided quality, consistency, and deadline standards are met.
  • Type: Freelance/Independent Contractor

Why this matters 

Your expertise will guide how AI systems handle complex logic and human-centered communication. By transcribing and refining audio and text and responses, you’ll help ensure that AI is not only accurate but also clear, safe, and engaging for professional use.