Freelance Opportunity: English Language Transcription Specialist

Uber AI Solutions is seeking detail-oriented audio transcription specialists to support a large-scale generative AI training program. In this engagement, you will transcribe spoken audio into text with high precision, capturing every linguistic nuance, dialectal variation, and filler word exactly as spoken.

Supported Languages & Dialects

We are looking for native-speaking freelancers for the following language variety:

  • English (en-US)

Responsibilities

  • Listen to English audio recordings and write down what is said as clearly and accurately as possible.
  • Capture natural speech, including pauses, repeated words, filler words, or corrections when needed.
  • Match the written text to the correct part of the audio so it is easy to review.
  • Label different speakers when more than one person is talking.
  • Add simple notes for important non-speech sounds, such as laughter, applause, background noise, or unclear audio.

Skills and Qualifications

  • Strong English fluency, with comfort understanding different accents, speaking styles, and everyday expressions.
  • Good listening skills and attention to detail.
  • Ability to follow transcription guidelines and decide when speech should be written exactly as heard.
  • Comfortable using online tools and learning new task platforms.

Engagement Details

  • Location: Remote
  • Flexibility: Work on your own schedule, provided quality, consistency, and deadline standards are met.
  • Type: Freelance/Independent Contractor

Why This Matters

Your work provides the critical ground truth data needed to train advanced speech recognition models. By capturing the exact nuances of natural conversation—including stutters, fillers, and accents—you enable AI to understand and process human speech with human-level precision. Through high-fidelity annotation, you are directly improving how technology listens to and interacts with the world.