AI Tutor - Urdu

at xAI
📍 World
USD 35-45 per hour
MIDDLE
✅ Remote

🕙 10-40 hours per week

Used Tools & Technologies

Not specified

Required Skills & Competences

Communication, macOS, AI

Details

Contribute to xAI's work on multilingual audio capabilities by curating, annotating, and recording high-quality audio data to improve Grok's voice interactions, speech recognition, and auditory experiences across languages, accents, and cultural contexts.

Responsibilities

  • Use proprietary software to provide labels, annotations, recordings, and inputs for projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
  • Support delivery of high-quality curated audio data that ensures clear, natural spoken output and accurate representation of linguistic and prosodic details (intonation, rhythm, accent) to professional audio standards.
  • Collaborate with technical staff to develop tasks that improve the AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
  • Work with technical staff to improve annotation tools and make audio workflows more efficient.

Requirements

  • Native proficiency in Urdu, with exposure to diverse accents, dialects, or regional variations.
  • Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording.
  • Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.
  • Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.
  • Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.
  • Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.
  • Strong comprehension and independent judgment on ambiguous or varied audio material (including noisy or accented speech), along with strong communication, interpersonal, analytical, and organizational skills and close attention to detail.
  • Ability to use a personal device: a Chromebook, a Mac running macOS 11.0 or later, or a PC running Windows 10 or later.
  • Commitment to developing AI that masters sophisticated multilingual audio capabilities.

Preferred Skills and Experience

  • Exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.
  • Deep understanding of, and taste for, what constitutes good, useful audio data.
  • Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion) with high consistency and accuracy.
  • Background in linguistics (phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or equivalent practical experience analyzing accent variation and multilingual speech patterns.
  • Experience working with speech/audio datasets, annotation workflows, or AI training data, including familiarity with training voice models and how data quality impacts model performance.
  • Professional experience in voice work (voice acting, voice recording, podcasting) demonstrating attention to clarity and recording quality.
  • Portfolio (strongly preferred for advanced candidates): voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.

Location and Other Expectations

  • Tutor roles may be offered as full-time, part-time, or contractor positions depending on role needs and candidate fit.
  • For contractor positions, hours vary widely and contractors have flexibility to set their own hours. On average, most projects may require at least 10 hours per week, but this is not a fixed commitment and depends on scope and availability.
  • Roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs. (Note: for US-based candidates, xAI cannot hire in Wyoming or Illinois.)
  • xAI is unable to provide visa sponsorship.

Compensation and Benefits

  • US-based candidates: $35 to $45 per hour, depending on experience, skills, education, geographic location, and qualifications. Compensation details for international candidates are provided during recruitment.
  • Benefits vary by employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions may include health insurance, a 401(k) plan, and paid sick leave. Specifics are provided during the interview process.