AI Tutor - Gujarati

at xAI
📍 World
📍 United States
USD 35-45 per hour
MIDDLE
✅ Remote

🕙 10-40 hours per week

Used Tools & Technologies

Not specified

Required Skills & Competences

Communication @ 6 macOS @ 3 AI @ 3

Details

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The team is small, highly motivated, and focused on engineering excellence. Employees are expected to be hands-on, communicate concisely, and contribute directly to the company mission.

Role overview

As an AI Tutor specialized in multilingual audio capabilities, you will train and refine Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. The role focuses on curating and annotating high-quality audio data to enhance Grok's global accessibility, enabling natural spoken interactions and improving handling of multilingual audio nuances.

Responsibilities

  • Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
  • Support delivery of high-quality curated audio data that ensures clear, natural spoken output, and accurate representation of linguistic and prosodic details such as intonation, rhythm, and accent.
  • Collaborate with technical staff to develop tasks that improve the AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
  • Work with technical staff to improve annotation tools for efficient audio workflows.

Requirements (Basic Qualifications)

  • Native proficiency in Gujarati with exposure to diverse accents, dialects, or regional variations.
  • Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.
  • Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.
  • Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.
  • Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.
  • Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.
  • Strong comprehension skills and ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.
  • Strong communication, interpersonal, analytical, detail-oriented, and organizational skills.
  • Commitment to developing AI that masters sophisticated multilingual audio capabilities.

Preferred Skills and Experience

  • Exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.
  • Deep understanding of what good/useful audio data is.
  • Advanced transcription and annotation practices experience, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion) with high consistency and accuracy.
  • Background in linguistics (phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or equivalent practical experience.
  • Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models and understanding how data quality impacts model performance.
  • Professional voice work experience (voice acting, voice recording, podcasting) or portfolio of voice samples, annotated transcripts, or audio-related work is strongly preferred.

Location and other expectations

  • Tutor roles may be offered as full-time, part-time, or contractor positions.
  • For contractor positions, hours vary widely; on average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment.
  • Roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.
  • For US-based candidates, xAI is unable to hire in Wyoming and Illinois at this time.
  • xAI is unable to provide visa sponsorship.
  • For those using a personal device, supported devices: Chromebook, Mac with macOS 11.0 or later, or Windows 10 or later.

Compensation and Benefits

  • US-based candidates: $35/hour - $45/hour depending on experience, skills, education, geographic location, and qualifications.
  • International candidates: Information provided during recruitment.
  • Benefits vary by employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions may include health insurance, 401(k), and paid sick leave.