Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Communication @ 6
Prioritization @ 6
macOS @ 3
AI @ 3
Details
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The team is small, highly motivated, and focused on engineering excellence. This organization values hands-on contributors, strong prioritization, and clear communication.
As an AI Tutor specialized in multilingual audio capabilities, you will train and refine Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. Your focus will be on curating and annotating high-quality audio data to improve Grok's global accessibility and natural spoken interactions.
Responsibilities
- Use proprietary software to provide labels, annotations, recordings, and inputs for multilingual audio clips, voice recordings, speech samples, and auditory elements.
- Support delivery of high-quality curated audio data to ensure clear, natural spoken output and accurate representation of linguistic and prosodic details (intonation, rhythm, accent).
- Collaborate with technical staff to develop tasks that improve the AI's handling of speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
- Work with technical staff to improve annotation tools and make audio workflows more efficient.
Requirements (Basic Qualifications)
- Native proficiency in French, with exposure to diverse accents, dialects, or regional variations.
- Proficiency in English (minimum B2) with clear, natural vocal delivery and pronunciation suitable for recording.
- Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality.
- Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy and cultural vocal expressions.
- Ability to transcribe audio with high accuracy across accents and varying audio quality.
- Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.
- Strong comprehension, independent judgment on ambiguous audio, and attention to detail.
Preferred Skills and Experience
- Exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.
- Deep understanding of what constitutes useful audio data and advanced transcription/annotation practices (handling disfluencies, accents, prosodic features such as intonation, stress, rhythm, emotion).
- Background in linguistics (phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or equivalent practical experience analyzing accent variation and multilingual speech patterns.
- Experience with speech/audio datasets, annotation workflows, or AI training data and knowledge of how data quality impacts model performance.
- Professional voice experience (voice acting, recording, podcasting) is a plus. Portfolio (voice samples, annotated transcripts) is strongly preferred for advanced candidates.
Location and Other Expectations
- Tutor roles may be full-time, part-time, or contractor positions depending on needs and fit.
- Contractor hours vary widely; most projects may require at least 10 hours per week on average, but this is not a fixed commitment. Contractors have flexibility to set their hours.
- Roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.
- For US-based candidates, xAI is unable to hire in Wyoming and Illinois at this time.
- xAI is unable to provide visa sponsorship.
- Personal device requirements: Chromebook, Mac with macOS 11.0 or later, or Windows 10 or later.
Compensation and Benefits
- US-based candidates: $35/hour - $45/hour depending on experience, skills, education, geographic location, and qualifications. International compensation information is provided during the recruitment process.
- Benefits vary by employment type and location. Benefits for eligible U.S. positions may include health insurance, a 401(k) plan, and paid sick leave. Specifics are provided during the interview process.