Used Tools & Technologies
Not specified
Required Skills & Competences
Each tag name is followed by an "@" symbol and a proficiency level value.
About proficiency levels:
- 1-2 - basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 - daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 - you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 - exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such a level.
Communication @ 6
macOS @ 3
AI @ 3
Details
Contribute to xAI's mission by training and refining Grok's multilingual audio capabilities. The role focuses on curating, annotating, and recording high-quality audio data to improve voice interactions, speech recognition, and auditory experiences across languages, accents, and cultural contexts.
Responsibilities
- Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
- Support delivery of high-quality curated audio data that meets professional audio standards, ensuring clear, natural spoken output and accurate representation of linguistic and prosodic details (intonation, rhythm, accent).
- Collaborate with technical staff to develop tasks that improve AI handling of speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
- Work with technical staff to improve annotation tools and audio workflows.
Requirements
- Native proficiency in Greek, with exposure to diverse accents, dialects, or regional variations.
- Proficiency in English (minimum B2) with clear, natural vocal delivery and pronunciation suitable for audio recording.
- Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality.
- Ability to handle multilingual audio content and to evaluate speech accuracy, cultural vocal expressions, and contextual interpretation.
- Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.
- Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.
- Strong comprehension, independent judgment on ambiguous audio, attention to detail, and strong communication, interpersonal, analytical, and organizational skills.
- Commitment to developing AI that masters sophisticated multilingual audio capabilities.
Preferred Skills and Experience
- Exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.
- Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion) with high consistency and accuracy.
- Background in linguistics (phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or equivalent practical experience, with demonstrated ability to analyze accent variation and multilingual speech patterns.
- Experience working with speech/audio datasets, annotation workflows, or AI training data; knowledge/experience with training voice models and understanding how data quality impacts model performance.
- Professional experience in voice work (voice acting, voice recording, podcasting) or audio production demonstrating attention to clarity and recording quality.
- Portfolio (strongly preferred for advanced candidates): voice samples, annotated transcripts, or audio-related work demonstrating quality and methodology.
Location and Other Expectations
- Roles may be full-time, part-time, or contractor positions. Contractors have flexible hours; most projects may require at least 10 hours/week on average, though this is not a fixed commitment.
- Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility and time-zone compatibility.
- For US-based candidates, xAI cannot hire in Wyoming or Illinois.
- xAI is unable to provide visa sponsorship for this role.
- Workers using a personal device need one of the following: a Chromebook, a Mac with macOS 11.0 or later, or a PC with Windows 10 or later.
Compensation and Benefits
- US-based candidates: $35/hour - $45/hour depending on experience, skills, education, geographic location, and qualifications. International candidates: information provided during recruitment.
- Benefits vary by employment type and jurisdiction. For eligible U.S.-based positions, benefits may include health insurance, 401(k), and paid sick leave. Specific details are provided during the interview process.