Used Tools & Technologies
Not specified
Required Skills & Competences
Tag name is followed by "@" symbol and proficiency level value.
About proficiency levels:
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Communication @ 6
macOS @ 3
AI @ 3
- 1-2 — basic awareness. Minimal hands-on experience, and a rudimentary understanding of the technology's purpose;
- 3-6 — daily use. Comfortable and regular usage, capable of handling common tasks and challenges related to the technology;
- 7-9 — you are an expert, you can teach others, you know all the pitfalls and tricks;
- 10 — exceptional knowledge, comprehensive understanding, and adeptness in all aspects of the technology, including advanced problem-solving. Think twice before claiming or demanding such level.
Details
As an AI Tutor specialized in multilingual audio capabilities, you will train and refine Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. Work focuses on curating and annotating high-quality audio data to enhance Grok's global accessibility, enabling natural spoken interactions and improving the AI's handling of multilingual audio nuances.
Responsibilities
- Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
- Support delivery of high-quality curated audio data that ensures clear, natural spoken output and accurate representation of linguistic and prosodic details (intonation, rhythm, accent) and professional audio standards.
- Collaborate with technical staff to develop tasks that improve the AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
- Work with technical staff to improve annotation tools and audio workflows.
Requirements
- Native proficiency in Norwegian with exposure to diverse accents, dialects, or regional variations.
- Proficiency in English (minimum B2) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.
- Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.
- Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.
- Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.
- Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.
- Strong comprehension, independent judgment on ambiguous audio, communication, interpersonal, analytical, detail-oriented, and organizational skills.
- For device compatibility: Chromebook, Mac with macOS 11.0 or later, or Windows 10 or later.
Preferred Skills and Experience
- Exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.
- Deep understanding of what constitutes good/useful audio data and advanced transcription/annotation practices (handling disfluencies, accents, prosodic features) with high consistency and accuracy.
- Background in linguistics (phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or equivalent practical experience, with ability to analyze accent variation and multilingual speech patterns.
- Experience with speech/audio datasets, annotation workflows, or AI training data; knowledge/experience with training voice models and understanding how data quality impacts model performance.
- Professional experience in voice work (voice acting, voice recording, podcasting) or a portfolio demonstrating voice samples, annotated transcripts, or audio-related work is strongly preferred.
Location and Other Expectations
- Tutor roles may be offered as full-time, part-time, or contractor positions. Contractor hours vary widely based on project scope and availability; on average, most projects may require at least 10 hours per week (not a fixed commitment).
- Roles may be performed remotely from any location worldwide, subject to legal eligibility and time-zone compatibility.
- For US-based candidates, xAI cannot hire in Wyoming and Illinois.
- xAI is unable to provide visa sponsorship.
Compensation and Benefits
- US-based candidates: $35/hour - $45/hour depending on experience, skills, education, geographic location, and qualifications. International compensation details provided during recruitment.
- Benefits vary by employment type and location. For eligible U.S.-based positions, benefits may include health insurance, 401(k), and paid sick leave. Specific details provided during the interview process.
xAI is an equal opportunity employer. For details on data processing, view the Recruitment Privacy Notice referenced in the original posting.