StageZero / Audio Conversations Segmented and Transcribed

Data Summary

This dataset consists of 100 hours of conversational speech, segmented by speaker and transcribed for training speech recognition models. It covers several languages, including English, Bulgarian, German, Lithuanian, and Norwegian. Each recording is segmented into speech, noise, and music.
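
To make the structure described above concrete, here is a minimal Python sketch of how one recording's segments might be represented and filtered. The listing does not publish a file format or schema, so the `Segment` class, its field names, the label strings, and the example values are all hypothetical and for illustration only.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Segment:
    """One labeled span of a recording (hypothetical schema, not the dataset's own)."""
    start: float                      # start time in seconds
    end: float                        # end time in seconds
    label: str                        # assumed values: "speech", "noise", "music"
    speaker: Optional[str] = None     # set only for speech segments
    transcript: Optional[str] = None  # set only for speech segments


def speech_by_speaker(segments: List[Segment]) -> Dict[str, List[Segment]]:
    """Group transcribed speech segments by speaker, skipping noise and music."""
    grouped: Dict[str, List[Segment]] = {}
    for seg in segments:
        if seg.label == "speech" and seg.speaker and seg.transcript:
            grouped.setdefault(seg.speaker, []).append(seg)
    return grouped


# Invented example data, not taken from the dataset.
example = [
    Segment(0.0, 2.5, "speech", "spk1", "hello there"),
    Segment(2.5, 3.0, "noise"),
    Segment(3.0, 5.0, "speech", "spk2", "hi"),
    Segment(5.0, 9.0, "music"),
    Segment(9.0, 10.0, "speech", "spk1", "how are you"),
]

grouped = speech_by_speaker(example)
```

A grouping like this would suit speaker-oriented tasks such as diarization or speaker recognition, while a speech recognition pipeline would more likely consume the speech segments as a flat list of audio spans with transcripts.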

Use Cases

  • Speech Recognition
  • Speaker Recognition
  • Dictation
  • Voice Recognition
  • Speaker Diarization