Speaker Diarization APIs

Segment Audio per speaker

Free during the Beta

Gladia.io's Speaker Diarization API makes it easy to add speech detection and speaker segmentation to your own application. Using algorithms and models developed by our research team, it automatically detects how many speakers are present in an audio stream, attributes each segment to a speaker, and tells you whether anyone is talking at a given moment.

Test it live below (free signup needed):

Here is an example of a diarization result:


Results

{
   "prediction": {
      "labels": [
         "SPEAKER_00"
      ],
      "segments": [
         {
            "start": 0.4978125,
            "end": 8.462812500000002,
            "label": "SPEAKER_00"
         }
      ]
   },
   "prediction_raw": {
      "labels": [
         "SPEAKER_00"
      ],
      "segments": [
         {
            "start": 0.4978125,
            "end": 8.462812500000002,
            "label": "SPEAKER_00"
         }
      ]
   }
}
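
A result shaped like the one above can be post-processed in a few lines. The following is a minimal sketch, not official client code: it assumes the response is delivered as JSON with quoted keys, that `start` and `end` are offsets in seconds, and that `prediction.segments` is the list to read. The helper name `speaking_time_per_speaker` is hypothetical and is not part of the Gladia API.

```python
import json
from collections import defaultdict

# Sample response copied from the example result, with keys quoted so that
# it parses as JSON (assumption: the API returns standard JSON).
SAMPLE_RESPONSE = """
{
  "prediction": {
    "labels": ["SPEAKER_00"],
    "segments": [
      {"start": 0.4978125, "end": 8.462812500000002, "label": "SPEAKER_00"}
    ]
  }
}
"""


def speaking_time_per_speaker(response: dict) -> dict:
    """Sum segment durations (end - start) for each speaker label.

    Hypothetical helper; durations are assumed to be in seconds.
    """
    totals = defaultdict(float)
    for segment in response["prediction"]["segments"]:
        totals[segment["label"]] += segment["end"] - segment["start"]
    return dict(totals)


if __name__ == "__main__":
    response = json.loads(SAMPLE_RESPONSE)
    for speaker, seconds in speaking_time_per_speaker(response).items():
        print(f"{speaker}: {seconds:.3f} s")
```

With more than one speaker in the audio, the same loop would accumulate a separate total for each label (for example `SPEAKER_00`, `SPEAKER_01`), which is a simple way to estimate how much each participant spoke.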

Test it live with your own audio below
