Speech-to-speech synthesis with Alex Serdiuk, CEO, Respeecher

Kane Simms
March 5, 2022
in Podcast


Emmy Award-winning Respeecher joins us to share the future of synthesised speech.

Supporting Ukraine

The VoiceLunch Foundation is taking donations to help support the VoiceLunch and voice technology community in Ukraine. VUX World has, of course, donated. I plead with you to donate too.

Donate here

Presented by Deepgram and Symbl.ai

Deepgram is a Speech Company whose goal is to have every voice heard and understood. We have revolutionized speech-to-text (STT) with an End-to-End Deep Learning platform. This AI architectural advantage means you don’t have to compromise on speed, accuracy, scalability, or cost to build the next big idea in voice. Our easy-to-use SDKs and APIs allow developers to quickly test and embed our STT solution into their voice products. For more information, visit:

See how easy it is to add simple but powerful call coaching and call tracking functionality to your customer experience solutions with Symbl.ai’s customizable Conversation Intelligence APIs. From calls to videos to text conversations, apply best-in-class contextual AI in no time by getting started for free.

AVAILABLE ON ALL PODCAST PLAYERS.

Apple podcasts | Spotify | YouTube | Overcast | CastBox | Spreaker | TuneIn | Breaker | Stitcher | PlayerFM | iHeartRadio

Speech-to-speech

Emmy Award-winning Respeecher is changing the speech synthesis game. Move over, TTS and SSML: enter speech-to-speech.

From voice preservation and accessibility to voiceovers and film studio work, the uses for speech-to-speech are endless.

Rather than programming machines to read text out loud, as most speech synthesis systems (text-to-speech) do, Respeecher uses deep learning to sonically reproduce voices at film-studio levels of fidelity. With Respeecher, a voice actor (or anyone) can simply speak and have their voice transformed into synthesised speech in near real time. It reproduces all of the character and delivery in the voice, so that the delivery of the resulting synthesised speech matches the original source. Whether you shout, whisper or sing, the speech-to-speech technology will replicate everything. It’s truly groundbreaking and has to be heard to be believed.

And you can hear it in the intro, as I introduce the episode using four different Respeecher voices.

Respeecher CEO, Alex Serdiuk, joins us to share more.

Timestamps

00:00 Intro and presenting Deepgram and Symbl.ai
04:05 Welcome Alex and closing the Ukraine airspace
08:40 About Respeecher
12:40 Sourcing voices
14:10 Nixon and winning an Emmy
17:37 Process of creating speech to speech
25:34 Limitations of TTS for long-form audio
29:00 The future of voice acting
34:25 Voice marketplace
38:00 Pricing of speech to speech voices
42:00 How to achieve higher quality voices
43:30 Accessibility
48:50 Endless use cases
51:00 Ethics
55:54 Outro and more information

Links

Learn more at https://respeecher.com
