888 Royalty-Free Audio Tracks for "Speech Synthesis"

00:00
02:27
w:Jingle Bells, Or The One Horse Open Sleigh. Sung by the w:Festival Speech Synthesis System's "singing-mode", using the default US voice . Piano accompaniment by w:TiMidity/freepats. Source: [1]
Author: Robotchoir
00:00
02:55
La Bandera de las Estrellas (The Star Spangled Banner), sung by the Festival speech synthesis system.
Author: Dwiakigle
00:00
04:14
God Save The Queen - Sung by the Festival Speech Synthesis System's "singing-mode", using the default US voice.
Author: Robotchoir at English Wikipedia
00:00
00:48
I made this sound sample in Fruity Loops 7 using the Fruity Granulizer to show off some of its capabilities as well as demonstrate Granular synthesis technology.
Author: Originally uploaded by Fontenot 1031 (Transferred by clusternote)
00:00
00:09
This file has no description, and may be lacking other information. Please provide a meaningful description of this file.
Author: Original uploader was Paulnasca at en.wikibooks
00:00
00:02
Work in progress speech synthesis speech, all drawn in photosounder. I've drawn the basic vocalisation by drawing horizontal lines in the 100 hz range using the white spray with the harmonics modifier, and modulated that using the dark spray to create the formants. I also added a couple of fricatives with the white spray, but it's incomplete.
Author: Photosounder
00:00
00:11
Output waveform from the assessed lab for speech processing (lasc11065). The wave was created using the festival speech synthesis system version 1. 96 beta (july 2004), with one of their own available voices (male, scottish english).
Author: Studentb
00:00
00:30
Daisy Bell sung by the DECtalk speech synthesizer. Using v4.61.02 for Windows. In G major at 200BPM, roughly. The input code is as follows: [:phoneme on] [d<40,27> ey<860> z<40,24> iy<860> d<40,20> ey<860> z<40,15> iy<860>] [g<40,17> ih<220> v<40> m<40,19> iy<260> yx<40,20> or<260> ae<300,17> en<300> s<40,20> rr<260> d<40,15> uw<860> _<900>] [ay<860,22> m<40> hx<40,27> ae<810,27> f<30> k<30,24> r<30> ey<860> z<40,20> iy<860>] [ao<200,17> el<100> f<40,19> or<260> dh<40,20> ax<260> l<40,22> uh<560> v<40,24> ax<260> v<40,22> yu<860> _<600,24>] [ih<260,24> tx<40> w<40,25> ow<180> n<40> t<40> b<40,24> iy<260> ax<260,22> s<40> t<40,27> ay<560> l<40,24> ih<220> sh<40> m<40,22> ae<60> ae<100,24> ae<100,22> r<40,20> ih<560> jh<40> _<560,22>] [ay<300,22> k<40,24> ae<520> en<40> t<40,20> ax<260> f<40,17> or<560> d<40,20> ax<260> k<40,17> ae<260> r<40,15> ih<560> jh<40> _<560>] [b<40,15> uh<220> tx<40> yu<560,20> d<40> l<40,24> uh<210> k<30> s<30,22> w<30> iy<260> t<40> _<560,20>] [aa<560,20> en<40> dh<40,24> ax<260> s<40,22> iy<260> t<40> _<260,22>] [ah<150,24> v<40,25> ax<110> b<40,27> ay<260> s<40,24> ih<260> k<40,20> el<260> b<40,22> ih<260> el<260> tx<40> f<40,15> or<250> t<10> t<40> uw<860,20> _<900>]
Author: JapanYoshi
00:00
00:02
Spanish language speech synthesis.
Author: Eerdman
00:00
00:04
Male speech synthesizer announcing train arrival on terminal "a". Dry recording with no effects.
Author: Deleted User
00:00
00:02
File recorded in goldwave, buzzer portion generated by formula, don't even ask me. . . Speech synthesizer used was innoetics john. File might be useful as part of a project or in a game, or audio production. Light flanger applied to speech. Slight reverb applied to entire project.
Author: Ironcross
00:00
08:42
President Roosevelt's Pearl Harbor Day message to joint session of Congress asking for a declaration of war with Japan. "The Star-Spangled Banner" is played on this recording after the speech. NARA claims the entire speech to be "Unrestricted"
Author: Recording: Bradley, John G. (John Grover), 1886-1974 (NARA record) Derivative work: Uploaded to Wikimedia Commons by W. Guy Finley.
00:00
00:09
Synthesis resulting from the transformation process performed with the harmonic plus stochastic model implemented in the sms-tools software package (http://github. Com/mtg/sms-tools) on the male speech sound http://freesound. Org/people/islijepcevic/sounds/329042/. The parameters used in the analysis are:window="hamming", m=1501, n=4096, t=-80, minsinedur=0. 1, nh=200, minf0=80, maxf0=200, f0et=7, hamrdevslope=0. 01, stocf=0. 4freqscaling = [0,1. 3,0. 82,1. 3,0. 825,1. 158,0. 99,1. 158,1. 0,1. 38,1. 1,1. 38,1. 12,1. 7,1. 2,1. 7,1. 21,1. 4592,1. 278,1. 73,1. 28,1. 5,1. 45,1. 8,1. 7,1. 8]freqstretching=[0,1,1,1]timescaling=[0,0,0. 6,0. 001,0. 67,0. 071,0. 83,2. 071,1. 0,2. 4043,1. 12,2. 7377,1. 2,4. 7377,1. 28,5. 4043,1. 45,8. 071,1. 7,8. 2]timbrepreservation=1.
Author: Islijepcevic
00:00
00:19
Speech enhancement clean signal for group 12.
Author: Dpsa
00:00
00:07
Before delivering a speech, you make sure to get attention. . . With tipping against your glass!.
Author: Trolln
00:00
00:40
This is for a class? a speech distorted through many filters. Edited in adobe audition and given to us by our teacher initially.
Author: Summermon
00:00
00:14
Processed german, talking about bandits.
Author: Mikesh
00:00
00:15
Robot speech synthesis you can use for for voice acting.
Author: Metkir
00:00
00:04
Frequency scaled output sound obtained by analysis and synthesis of sound speech-female_hpsmodel. Wav (https://www. Freesound. Org/people/anjds/sounds/377067/) with transformations model implemented in the sms-tools software package (http://github. Com/mtg/sms-tools)parameters considered are:-ifrequency scaling factors= [0,1,1,0. 6],ii) frequency stretching factors = [0,1,1,1],iii) timbre preservation = 1, iv) time scaling factors = [0, 0 ,1 ,1].
Author: Anjds
00:00
00:04
Synthesis output sound obtained by analysis and synthesis of sound speech-female. Wav (http://freesound. Org/people/xserra/sounds/317745/) with harmonic plus stochastic model implemented in the sms-tools software package (http://github. Com/mtg/sms-tools)parameters used in analysis are :- window type = blackman, window size(m) = 2019, fft size(n) = 2048, magnitude threshold in db(t) = -100, minimum duration of harmonic tracks = 0. 1, maximum number of harmonics = 100, minimum fundamental frequency = 80, maximum fundamental frequency = 300, maximum error in f0 detection algorithm = 5, max frequency deviation in harmonic tracks = 0. 01,stochastic approximation factor = 0. 7.
Author: Anjds
00:00
00:04
Residual output sound obtained by analysis and synthesis of sound speech-female. Wav (http://freesound. Org/people/xserra/sounds/317745/) with harmonic plus residual model implemented in the sms-tools software package (http://github. Com/mtg/sms-tools)parameters used in analysis are :- window type = blackman, window size(m) = 2019, fft size(n) = 2048, magnitude threshold in db(t) = -100, minimum duration of harmonic tracks = 0. 1, maximum number of harmonics = 100, minimum fundamental frequency = 80, maximum fundamental frequency = 300, maximum error in f0 detection algorithm = 5, max frequency deviation in harmonic tracks = 0. 01.
Author: Anjds
00:00
00:03
Synthesis of a kind of piano, made by a karplus-strong loop.
Author: Deegnot
00:00
00:11
Synthesis of a kind of piano, made by a karplus-strong loop.
Author: Deegnot
00:00
00:02
Synthesis of a kind of piano, made by a karplus-strong loop.
Author: Deegnot
00:00
00:15
Sound effect sample synthesised into audacity using lame plugins.
Author: Rosendojavier
00:00
13:33
Short dark ramble about the hardship of being a man.
Author: Cheesepuff
00:00
00:17
Describing how things are linked on the internet.
Author: Acquirk
00:00
01:00
First and last verses of edgar allen poe's "the raven" read to a click-track of 100 bpm. Recorded with a sennheiser mkh 416 into pro tools.
Author: Pinehadmz
00:00
01:51
Japanese conversation.
Author: Macdaddyno
00:00
00:32
Assignment 1 of "miracles of human language", a coursera course.
Author: Chengiz
00:00
01:10
Assignment 2 of "miracles of human language", a coursera course. This has three pretend conversations between friends, employee-boss, and neighbors. It illustrates politeness aspects of the language.
Author: Chengiz
00:00
00:02
Female voice saying "hi jim".
Author: Boater
00:00
00:05
A female voice saying testing.
Author: Crecord
00:00
00:04
Me coughing through a low-pass distorted filter.
Author: Carbilicon
00:00
54:32
Ghanasyam speaking on low self-esteem on january 14, 2014 at the bhakti center, new york.
Author: Dasosmi
00:00
00:03
This file is slower version of "when did you reach here".
Author: Pavan Cm
00:00
00:02
This is faster version of "when did you reach here?".
Author: Dpsa
00:00
00:03
Female voice saying "speech signal processing".
Author: Dpsa
00:00
00:01
/ai/ diphthong.
Author: Dpsa
00:00
00:06
Posted in reply to someone who wanted some dialogue recorded. Rode nt1, mackie mixer. 44. 1, 16 bit stereo.
Author: Improviz
00:00
00:14
Voz robot.
Author: Mialena
00:00
00:11
Voz robot.
Author: Mialena
00:00
00:10
Saying ''tuvalu''.
Author: Performansas
00:00
01:27
Collection of different human voices introducing original nitrate optical sound effects made for hollywood in the 1930s and 40s as digitized by craig smith. Sound effects extracted, these are mostly only the intro voices except for some small overlaps or tails. Sometimes it cuts off or on in the middle of a word but that's just what was available in the optical soundtracks. I plan to use these for a film but thought it might be fun for others to play with. Splice markers added in reaper, optimized for the make noise morphagene tape and microsound synthesizer, just rename the file to mg1 etc. , morphagene’s file-naming convention (but of course you don’t at all need the morphagene to use these).
Author: Exhapax
00:00
00:21
Many people talking.
Author: Stryker
00:00
00:33
Sound speech new.
Author: Ipercin
00:00
00:33
Sound speech new.
Author: Ipercin
00:00
00:19
Old movie talking vintage voices.
Author: Wesbtrm
00:00
00:43
Male voice using binaural plugin.
Author: Diegotinajero
1 - 50 of 888 Next page
/ 18