PlainTalk
PlainTalk is the collective name for several speech synthesis and speech recognition technologies developed by Apple Inc. In 1990, Apple invested a lot of work and money in speech recognition technology, hiring many researchers in the field. The result was "PlainTalk", released with the AV models in the Macintosh Quadra series from 1993. It was made a standard system component in System 7.1.2, and has since been shipped on all PowerPC and some other 68k Macintoshes.
Software
Speech synthesis
Technology
Apple's text-to-speech uses diphones. Compared to other methods of synthesizing speech, it is not very resource-intensive, but limits how natural the speech synthesis can be. American English and Spanish versions have been available, but since the advent of Mac OS X, Apple has shipped only American English voices, relying on third-party suppliers such as Acapela Group to supply voices for other languages.An application programming interface known as the Speech Manager enables third-party developers to use speech synthesis in their applications. There are various control sequences that can be used to fine-tune the intonation and rhythm. The volume, pitch and rate of the speech can be configured as well, allowing for singing.
Input to the synthesizer can be controlled explicitly using a special .
Original MacinTalk
The initial Macintosh text-to-speech engine, MacinTalk, was used by Apple in the 1984 introduction of the Macintosh in which the computer announced itself to the world. While it was incorporated into the Macintosh's operating system, it was not officially supported by Apple. MacinTalk was developed by Joseph Katz and Mark Barton who later founded which currently markets TTS engines for Windows, Linux and embedded platforms. Macintalk used direct access to the original Macintosh sound hardware and all attempts to license the source code by Apple to update it for newer Macs failed.MacinTalk 2
Eventually, Apple released a supported speech synthesis system, called MacinTalk 2. It supports any Macintosh running System Software 6.0.7 or later. It remained the recommended version for slower machines even after the release of MacinTalk 3 and Pro.MacinTalk 3, Pro
MacinTalk 3 introduced a great variety of voices. Apart from the standard adult voices "Ralph", "Fred" and "Kathy", and children's voices like "Princess" and "Junior", various novelty voices were included, like "Whisper", "Zarvox", "Cellos", "Albert", "Bells", "Boing", "Bubbles", and others.Each of these voices came with its own example text, that would be spoken when one hit the "Test" button in the Speech control panel. Some would just say their name, language and the version of MacinTalk they were introduced with. Others would say funny things, like "I sure like being inside this fancy computer", "I have a frog in my throat... No, I mean a real frog!", "We must rejoice in this morbid voice", or "The light you see at the end of the tunnel is the headlamp of a fast approaching train". These voices as well as their test texts are still in Mac OS X today.
With the increase in computing power that the AV Macs and PowerPC based Macintoshes provided, Apple could afford to increase the quality of the synthesis. MacinTalk 3 required a 33 MHz 68030 processor and MacinTalk Pro required a 68040 or better and at least 1 MB of RAM. Each synthesizer supported a different set of voices.
Text-to-speech in Mac OS X
Text-to-speech has been a part of every Mac OS X version. The Victoria voice was enhanced significantly in Mac OS X v10.3, and added as Vicki. Its size was almost 20 times greater, because of the higher-quality diphone samples used.A new, much more natural-sounding voice, called "Alex" has been added to the Mac text-to-speech roster with the release of Mac OS X 10.5 Leopard.
With Mac OS X 10.7 Lion, voices are available in additional U.S. English and other English accents, as well as 21 other languages.
The Speak selected text when key is pressed feature allows selected text from any application to be read via a key combination. From Mac OS X 10.1 to Mac OS X 10.6, the feature would copy the selected text to the clipboard and read it from there. From Mac OS X 10.7 to Mac OS X 10.10, a new implementation of the feature required software developers to implement a speech synthesis API into their applications. This prevented the clipboard from being overwritten, but also meant that, for applications that did not use the API, the feature would not function as expected, reading the title bar rather than the selected text.
In macOS Sierra 10.12, Siri was introduced for the Mac, however, the voice wasn't available as a System Voice, which meant that the Siri voices could be only used in Siri. Siri was made available as a System voice in macOS Catalina 10.15, so that it would work for any text. The Siri voices work in a completely different way and the command remains unable to use Siri.
Speech recognition
Apple hired many speech recognition researchers in 1990. After about a year, they demonstrated a technology codenamed Casper. It was released as part of the PlainTalk package in 1993. Although available for all PowerPC Macintoshes and AV 68k machines, it was not part of the default system install prior to Mac OS X, requiring the user to perform a custom OS installation to get speech recognition capabilities.In Mac OS X 10.7 Lion and earlier, Apple's speech recognition was voice-command oriented only, i.e. not intended for dictation. It can be configured to listen for commands when a hot key is pressed, after being addressed with an activation phrase such as "Computer", or "Macintosh", or without prompt. A graphical status monitor, often in the form of an animated character, provides visual and textual feedback about listening status, available commands and actions. It can also communicate back with the user using speech synthesis.
Early versions of the speech recognition provided full access to the menus. This support was later removed, since it required too many resources and made recognition less reliable, only to be re-added in Mac OS X 10.3 as a "universal access technology" called spoken user interface.
The user can launch items located in a special folder, called "Speakable Items", simply by speaking their name. Apple shipped a number of AppleScripts in this folder, but aliases, documents and folders can be opened in the same way.
Additional functionality is provided by individual applications. An application programming interface lets programs define and modify an available vocabulary. For example, the Finder provides a vocabulary for manipulating files and windows.
In OS X 10.8 Mountain Lion, Apple introduced “Dictation,” intended for general text. Originally, it required the sending of audio data to Apple servers for processing. In OS X 10.9 Mavericks, Apple added the option to download support for dictation without an Internet connection. As of OS X 10.9.3, eight languages are supported.
In popular culture
In radio
The MacinTalk speech synthesis can be heard in a few radio programmes:- Some of the radio stings in the BBC Radio 1 series Blue Jam use MacinTalk.
In music
- MacinTalk's "Vicki" speaks during the breaks in Studio Killers' "In Tokyo."
- "Repeating Yesterday" from As I Lay Dying's Shadows Are Security album used MacinTalk in the intro.
- "Satisfaction" by Benny Benassi, as well as other songs by him
- "Toby's Mac" by tobyMac
- MacinTalk's "Fred" is featured in the Radiohead songs "Paranoid Android" and "Fitter Happier", both off their 1997 album OK Computer. In the former song, Fred is used to provide background vocals, while in the latter he acts as the lead "vocalist."
- Fall Out Boy used MacinTalk at the beginning of "Grand Theft Autumn/Where Is Your Boy" and at the end of the last track and bonus tracks on their Infinity on High album
- The French band Air uses several Macintosh voices in their track "How Does it Make You Feel"
- "Yoshimi Battles the Pink Robots, Pt. 1" by The Flaming Lips features the "Zarvox" voice
- Many Aphex Twin tracks including "Funny Little Man" and "Cow Cud Is a Twin" also feature PlainTalk
- The song "John Orr the Arsonist" by And Then There Were None features a short monologue using this voice.
- "The Vic-E Interpretation - Interlude" from TLC's album FanMail features "Victoria" comparing and contrasting club culture in the US and Japan.
- The "Intro" from OutKast's Stankonia album features the "Bells" voice.
- The song "Pomme C" by French singer Calogero on the album Pomme C features a short monologue at the very beginning using the voice "Vicki"
- "DWYL" by Phil Joel
- The album Antichrist Superstar by the band Marilyn Manson makes repeated use of MacinTalk voices "Kathy", "Princess", and "Junior", often layered on top of each other. The album's title song ends with the MacinTalk voices repeating, "When you are suffering, know that I have betrayed you". MacinTalk voices are used again in the music video and live performances of "Antichrist Superstar". The MacinTalk voices repeat "You might as well kill yourself — you're already dead" at the end of the song. The song "Man That You Fear" also ends with MacinTalk voices saying, "When all of your wishes are granted, many of your dreams will be destroyed."
- Most of Mr Oizo's Transsexual EP and Lambs Anger album contains MacinTalk voices
- Korea Idol T-ara's song "Bo Peep Bo Peep" from the Absolute First Album introduction part 'Don't lose your temper so quickly'
- The El-P song "Stepfather Factory", from the 2002 album Fantastic Damage, ends with MacinTalk voices repeating, "Why are you making me hurt you? I love you".
- In EDM producer Skrillex's track titled "I Wish You All The Luck Of The World", the MacinTalk voice "Alex" can be heard repeating "David, I wish you all the luck in the world" several times in various pitches.
- Rob & Goldie's 1997 track "The Shadow " makes extensive use of the "Victoria" voice, reading a dictionary definition of "shadow" and a poem from the book Process: A Tomato Project.
- Jean-Michel Jarre's album "Métamorphoses" from 2000 uses MacinTalk to generate lyrics on the track "Love, Love, Love".
- The opening line of "Treasure" by Bruno Mars on the album Unorthodox Jukebox is the voice Alex saying, "Baby squirrel, you's a sexy motherfucker".
- In "Sometimes Things Get, Whatever" by deadmau5 on the album Random Album Title the looped line "Sometimes Things Get Complicated" is a combination of the "Ralph" and "Kathy" voices.
- German electronic act Monolake uses the PlainTalk "Whisper" voice in the track "Bicom" on their album Cinemascope.
- Plogue Art et Technologie's virtual synthesizer chipspeech emulates MacinTalk 1.0.
- I'm A Disco Dancer by Christopher Just uses various MacinTalk voices
- David James uses some MacinTalk voices in his song A Permanent State
In film
- The "Ralph" voice serves as the voice of the computerized autopilot AUTO in the 2008 Disney-Pixar film WALL-E. MacinTalk is credited as that voice in the film's end credits.
- In Blank Check, Preston used MacInTalk as the Voice of M.Macintosh to buy a house over the phone.
In television
- MacinTalk was used to perform the part of the wheelchair-using motor neurone disease patient with a voice synthesizer in the animated Family Guy episode "Ready, Willing and Disabled", who would later appear again in "Brian the Bachelor" and "Brian Goes Back to College". It was furthermore used in "Peterotica", when Stewie plays with "the speech function on his Macintosh", Using Talking Moose software, Stewie gets his laptop to say "Stewie is cool." It was also in "April in Quahog", where MacinTalk Fred voices Stephen Hawking.
- It was also used to introduce episodes on the anime Serial Experiments Lain.
- In the early years of Adult Swim, a pitched down "Junior" voice was used as the announcer for the former "Adult Swim Action" block.
- In the Transformers Animated episodes "TransWarped", "Decepticon Air" and "This is Why I Hate Machines", the Autobot Perceptor's voice was performed via MacinTalk speech generation, using the "Fred" voice.
- It was also used in the cartoon Whatever Happened to... Robot Jones? as the voice of Robot Jones' father, Dad Unit, and Robot in the first season.
- * The MacinTalk voice "Junior" was voicing Robot Jones in the 2000 episode of The Cartoon Cartoon Show, "Whatever Happened to... Robot Jones?".
- * MacinTalk's Junior voice would later return as Robot Jones in OK K.O.! Let's Be Heroes: Crossover Nexus.
- It was also used as the voice for the Red vs. Blue character, the lying A.I Gary/Gamma.
- The Mac OS X novelty voice "Boing" was used for the Robot face for Klasky Csupo.
- In one of Boomerang's older blocks Boomeraction, the voice used for the block was a low pitch version of the "Trinoids" novelty voice.
- In the Venture Bros episode "All This and Gargantua-2", the voice is used for the robots on the titular space station.
- It was used in Scrapheap Challenge Season 1 as the voice that announces the time remaining for the challenges.
- The "Junior" voice was used in the Wayside episode "Mad Hot."
- In one episode of Doki, the "Trinoids" voice is used in "To Catch A Fish."
- The "Junior" voice was used once in Hi Hi Puffy AmiYumi in the episode "Mean Machine".
- The "Fred" voice was used in the Space Ghost Coast to Coast episode "Warren", and episodes 1 to 14 of Wonder Showzen.
- The "Junior" or "Kathy" and "Fred" voices can be heard in the theme song for Pickle and Peanut.
In video games
- MacinTalk was used in the Japanese version of the game Killer7, as the voices of the Remnant Psyches.
- MacinTalk was used in the game No More Heroes. Before each boss fight, the "Whisper" voice would announce the current boss's name.
- The original 1999 release of The Silver Case used "Ralph" and "Boing" in its intro track.
- The video game Grand Theft Auto IV used "Vicki" as the voice of the DJ of the in-game radio station "The Journey".
- The 2009 video game Machinarium features several Macintalk voices in its ambient music.
- The 3DO version of Star Control 2 used "Bruce" as the voice of the Slylandro Probe. The game credits the voice of the Probe as 840AV. "Agnes" was also used as the voice of the Captain's ship's computer, heard on the first encounter with the Orz race.
- The Japanese and European versions of Sonic CD used MacinTalk voices in the background music of Metallic Madness' Bad Future stages.
- The video game Castle Crashers used the novelty voice Boing, for a character known as The Painter.
- The 2003 video game used "Albert" for vocals in its soundtrack.
In web videos
- MacinTalk voice "Zarvox" was used to voice the robot in the Smosh video "AWESOME NEW ROBOT!", and the "Fred" voice was used to voice Stephen Hawking in "15 HOUR ENERGY!"
- YouTuber Max Gilardi used various Macintalk voices for his videos.
- The Making Fiends web series had a version of the episode "Kitty Kitty Kitty", where the original audio was replaced with a computerized speech simulator.
- The opening for Nitro Rad has the voice saying the words "Nitro Rad."
Hardware