NEXTUP tECHNOLOGIES
TextAloud 2.0 User’s Manual
TextAloud 2.0 User’s Manual
ã
NextUp Technologies, LLC
NextUp.com
3.0 About
Text To Speech Voices
SAPI
– Speech Application Interface
4.0 Single
vs. Multi Article Mode
5.0 TextAloud
Main Window Overview
8.0 Bringing
Text into TextAloud
TextAloud’s
Automatic Clipboard Watching
Common
Clipboard Watching Setups
Actions
and Navigation during Speaking
Automatic
Voice Selection Settings
Changing
Pronunciation of Substrings
Pattern
Matching with Masks and Regular Expressions
Setting
Bit/Sample Rates for WAVE Files
Using
in Portable Audio Players (MP3 Players)
12.0 Internet
Explorer Plug-In
13.0 Working
with E-Books and other Large Text Files
15.0 Proofreading
with TextAloud
Changing
TextAloud Appearance through Skins
Customizing
Fonts and Colors of Text
Voice
Licensing and Allowed Usage
Volume,
School, and Site Licensing
18.0 Getting
More Information and Help
Welcome to the TextAloud 2.0 User’s Reference Manual. This document will help you get the most out
of TextAloud. TextAloud (formerly called
“TextAloud MP3”) 2.0 is a powerful text to speech (speech
synthesis) tool for computers running all versions of Windows 95 and later
(NOTE: Some voices and functions may not be available on Windows 95). TextAloud enables almost any text to be
listened to using your computer or portable audio devices. By reading this manual completely, you’ll
understand the power and features of TextAloud, allowing you to fully take
advantage of its capabilities.
This manual will reference command buttons and menu
actions available with TextAloud. When
referencing menu commands, the commands will be shown in bold using the
convention File->Exit for
example. This indicates you should
select File on TextAloud’s main
menu, then the Exit item on the
sub-menu that is displayed. Whenever
buttons are references, the convention <OK>
will be used. In this case,
indicating you click the button labeled OK.
Also, the term TTS will often be used as an abbreviation for Text To
Speech.
In addition to this manual, there are several other
information sources available to you:
·
http://www.nextup.com
– Our site provides product information and purchase links, as well as
downloads of the latest version of TextAloud.
·
support@nextup.com
– Email us at any time with any questions you have. You’ll find that we provide very quick and
helpful responses to all email questions.
·
http://nextup.com/mailinglist.html - Signup for the NextUp.com Monthly
Newsletter, as well as read back issues of our Newsletter. We typically provide a tip of the month for
getting more out of TextAloud, along with any info about updates and changes to
the product.
·
http://nextup.com/phpBB2/index.php
- Our User Forum is a gathering place for TextAloud enthusiasts. Many of our existing users are more than
willing to provide help regarding TextAloud and solving any problems or answer
questions you may have.
·
TextAloud includes a comprehensive help file
available via Help->Contents on
the TextAloud main menu, or by pressing F1
on your keyboard within most TextAloud windows.
This section provides a very quick overview of basic
TextAloud operation. More details on
everything discussed in this section are provided later in this manual.
TextAloud can be installed from purchased CD or via download from
http://nextup.com/TextAloud/download.html
If installing from download, visit the URL above and
download the TextAloud installer .exe file.
Running this file will guide you through installation.
If installing from CD, click the TextAloud entry on the CD
menu to install.
NOTE:
When installing a newer version of TextAloud onto a computer with TextAloud
already installed, you do not need to uninstall the older version first. TextAloud will automatically be updated and
previous settings retained. Also, if you
have purchased TextAloud, regardless of whether installing from CD or internet,
if you haven’t previously, you must enter the registration code you receive via
email after purchasing into the opening screen of the program to disable the
time limit of demo mode.
When you
first install TextAloud, the first thing to do is get it to speak a few words
just to make sure installation went well and everything is working. TextAloud
will be pre-loaded with a sample article. Simply click the <Speak> button, or select Speak->Speak Current Article on the main menu to get TextAloud
to talk. After this introductory article is spoken, you can move on to learning
more about how to use the program.
There are four main ways to load your text into TextAloud:
·
Clipboard
Watching – TextAloud can automatically monitor the Windows
clipboard. Anytime you copy text by
highlighting it and hitting Control-C, TextAloud can import the text. Select Options->TextAloud
Options->Clipboard Options for settings related to this.
·
File->Open
– By using this main menu entry, TextAloud can directly open text files and documents in .txt (plain text files), .rtf
(rich text files), .html (locally saved web documents), .doc (MS Word
Documents), and .pdf (Adobe Acrobat Files).
·
Using HotKeys –
See Options->HotKey Setup for a
variety of global HotKeys you can setup.
When working in any program, you can highlight text and press a HotKey
to have TextAloud speak or import text.
·
Manual
Creation – By right-clicking in the text area of TextAloud, then choosing New Article on the popup menu, or
choosing File->New Article on the
main menu, TextAloud will create a blank article, allowing you to type or paste
in text manually.
TextAloud now includes an Internet Explorer Plug-In.
After installing TextAloud, if you’ll restart Windows, you can then select
View->Toolbars->TextAloud in
Internet Explorer’s main menu, you can display the TextAloud toolbar within
Internet Explorer. This toolbar allows
you to directly speak text within Internet Explorer.
TextAloud supports a variety of voices for almost every
need. NextUp.com sells a wide variety of
voices to add personality and support for many languages. In particular, you should visit the pages
below to sample these recommended voices:
·
AT&T Natural Voices - http://www.nextup.com/attnv.html
·
NeoSpeech Voices - http://www.nextup.com/neospeech.html
·
Cepstral Voices - http://nextup.com/Cepstral.html
TextAloud can create audio files from your text. These files can be .wav, .mp3, or .WMA files.
To do this, set your output file options via Options->Voice and File Options->File Options. Then use the <Speak To File> button in TextAloud, or Speak->Speak All Articles to File on the main menu. Completed audio files will be placed in the
File Output Directory you specified, and will be ready for playback on your PC
or to move to your portable audio device or burn to an audio CD.
If you are upgrading from a previous version of TextAloud,
here are some of the exciting new features in TextAloud 2.0:
·
New easier to use interface.
·
Support for Skins to change TextAloud’s window appearance.
·
Import of Word, HTML, and PDF files.
·
Customization of article fonts and highlighting colors for improved
readability.
·
Change of voice speed, pitch, and volume on main window.
·
Improved support for screen readers used by sight-impaired
individuals.
·
Batch File Converter on the File Menu.
·
Support for changing voices within an article.
·
New Proofread HotKey for easy proofreading while writing documents or emails.
·
Bookmarks allow easy return to important sections of
text
·
Internet Explorer Plug-In.
·
Search/Replace added to Edit menu.
·
Ability to specify longer than normal pauses between sentences and paragraphs.
·
Creation of ID3 tags on MP3 files.
TextAloud supports a variety of voices used to turn text
into sound. Voice technology continues
to advance, but they all on some level work based on the same principles. A little background on how they work may be
helpful in understanding how to choose voices and how to get the clearest
possible speech.
All of the voices used by TextAloud are based on
concatenative synthesis.
Concatenative voices are developed by taking actual human voice samples,
and slicing them up into tiny fragments of sound. Then, based on the text being synthesized,
the computer tries to figure out which bits of sound to splice together to make
natural sounding words. .
These early voices would take incoming text, break each
word down to its phonemes, select the sound for these phonemes, concatenate
them together, to make words, then play that audio. This is the basis for all of the current good
voice engines.
However, all voices aren’t created equally. Languages and the way we use them are more
complicated than that. These
complications include:
·
Rise and fall of pitch depending on context (for example, pitch
sometimes rises at the end of questions)
·
Words may be written in identical ways but be
pronounced differently based on context (He read the book. I will read the book.)
·
Some words may not be in a voice engine’s
dictionary (a voice developed in 1990 may not know the word eBay or google).
·
Some text is made up of things other than actual
words (such as URLs, Acronyms, Roman Numerals, symbols)
There are other examples and issues beyond this description. The main point to understand is that each
voice engine will likely handle things differently in decoding text and
creating the sound. Many of the newer
engines do a much better job handling complicated context and more closely
generating natural sounding spoken words.
An additional factor in how natural sounding the voice
will be is the quality and quantity of sound samples used. Very small voice engines, such as those older
Microsoft and L&H voices take up only a couple of megabytes of disk space. This makes them great for downloading easily
and not requiring a lot of resources.
The trade-off is in the depth and richness of the sound they generate. On the other end of the spectrum, voices such
as AT&T Natural Voices include nearly 500 megabytes of sound
samples. This allows the program to do a
much better job approximating human speech with a large number of high-quality
samples to build from. The trade-off is
these require a great deal of disk space and more CPU power to create real-time
speech.
As with many aspects of computing using Windows computers,
Microsoft has created standard interfaces for speech. SAPI, or Microsoft’s Speech API, is Microsoft’s attempt to
create a standard way for multiple companies to develop voices and make them
available through a common programming method.
Normally users never need to know about these interfaces. However, there are two versions of SAPI, with
varying abilities, and with different
support from different engines. These
two versions are SAPI4 and
SAPI5. TextAloud
supports both, however, some engines and some features are only supported in
one or the other.
SAPI4 is
the earlier version of the standard. It
is supported by all of the voices created before 1999, and some created
later. SAPI4 will work on most Windows
computers. It provides little ability to
customize speech beyond TextAloud’s basic pronunciation editor. Voices most often used with TextAloud that
will support SAPI4 are the Microsoft voices Mary, Mike, and Sam, along with the older
L&H Voices. Some users may also have IBM ViaVoice voices
from other products that will be available under SAPI4. AT&T Natural Voices also support SAPI4
SAPI5 is
the newer standard. It is supported by
newer voices such as AT&T Natural Voices, NeoSpeech Voices, Cepstral, and other new
voices we will be releasing. SAPI5 adds
additional features not available in SAPI4. These include the ability to change voices
within a single article of text (for example, to create a script read by
multiple characters), the ability to insert pitch and speech changes into the text, and
additional specialized TAGs for manipulating speech further. A few features within TextAloud are only
available when you tell the program to use ONLY SAPI5 Voices. This change is made via Options->Voice and File Options.
These functions include the Advanced Pronunciation Editor and Inserting Voice Changes.
NOTE: One important thing to
note related to advanced SAPI5 features is that not all voices
support all SAPI5 options. For example, Neospeech voices do
not support the advanced pronunciation editor.
AT&T Voices do not support pitch adjustment or emphasis adjustment.
For more information on advanced speech manipulation using
SAPI5 XML Tags, you can download the SAPI documentation at
http://www.microsoft.com/speech/techinfo/apioverview/#_doc
Another important note is that installing TextAloud will
not automatically install both the sapi4 and sapi5 interfaces. Windows 98 ships with SAPI4. Windows 2000 and XP ship with SAPI5 already installed. If neither is available, TextAloud will warn
you and give you directions for installing one or the other. You can download and install SAPI4 and SAPI5
via
http://www.nextup.com/sapi.html
Audio Files are digital representations of analog
signals. When these sounds are
digitized, they are sampled and converted to numbers. The rate at which they are sampled is the
sample rate. That sounds a little complicated, and it
is. The only part you really need to
remember is that each voice engine has a natural sample rate. This rate is represented as a number, 8kHz,
11kHz, 16kHz, or 22kHz. For most uses
this sample rate doesn’t matter as long as you are happy with the sound you are
getting from the voice. There are a
couple of uses where this sample rate does matter though.
Telephony usage, where voices are played over the phone,
such as with voice menu systems or computer based answer machines, typically
require 8kHz voices. You’ll notice when
you purchase AT&T voices or Neospeech voices from NextUp.com, you’ll see an
option to purchase either 8kHz or 16kHz voices.
While 16kHz voices sound best on the computer, either 8kHz or 16kHz will
work fine when played within TextAloud.
However, voices purchased for telephony usage should be 8kHz.
The other area where sample rates become important is when
creating audio files for certain portable uses. See Creating Audio Files later in this
document for further discussion of this.
One thing to note when selecting voices is that sample
rate alone doesn’t define the quality of the
voice. For example, Microsoft voices are 22kHz, but don’t sound nearly as good as
16kHz AT&T voices. But, within a
given engine, higher Bitrates sound better.
22kHz Microsoft voices sound better than the 8kHz Microsoft voices. 16kHz AT&T voices sound better than the 8kHz
AT&T Voices.
Text To Speech Engines and Voices are available in a
variety of languages. In the following
sections when we detail commonly available voices, available languages will be
listed.
Each voice has a native language. TTS Voices and TextAloud will not do
translation. So for example, if you have
English text and use a Spanish voice, you will not hear the Spanish version
of that text. Likewise, voices do not do
a good job of reading a language other than the one it was designed for. Using English text with a French voice will not give you English with a French
accent, it will give you mostly gibberish.
Here are voices that are commonly available for use within
TextAloud:
·
Microsoft
Voices – Originally
licensed by Microsoft from Lernout & Hauspie (L&H), Mary, Mike, and Sam
are available free for download at the bottom of
http://www.nextup.com/TextAloud/SpeechEngine/voices.html
While these voices are older and do not have a natural sound, they do a decent
job with most text and are easily understood.
They are available in English only and take up less than 2mb per voice
and are available in SAPI4 or SAPI5 versions.
·
L&H
Voices –
These voices are of similar quality to the Microsoft voices, and are available for
download at the bottom of
http://www.nextup.com/TextAloud/SpeechEngine/voices.html
They take up less than 2mb per voice, support only SAPI4, and are available in the
following languages:
o
American English
o
British English
o
Dutch
o
French
o
German
o
Italian
o
Portuguese
o
Spanish
o
·
AT&T
Natural Voices – These are among the most natural sounding
voices around. You can hear samples of
AT&T voices at
http://www.nextup.com/attnv.html
and play with an interactive demo at
http://www.nextup.com/nvdemo.html
These voices are available for purchase at NextUp.com. They take up nearly 600mb per voice, support
both SAPI4 and SAPI5, and are available in the
following languages:
o
American English
o
British English
o
American English (Indian Accent)
o
German
o
Latin American Spanish
o
French
NOTE: AT&T Natural Voices use a significant amount of system memory.
We recommend 256mb or more of memory for these voices.
·
NeoSpeech
Voices – These are also among the most natural
sounding you will find. You can hear
samples of Neospeech voices at
http://www.nextup.com/neospeech.html
These voices are available for purchase at NextUp.com. They take up nearly 300mb per voice, support
SAPI4 Only, and
are available in American English.
·
Cepstral
Voices – These are high-quality voices with smaller
machine requirements. You can hear samples
of Cepstral voices at
http://www.nextup.com/Cepstral.html
These voices are available for purchase at NextUp.com. They take up around 25mb per voice, support
SAPI5 Only, are
currently available in American English and Scottish Accent English, a child’s English
Voice, with more languages are coming soon..
·
Other
Voices – NextUp.com is adding new voices often, so be sure to check with us
about newly available voices. In
addition to the voices listed above, TextAloud will attempt to load any other
SAPI4 or SAPI5 voices that may be available on your
computer. If you have other speech
products, you may have received other voices, such as those from IBM or
ScanSoft. If available, these voices
will be shown on the Engines/Voices TAB in TextAloud, available under Options->Voice and File Options. While we have tested with as many voices as
possible, if you find you have additional voices and they do not seem to work
properly in TextAloud, email us at support@nextup.com
for assistance.
NOTE: Anytime you
install a new voice, it will not immediately be available within
TextAloud. You must first fully exit
TextAloud via File->Exit, then restart the program before new voices will be
displayed.
A voice is assigned to
each article added to TextAloud. This voice can be changed via the Voice
Dropdown list on the main window. Which
voice is assigned is controlled by the settings for Voice Selection under Options->TextAloud
Options->Article Options. Voice
Assignment can be set to:
·
Default
– The default voice as selected under Options->Voice
and File Options is used.
·
Random
– A random voice is selected from the enable voices. Voices can be enabled and disabled under Options->Voice and File Options.
·
Round
Robin – Voices are rotated sequentially based on enabled voices.
Judging the quality of TTS Voices is very subjective. What sounds good to one user may not be intelligible
to another. TextAloud has many features
to allow you to improve the sound. The
most important thing you can do is upgrade to one or more of the higher-quality
voices listed above.
Additionally, TextAloud easily allows you to adjust the
speed, pitch and volume on voices with the sliders on the main window. (NOTE: AT&T Natural Voices do not support pitch adjustments). Most users, as they become accustomed to listening to TextAloud will gradually increase the
speed of speech to enable faster reading.
Also see later sections on the Basic and Advanced
pronunciation editors to assist with any words that the voices mispronounce.
Before jumping into details of using TextAloud, the
concepts of Articles and Single vs. Multi Article mode need to be understood.
An Article is a piece of text you have placed into
TextAloud to be read. It may be a single
news story, a single document, a single paragraph, whatever you want it to
be. The point is, it is dealt with as a
single entity within TextAloud. This
means it is a chunk of text that is dealt with as one and is given a single
title. Understanding this concept helps
understand the remainder of this document.
If you open a text file into TextAloud, that becomes a single
article. If you copy an entire news
story to TextAloud using the clipboard, that becomes a single article.
There are two distinct modes of operation in TextAloud,
Single Article Mode, and Multi Article Mode.
You can change Mode by choosing Options->Single
Article or Options->Multi Article. Most users will keep TextAloud in Multi Article mode as detailed in the next
section. However, there are situations where a user may want to use Single
Article Mode.
In Single Article Mode, you can only deal with one article at a
time. Anytime a new article is loaded,
the previous article is deleted. If you
typically deal with only one thing you may want to hear, then once it is done,
you are going off to look for something else, then single article mode may be
simpler.
One powerful use of Single Article Mode is by turning on Automatic Speaking. Under Options->TextAloud
Options->Article Options, if you check the Automatically Speak New Text, when in Single article mode,
TextAloud will automatically speak any newly opened articles or new articles
added via Clipboard Watching.
Place TextAloud in Multi Article Mode via Options->Multi
Article. In Multi Article Mode,
TextAloud can support an unlimited number of articles. Each article is automatically assigned a
title based on the first few words in the article. You can change this article title by
overwriting it in the Title Field. When
a new article is added, the new article is displayed, but all previous articles
are still available within TextAloud.
You can change to a previously added article using the Article List
dropdown at the top of the window.
As mentioned, most users prefer Multi Article Mode because it builds a queue of
articles. Then when ready, you can
listen to individual articles, your complete list of articles, or easily create
audio files from the entire list of articles. Based on the Automatically Delete Text After Play setting under Options->TextAloud Options->Article
Options, articles can be automatically removed after spoken.
The TextAloud main window is depicted below:

Numbers on the picture annotate key areas detailed as
follows:
1.
TextAloud
Main Menu – Provides menu items to all functions in TextAloud. Note that most menu items have shortcut keys,
and using Options->Shortcut Setup allows you to customize those
keyboard shortcuts.
2.
TextAloud
Toolbar – These icons provide quick access to commonly
used menu items. Pause your mouse over
an icon and a popup hint will provide a description of the icon’s function. You can adjust the size of the toolbar,
remove it completely, or remove captions via the View->Toolbar menu item.
The last two buttons on this icon provide information about TextAloud
for schools and Language support. These
are primarily marketing vehicles, and you may right-click these buttons and
choose Hide on the popup menu to remove these two icons.
3.
Article
List Dropdown – This is visible only in Multi Article Mode.
Click this dropdown list to see the full list of all articles currently
in TextAloud, then select an article title to load it into the main window.
4.
Article
Title – This field is the Title of the current article. Article titles are generated based on the
first few words of an article when the article is first loaded. You can change the title by typing in this
field. NOTE: The article title becomes
the filename when creating audio files.
5.
Voice
Dropdown – The voice dropdown shows the voice selected for the current
article. You can click this field to
select a different voice from enabled voices.
When you change the voice, the speed and pitch settings on the right of the screen will be
adjusted to the saved values for that voice.
6.
TextAloud
Text Area – This area is where text of the current article is
displayed. When manually creating an
article, type or paste in this area.
While the article is being spoken, the word currently being spoken will
be highlighted in this area (assuming word highlighting is turned on under Options->TextAloud Options->Miscellaneous). You may edit text in this area, and several
actions are available on a popup menu when you right-click in this area.
7.
Status
Bar – The status bar at the bottom of the window will often provide
helpful status information such as the number of current articles or progress
indicators during speaking.
8.
Pitch
Slider – This area contains a slider to allow easy adjustment of the pitch for the current voice. Note that some voices do not support pitch
adjustments, as shown in the figure above.
Changing a value here will adjust the default pitch for the current
voice, so the new value will be used anytime that voice is selected.
9.
Speed
Slider – Use this slider to speed up or slow down speaking when using the
current voice. All voices support speed
adjustments. You will notice the ranges
shown will vary based on voice and SAPI version they support. Changing a value here will adjust the default
speed for the current voice.
10.
Volume
Slider – The volume slider controls the WAVE volume for the Windows
mixer. This is the same volume that is
shown if you bring up the Windows mixer by double-clicking the speaker icon in
the system tray area near the clock on the Windows start
bar. Changes here are not voice specific
but change Windows volume settings for most audio programs.
11.
<Speak>
Button – Click this button to have the current article spoken aloud. This has the same action as clicking Speak->Speak Current Article.
12.
<Speak
To File> Button – Click this button to create an audio file from the
currently selected article.
13.
<Delete>
Button – Deletes the current article.
Same action as Edit->Delete
Selected Article.
14.
Text
Scrollbar – If the current article has more text than can be shown within
the TextAloud window, this scrollbar will appear and can be used to navigate up
and down the text of the article.
15.
Multi
Article Scrollbar – When TextAloud has more than
one article, use this scrollbar to navigate to different articles. This is useful to move to the next or
previous article, or a quick method to the first or last. To move to a particular article, using the
Article List Dropdown is easier since you can identify the articles by title.
TextAloud includes a smaller, floating toolbar which can
be used to manage speaking when the main TextAloud window is not shown.

You can adjust the size of the icons as well as remove
captions via Options->TextAloud
Options->Miscellaneous. There are
also settings there to dictate when the floating toolbar will be displayed, and
whether the toolbar should always stay on top of other windows to keep it
available for use. Choices for when the
toolbar is displayed are:
·
On
Request – Toolbar is displayed only when TextAloud is minimized
and you right-click the TextAloud system tray icon, and then choose Show Toolbar from the
menu.
·
When
Minimized – Toolbar is displayed anytime TextAloud is minimized.
·
When
Minimized and Reading Starts – Only shown if speaking while TextAloud is
minimized.
While use of the floating toolbar isn’t vital for
TextAloud use, some users find it very helpful to manage playback of articles
while leaving most of your desktop free for other work.
The Windows system tray is the area on the Windows start bar near the
system clock. Programs that are running
frequently place an icon here to provide you quick access to the program and
its functions. Anytime TextAloud is
running, it places a small TextAloud icon in the system tray, recognizable as a
red T and a speaker icon.
You can double-click this icon at any time to bring the
TextAloud main window open and to the top of other windows. By right-clicking this icon, a TextAloud menu
will open, providing access to several functions including:
·
Start, Pause, Stop Speaking
·
TextAloud Options – Under this entry is a convenient
choice to turn on and off clipboard watching.
·
Exiting TextAloud
·
Showing the Floating Toolbar
Via Options->TextAloud
Options->Miscellaneous there are two settings related to running
TextAloud in the system tray.
·
The checkbox Start TextAloud on Windows Login can be set to cause TextAloud to
automatically start and run in the system tray when you login to Windows. This will make sure it is always available,
which is important if you often use with Clipboard Watching or HotKeys from within other programs without
interacting with the TextAloud main window first.
·
The checkbox TextAloud resides in System Tray when window closed. When this is checked, if you close TextAloud
by clicking the ‘X’ in the upper-right corner of TextAloud, the program will
not exit, but will simply close the main window and remain in the system tray. If this choice is not checked, clicking the
‘X’ will cause TextAloud to completely exit.
TextAloud offers a variety of methods for creating
articles from various text sources:
Using File->Open
on TextAloud’s main menu, you can directly open various file types. The selected file will be imported into
TextAloud as a new article. You may also
select multiple files if in Multi Article Mode to import multiple articles at one
time. The following file types can be
directly opened:
·
Text
Files (*.txt) – Article(s) are created from one or more selected .txt
files. Article Title will be the
filename.
·
Rich Text
Files (*.rtf) – These are more advanced text files that include some additional text formatting.
·
HTML Files (*.html) – HTML files are the files
that make up web pages. If you save an
HTML file locally, for example using File->Save As within Internet Explorer, TextAloud can import that
file as an article.
·
Microsoft
Word Files (*.doc) – These are documents from
the most popular word processor.
·
Adobe
Acrobat Files (*.pdf) – Popular publishing file
format. One item to note with Adobe
Acrobat files is that some PDF files have been copy protected by the author,
preventing other programs including TextAloud from extracting text. For documents like these, a specialized
converter may be required, such as SolidPDF listed at
http://www.nextup.com/partners.html
By default, TextAloud has a document size limit on how much text will be displayed within the
TextAloud window. If you open a very
large document, you may see text within the TextAloud window noting that the
document was too large to import directly into TextAloud, and that it will be
referenced in the external file. While
this message appears to be an error message, this is simply a possibly
over-cautious protection included in TextAloud to prevent huge documents from
taking too much of Windows resources.
The document will still be spoken as other articles, but text will not
be highlighted within the TextAloud window.
This protection mechanism is primarily included within
TextAloud to protect older Windows systems.
NextUP.com support can provide you with instructions to turn off this
limit, so contact support@nextup.com if
interested.
Microsoft Windows has a
feature called the Windows clipboard. This is a global shared memory area where
text or other objects can be copied to, then used in other programs. You’ve likely used this even if you didn’t
realize it. Copying text to the Windows
clipboard can be done from within most programs that allow viewing of text.
Copying text to the Windows
clipboard involves first highlighting the text of interest, then performing the
actual copy to the Windows clipboard.
Highlighting text is accomplished by any of the following:
·
Holding left mouse button down while moving
mouse over the text of interest, then releasing mouse button once selection has
been highlighted
·
Within programs that allow text editing, by
holding down the Shift button while using arrow keys to move the character
cursor.
·
By choosing Edit->Select All on the main menu
of programs that offer this option.
Once selected text is
highlighted, it can be copied to the Windows clipboard by one of the following
methods:
·
Right-Clicking and choosing Copy from the popup
menu within programs that offer this option.
·
Choosing Edit->Copy from the main menu of
programs that offer this option.
·
Holding down the Control Key on your keyboard
then pressing C (Control-C).
Once this copy command
has been executed, the selected text is in the global Windows clipboard,
meaning it can be used in other programs by “Pasting” the text into that
program (Edit->Paste or Control-V).
To assist you in importing text of interest into
TextAloud, TextAloud includes a feature called Clipboard Watching. When Clipboard Watching is turned on,
TextAloud monitors the Windows clipboard for changes, and can automatically speak
or import the copied text into TextAloud, or can pop up a window to give you
options on what to do with the text.
TextAloud’s Clipboard Watching is a very flexible system to give you control
over how the program behaves when text is copied to the clipboard. Settings that control this behavior are
available via
Options->TextAloud
Options->Clipboard. This settings
window is shown below:

Use this window to control the behavior of clipboard
watching:
·
Watch
Clipboard – When checked, TextAloud will monitor the Windows
clipboard.
·
Ignore
Clipboard Text less than XX characters – If you use the Windows clipboard
often for other tasks, you may want TextAloud to ignore smaller amounts of
text. For example, if you typically only
use TextAloud for longer news stories, you might set this value to 200, and any
clipboard copies of less than 200 characters will be ignored by TextAloud. TextAloud will only act on copies of more
than 200 characters.
·
Ignore
Clipboard Updates if text has not changed – Normally this checkbox should
be checked. Some Microsoft programs,
including Internet Explorer, have bugs that cause them to
send duplicate clipboard messages often.
For example, each time you close an internet explorer window, the Windows
clipboard will act as if a program has copied text to the clipboard, even
though the contents of the clipboard will not change. Checking this will cause TextAloud to
completely ignore duplicate copies.
However, a side-effect of this is if you manually copy identical text to
the clipboard twice in a row, TextAloud will ignore the second copy.
·
Prompt
Options Section – TextAloud can be set to either automatically import
copied text as a new article, or to open the Clipboard Prompt Window to give you a choice of actions to
perform on the text. The Clipboard Prompt Window is described in
detail below.
·
Prompt to
Copy from Clipboard – When this is checked, the Clipboard Prompt Window will be displayed anytime new text is
copied to the Windows clipboard. When
unchecked, the prompt will not be displayed and the Default Action will be performed automatically.
·
Display
Prompt for XX Seconds – The Clipboard
Prompt Window is displayed as a popup window,
showing on top of other windows after text is copied to the clipboard. Using this setting you can specify how long
the window should be displayed. For
example, if you specify 3 seconds, the window will be displayed for 3
seconds. This gives you 3 seconds to
click a button to specify what you want to do with the text. If you do not click within 3 seconds, the
window will close and the Default Action
will be executed. If you need longer to decide
what to do, increase this time so the window will be displayed longer.
·
Default
Action – Your selection here determines what will be done with clipboard
text if you do not have prompting turned on, or if you do not select an action
before the Clipboard Prompt Window disappears. Options for this default action are:
o New – Automatically import the
clipboard text into TextAloud as a new article.
If in Single Article Mode and
Automatic Speaking is turned on, automatically speak the text.
o Append – Automatically append the
clipboard text to the bottom of the current article.
o Cancel – Do nothing with the clipboard
text.
o Remember Last – Automatically perform
whatever action was chosen the last time you actually clicked a button on the Clipboard Prompt Window.
As mentioned above, if Clipboard Prompting is turned on,
TextAloud will display the following clipboard window:

Items on this window are:
·
<New>
- Click the New button to import the clipboard text into TextAloud as a new
article.
·
<Append>
- Click the Append button to append the clipboard text to the bottom of the
current article. This is useful for
example if you are copying a news story from Internet Explorer, but you bring it in as two
separate copies in order to skip over some advertisement text.
·
<Cancel>
- Click Cancel to do nothing with the text.
·
Hide
Details – Clicking the Hide Details link will close the lower section of
the window, providing a smaller popup window in the future which displays only
the action buttons.
·
Watch Clipboard,
Ignore Clipboard Text…, and Display Prompt… - These are identical to those
items in the Clipboard Settings window from above.
·
Status
Bar – The status bar at the bottom of the window has two
sections. The left side reminds you whether
TextAloud is in Single or Multi Article mode.
The right side of the panel, where 2
is displayed in this example, is a counter that counts down the number of
seconds before the window will close automatically and the default action will
be taken.
For most users, one of two clipboard setups will be most
commonly used.
For users who use Single Article Mode and want the clipboard typically spoken,
use:
·
Clipboard Watching On
·
Ignore less than 100 characters
·
Prompting Off
·
Default Action of New
·
Automatically Speak under Article Options
checked.
This will cause TextAloud to automatically speak any text
over 100 characters that is copied to the clipboard.
For users who typically use TextAloud to clip articles
that they will listen to after compiling several articles, use:
·
Clipboard Watching On
·
Ignore less than 200 characters
·
Prompting On
·
Default Action of Cancel
With these settings, when you copy more than 200
characters, TextAloud will give you the option to import as new or append, or
to cancel and do nothing with the text.
Note that when doing other work that may use the clipboard often, you
can easily turn of clipboard watching by right-clicking the TextAloud system
tray icon, choosing Options, and unchecking Watch
Clipboard.
It is also important to note that clipboard watching is
often too intrusive for some users who often use the Windows clipboard when
performing tasks unrelated to TextAloud.
For these users we recommend that clipboard watching be turned off, and
instead HotKeys be used for dealing with text from other programs in
TextAloud. HotKeys will be detailed in a
later section.
In addition to the more automatic methods of creating new
articles in TextAloud, you may also manually create articles. To do this, click File->New Article on the main menu. This will create a blank article. A default title will be generated but can be
overwritten in the Title edit box.
Next, within the Text Area, you can manually type in
contents of the article. In addition you
can use Cut, Copy, and Paste within this area similar to the way you can within
any text editing program.
You can even save the text you create as an external text
file by choosing File->Save Article
As->Text File.
HotKey’s are keyboard combinations that you can setup via Options->HotKey Setup. A HotKey is somewhat like a global shortcut
key. As long as TextAloud is running,
even if the TextAloud window isn’t displayed but is minimized, TextAloud will
watch for this keyboard sequence, and can then take an action. More details on all of TextAloud’s HotKey
functionality is detailed in the HotKey Reference section, but the Import New HotKey in particular provides a convenient way to
import text into TextAloud.
Via Options->HotKey
Setup, define a key combination for the Import New HotKey. Choosing something obscure, such as
Control-Alt-Shift-N insures you select a sequence that isn’t already in use by
another program. Once this is defined,
you can work within any program, highlight some text, then press
Control-Alt-Shift-N, and the highlighted text will automatically be imported
into TextAloud as a new article.
Using this Import New HotKey gives you an easy way to create new articles
while working in Documents, Email, or other text programs without having to use
Clipboard Watching, which can sometimes become
annoying.
More details on the Internet Explorer Plug-In are provided in a later section. After installing TextAloud 2.0, then
restarting Windows, the TextAloud toolbar will be available within Internet
Explorer via View->Toolbars->TextAloud.
The 3rd icon in this toolbar looks like a
printed document with a folded corner.
This is the Create TextAloud
Article from Browser Text button.
When viewing a web page within Internet Explorer, if you select text within
the browser window, then click this button, a new TextAloud article will be
created containing the highlighted text.
If no text is selected within Internet Explorer, then pressing the
Create button will create a new article in TextAloud containing all of the text
from the displayed web page.
TextAloud offers several methods to start speaking.
To start listening to the current article, click Speak->Speak Current Article, or
click the <Speak> button on
the right side of the window (just above the <Speak To File> button).
If in Single Article Mode,
clicking the <Speak> toolbar
button will have this same effect.
If in Multi Article Mode, you can start listening to all
articles by clicking Speak->All
Articles Aloud or clicking the <Speak>
toolbar button. TextAloud will switch to
the first article and begin speaking.
After an article has completed, it will optionally be deleted
automatically (based on settings under Options->TextAloud
Options->Article Options) and then speaking will continue with the next
article in your article list, until all articles have been spoken.
If Word Highlighting is turned on, as you hear a word
spoken, it will be highlighted within the text area. Word highlighting can be enabled/disabled via
Options->TextAloud Options->Miscellaneous. You’ll also notice status updates shown on
the TextAloud Status Bar showing information about voice initialization,
progress through the article lists as well as progress indicating % completion
for the current article.
In addition to reading full article text from the
beginning, additional options are available:
·
Speak->Speak
From Cursor Aloud or right-click in the text area and choose Speak from Cursor Aloud to start
speaking at the current cursor location instead of from the start. So if you want to skip some portion of text,
click to the spot where you want to start speaking, and then choose Speak From
Cursor Aloud
·
You may also speak only a portion of an
article. Highlight a portion of text
within the current article, then choose Speak->Speak
Selected Text Aloud or right-click and choose Speak Selected Text Aloud.
While TextAloud is actively speaking aloud, there are
several actions available:
·
Stop
– To stop speaking click
o Speak->Stop on the main menu.
o The
<Stop> toolbar icon.
o The
<Stop> icon on the floating
toolbar.
o Choosing
Stop on the menu displayed by
right-clicking on the TextAloud System Tray Icon.
·
Pause/Resume
– To pause speaking and keep cursor in place to resume at the same spot click
o Speak->Pause on the main menu.
o The
<Pause> toolbar icon.
o The
<Pause> icon on the floating
toolbar.
o Choosing
Pause on the menu displayed by
right-clicking on the TextAloud System Tray Icon.
NOTE: Once speaking has been paused, the Pause items above will
change to Resume. Click any of those to resume speaking.
·
Skip to
Next Article – While in Multi Article Mode and Speaking All Articles, you can skip
the current article and start speaking the next article by choosing
o Speak->Skip to Next Article on the
main menu.
o The
<Skip> toolbar icon.
o The
<Skip> icon on the floating
toolbar.
o Choosing
Skip on the menu displayed by
right-clicking on the TextAloud System Tray Icon.
NOTE: If Automatically Delete is
turned on under Options->TextAloud
Options->Article Options, when you skip an article it will automatically
be deleted.
·
Skip to
Previous Article – While in Multi Article Mode and Speaking All Article, if Automatic
Delete is not on, you may skip to next article with the commands above, and you
may skip back to a previous article using
Speak->Skip To Previous
Article.
NOTE: Skip to Next and
Previous can also be used when TextAloud is not speaking to simply switch
between multiple articles when in Multi Article Mode.
·
Skipping
Forward and Back Within a single article – While text is speaking, there
are several menu options under Speak->Skip
Forward and Speak->Skip Back
allowing you to skip forward and back by line, paragraph, or a set number of
lines. These menu items by default do
not have shortcut keys defined, but using Options->Shortcut Setup you can define shortcut keys to
allow you to easily skip back and forth as desired during speaking.
Text To Speech is an imperfect science. Additionally, because each listener has
different listening skills and preferences, judging the quality of speech is a
very personal and subjective process.
TextAloud provides dozens of features and tools to help you customize
the speech in order to customize and improve speech.
As discussed in the About
Text To Speech Voices section, TextAloud supports a wide variety of
voices. Review that section and
mentioned voices to insure the voices you have on your computers are the voices
that sound best to you.
TextAloud provides flexible options for selecting
voices. Some users find a single voice
they enjoy most and want each article to be spoken by that voice. Some users find that having each article
spoken with a different voice makes the process less monotonous, much like when
listening to a news cast, anchors take turns reading stories.
The first step in setting up your voice preferences is
reviewing available voices. By going to Options->Voice and File Options
you’ll see all available voices.

SAPI Version NOTE: As discussed in other
sections of this document, some features of TextAloud are available only with
SAPI5 Voices,
while some voices are only available under SAPI4. The SAPI Version dropdown at the top of this
window selects whether you use SAPI4, SAPI5, or Both types of voices. Normally this selection should be set to Both
unless you want to use functions specified for SAPI 5 Only.
From this Engines/Voices window, you will see folders
listed for each voice company. By expanding
the folder, as the AT&T folder is expanded above, you’ll see each available
voice. By clicking a voice, you will
have options to:
·
Disable/Enable
– Enabling a voice makes it available for use within TextAloud.
·
Set
Default – Sets a voice as the default voice. If your voice selection settings under
Article Options is set for “Default”, this voice will always be used. Additionally, the Default voice is always
used for the Speak HotKey detailed in the HotKey Reference section.
·
Test
– plays a test message using the voice.
·
Reset
Defaults – Resets Pitch, Speed, and Volume sliders to their default values
for a given voice.
·
Pitch –
Adjusts the pitch of the
voice (note that AT&T Natural Voices do not support pitch adjustments).
·
Speed
– Adjusts the speed at which
the voice speaks.
·
Volume
– Adjusts the volume of this voice.
Note that this volume is different than the Windows mixer volume slider
available on the TextAloud main window.
This volume setting also affects volume of the voice when creating audio
files.
Note that you may also edit the Pitch, Speed, and Volume for
a voice by right-clicking in the text area of TextAloud, and choosing Edit
Voice.
The main use of this window is to enable only those voices
you wish to use, disabling all other voices.
When TextAloud creates a new article, a voice is
automatically assigned to that article.
This is the voice shown on the Voice Dropdown when the article is
selected in the main TextAloud window and is the voice that will be used to
speak the article unless it is manually changed via that Voice Dropdown.
Via Article Options at Options->TextAloud Options->Article Options you can set how
you want voices to be selected and assigned to new articles. The options here are:
·
Default
– The voice you’ve selected as the Default voice will be automatically assigned
to new articles.
·
Random –
A random voice from the pool of enabled voices will be selected and assigned to
new articles.
·
Round
Robin – Voices will be selected and assigned to new articles
sequentially. For example, if you had
Mary, Crystal, and Kate enabled, and you imported 6 new articles, voices might
be assigned in this order: Mary, Crystal, Kate, Mary, Crystal, Kate …
As noted, these settings are for the voice assignment when
articles are created. You can change the
actual voice to be used at any time manually using the Voice Dropdown.
(NOTE:
discussions related to Pitch do not apply to AT&T Natural Voices, which do not support Pitch
adjustments).
TextAloud allows you to adjust the speed and pitch of speaking.
On the main TextAloud window, sliders are provided to adjust both speed
and pitch.
Move the pitch indicator to the right to increase pitch (make
the voice sound higher) or to the left to decrease pitch (make the voice sound
lower). With some voices enough
adjustment may be available to even change the gender sound of the voice, or
raise it enough to make a voice sound like a child. The amount of adjustment supported varies by
each voice so experiment with these.
Speed can also be increased or decreased. When first starting to use TextAloud, some
users find the voices more easily understood by slowing down the rate of speech
slightly. Then, as you grow accustomed
to the voices, you can increase the speed to absorb information more quickly.
These adjustments may be made while speaking is taking
place. You’ll likely notice a small
pause after adjustments, while TextAloud readies the voices for the new
settings.
TextAloud automatically saves speed and pitch settings for each voice in the Windows
registry. This means that when using a
particular voice if you change these settings, these new settings are saved and
will be in effect again the next time the voice is used.
There are two separate volume adjustments available within TextAloud. Each voice has a volume setting. This volume affects the loudness of the voice
whether used aloud or to audio file, and can be thought of somewhat as a
relative volume setting, where you are specifying a volume relative to other
voice volumes. This is the volume
setting discussed, in the Reviewing
Available Voices section.
Additionally, when speaking aloud, the volume level you will hear is affected by:
·
Volume of the volume adjustment on your speakers
·
Master Volume setting on the Windows mixer
·
WAVE Volume setting on the Windows mixer
The Windows Mixer is the volume control for Windows. You can open this dialog typically by
double-clicking the speaker icon in the Windows system tray, near the clock on the start
bar. If you have problems with volume
being too low, or hearing no sound at all, be sure to turn up these volumes and
make sure none of the Mute checkboxes are checked.
On the main TextAloud window, there is a volume control.
This corresponds to the WAVE Volume setting on the Windows mixer, and
gives you some immediate control over the system volume.
Each voice engine has its own characteristics on how long
it pauses for certain types of punctuations, between
sentences, and between paragraphs. There
are times when you would like to have a slightly longer pause at certain spots
in playback of audio.
Via Options->TextAloud
Options->Predefined Pauses, you can specify longer than normal pauses for end of sentences or end of
paragraphs. Entering values of 0 seconds
uses the default values for the voice being used. By putting a other values here, you can
extend the amount of pause. For example,
putting 1.5 into the Sentence Pause Interval causes TextAloud to pause an additional 1.5 seconds at the end of
each sentence.
You may also manually insert pauses at any spot within text. By right-clicking at a spot in the text, a popup menu will be displayed. From there, choose Insert Pause, and you’ll
see several pre-defined pause intervals to choose from. Choosing an interval will insert a Pause TAG. For example, if you choose 0.5 Seconds, the
Pause tag that will be inserted is {{Pause=0.5}}
While speaking text, when TextAloud encounters this pause
tag while speaking, there will be ½ second of silence at that spot. You may also manually insert pause TAGs of
any value by inserting the text {{Pause=x.y}} where x.y is the amount of time
in seconds to pause.
Speech engines are designed to work primarily with
standard text. When used with some types
of text, such as email and other documents that may have symbols, non-standard punctuation,
or other characters not typically meant to be spoken, some speech engines will
speak these characters. This can often
be very distracting and make the real text of interest difficult to listen
to.
Under Options->TextAloud
Options->Miscellaneous there is a checkbox for Filter non-alpha symbols. With this checked, some characters that
aren’t typically part of speech will be filtered out by TextAloud to prevent
them from being spoken. The characters
typically filtered out by this setting are:
[ ] ( ) | {
} ^ * _ ` ~ :
along with small and large bullet characters.
Each voice has its own characteristics and quirks. The newer voices pronounce most words
correctly, but no voice gets everything perfect. TextAloud includes two types of pronunciation
editors available for you to use to correct any words that you find are often
mispronounced.
Found via Options->Pronunciation Editor, TextAloud’s basic pronunciation
editor allows you to correct most types of pronunciation mistakes. The basic editor creates entries that apply
to all voices, and works on a “sounds like” spelling of words. This means the basic editor does not specify
actual phonemes (as discussed in the How
Text To Speech Voices Work section).
This “sounds like” spelling of words is often much simpler to use than
other types of pronunciation editing.
The basic edits work by replacing a word in the text with
its defined Pronunciation Spelling.
For example, some voice engines attempt to convert abbreviations to
their full words. In a case like this,
the word “Jan” in a sentence might refer to a girl named Jan, but the speech
engine may pronounce it “January”. To
disable this behavior, you create a new entry for the word Jan and have it
pronounced as “Jann”. This substitution
tricks the voice engine into saying the word as you’d like it pronounced rather
than trying to expand an abbreviation.
Another example is the term “NextUp.com”. Some voices are unable to speak this
correctly, seeing NextUp as one work and running it together, seeing the period
as the end of a sentence, and unsure what to do with “com”. Using the basic editor we can define a better
pronunciation by defining “NextUp.com” to be spoken as
“next up dot com”.
The Basic Editor is shown below:

·
Defined
Words List – On the left side of the window, the list of words you have
defined in the editor is a two column list with column headers of Word and
Pronounced As. To Edit an existing word
definition, click the entry, make changes in the Pronunciation field, then click Apply.
·
Word
– The word you are correcting.
·
Pronunciation –
The new way you’d like the word to be pronounced. This basically is what is substituted for Word when it is sent to the speech
engine.
·
Case
Sensitive – Check this box if Word
is case sensitive. For example if you
want a substitution to happen for “OK” but not for “ok”, then Word would be
“OK” and case sensitive would be checked.
·
Test
Voice – This selects the voice to be used when you click the Test
Button. The Text button will speak the
new pronunciation of the word. NOTE: Because
voice engines sometimes choose pronunciation based on the context within a
sentence, there are cases where a word will sound fine within this test, but be
pronounced differently when read with full sentences.
·
New –
Click this to start defining a new pronunciation edit. This basically clears the fields above.
·
Apply –
Save the pronunciation edit you have defined above.
·
Delete –
Delete the currently selected pronunciation edit from your list of defined
words.
·
Done –
Close this window.
The most common method for identifying mispronunciations
you’d like to correct is hearing them while listening to an article. When listening to text and you hear a word
you’d like to correct, simply click stop, highlight the mispronounced word,
then right-click and choose Pronunciation Editor from the popup menu. This will open the basic editor and preload
the mispronounced word into the Word field to speed definition of the new edit.
By default, entries in the Basic Pronunciation Editor
are applied only when text in the "Word" field actually occurs on a
word boundary. Substitutions are not performed on text that appears as a
substring of another word.
For example, entering an
alternative pronunciation for the word "day" changes the
pronunciation of "day" everywhere it appears as a word, but not in
words like "today", "yesterday", etc.
To change the
substitution of words that appear as substrings, use the & character.
The placement of the & determines whether leading characters,
trailing characters, or both should allow substitutions.
In the table below, a
"Yes" in the column indicates that a Basic Pronunciation Editor
substitution will be performed for the word.
|
|
Word Field of Pronunciation Editor |
|||
|
Word In Text |
board |
&board |
board& |
&board& |
|
board |
Yes |
Yes |
Yes |
Yes |
|
snowboard |
|
Yes |
|
Yes |
|
boarding |
|
|
Yes |
Yes |
|
snowboarding |
|
|
|
Yes |
Masks and Regular Expressions allow you to perform more complex
filtering and pronunciation changes from the Basic Pronunciation Editor. Instead of specifying
pronunciations for individual words or substrings, you can specify patterns to
search for and build pronunciations based on sub expressions in the matching
strings.
If you're not familiar with the
term, Regular Expressions are a pattern matching language
used for searching and parsing text. TextAloud help will not get into the
detailed syntax of using Regular Expressions, so if you've never seen them
before you will need to look at some reference material. There are plenty
of references available on the web, you can get started by visiting
http://www.regular-expressions.info/
Although not as powerful as Regular Expressions, Masks provide similar capability and are easier to
learn and use.
A Mask or Regular Expression is added to the Basic
Pronunciation Editor by placing one of the following into
the Word field:
{{re=.... regular expression ...}}
{{mask=.... mask ...}}
A string in the Word field beginning with {{re= or {{mask=
indicates the beginning of a regular expression or mask. The trailing }}
characters are required to terminate the string.
The following characters may be used to construct a Mask:
|
Mask Character |
Description
|
|
# |
Matches
any digit (0 thru 9). |
|
$ |
Matches
any alpha character (a thru z and A thru Z). |
|
@ |
Matches
any alpha-numeric character (a thru z, A thru Z, and 0 thru 9). |
|
? |
Matches
any character (wildcard). |
|
_ (underscore) |
Matches
any word boundary character (spaces and punctuation). |
In the Pronunciation field, supply the text that should be
substituted for any text that matches the Mask or Regular Expression. Use
sets of parentheses in the Mask / Regular Expression to indicate sub
expressions that should be included in the Pronunciation field. $1 in the
pronunciation field references the first parentheses set, $2 the second set,
and so on. This allows you to change the pronunciations of text patterns,
where the substituted text contains pieces of the strings that match the
patterns.
The following examples will illustrate how to use a Mask or
Regular Expressions to change the pronunciation of year numbers.
The sample problem is to ensure that year numbers (like 1987) are
pronounced as "nineteen eighty seven" and not "one thousand nine
hundred and eighty seven".
Example 1 - Year Number Pronunciation Using a Mask
To solve this problem using a mask, enter the following text
into the Word and Pronunciation fields:
Word:
{{mask=_19(##)(_)}}
Pronunciation: <s>nineteen $1$2
The pattern specified by the mask is a leading word
separator, followed by the string "19", two digits and another word
separator. Whenever this pattern is detected in the text, the text in the
Pronunciation field will replace it. The <s> in
the pronunciation indicates to insert a space. This is necessary because
the first character matched by the mask is a word separator, so a leading space
should be included in the substitution. Following this space, the string
"nineteen " is added. The $1 and $2 strings in the
Pronunciation reference parentheses sets in the mask: $1 is replaced by the
last two digits of the year, and $2 is replaced with any trailing punctuation.
Example 2 - Year Number Pronunciation Using a Regular Expression
To solve the same problem using a Regular Expression, enter
the following text into the Word and Pronunciation fields:
Word:
{{re=\b19(\d\d)(\b)}}
Pronunciation: <s>nineteen $1$2
The use of a regular expression is very similar to a mask in
this case. The "\b" strings in the Regular Expression match on
word separators, and the "\d" strings match on the digits 0 thru 9.
As discussed in the How
Text To Speech Voices Work section, Phonemes can be used to define how a word is
spoken. When used only with some SAPI5 Voices that support it, TextAloud offers an
Advanced Phoneme Editor to give you greater control over how some words are
spoken.
Phonemes are defined as:
|
Definition |
|
|
|
A phoneme is a member
of the set of the smallest units of speech that serve to distinguish one
utterance from another in a language or dialect. |
|
Examples |
|
|
|
A unit of speech is
considered a phoneme if replacing it in a word results in a change of
meaning. Here are some examples of phonemes: |
|
|
|
To enable the advanced phoneme editor, you must switch
TextAloud to use SAPI5 Only. This is done via Options->Voice and File Options->Engines/Voices by setting
SAPI Version to
SAPI5 Only. After doing this, when you choose
Options->Pronunciation Editor, there will be the Basic Editor as
previously discussed, but there will also be an additional TAB available labeled
Advanced. Clicking this Advanced TAB
will display the Advanced Pronunciation Editor as shown below:

As with the Basic editor, the list on the left side shows
all words you have defined in the advanced editor. Select a word there to load it into the Word
and pronunciation fields to edit.
The significant difference between the basic editor and
this advanced editor is what is allowed in the pronunciation fields. Only defined Phonemes from the table in the middle of the screen can
be used. As you define the pronunciation
of a word, you scroll through the list of available phonemes, then double-click
the phoneme to have it appended to the pronunciation. When done, click Apply, then done.
Note that some advanced features, such as Primary Stress
and Secondary Stress are provided to adjust emphasis on syllables, but not all
voices will obey these instructions.
Edits defined in the advanced editor only apply when using
SAPI5 voices.
Note on Neospeech Voices: Neospeech voices currently do
not support phoneme edits.
TextAloud 2.0 has the ability to switch voices within the
middle of an article. This is another of
the features that is only available when TextAloud is set to use SAPI5 Voices Only.
After this is done, you can right-click at any spot within the text and
choose Insert Voice Change (or
choose Edit->Insert Voice Change)
and TextAloud will insert a special voice change TAG. This tag will look similar to
<voice required="name = Crystal16">
When speaking text and it gets to this TAG, TextAloud will
switch to the
One of the most powerful and popular features of TextAloud
is its ability to create audio files from text.
These spoken word audio files give you greater flexibility in listening
to text, by allowing you to use the audio elsewhere. Some of the possible uses include:
·
Sending audio files via email to friends or co-workers.
·
Burning on Audio CD to listen on your car CD player during your
commute.
·
Listening while exercising using portable MP3 player or iPod
·
Listen to converted text on PDA’s such as
PocketPCs.
·
Creating audio files for use in presentations, such as PowerPoint
files.
·
Creating audio versions of web pages for
sight-impaired users.