[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Search]

Aural CSS Settings Explained was Re: [emacspeak The Complete Audio Desktop] Emacspeak And Voice Locking Using Aural CSS

To: rdc1x@xxxxxxxxxxx
Subject: Aural CSS Settings Explained was Re: [emacspeak The Complete Audio Desktop] Emacspeak And Voice Locking Using Aural CSS
From: "T. V. Raman" <raman@xxxxxxxxxxx>
Date: Tue, 21 Feb 2006 06:19:38 -0800
Delivered-To: priestdo@xxxxxxxxxxx
Delivered-To: emacspeak@xxxxxxxxxxx
In-Reply-To: <874q2txuxo.fsf@xxxxxxxxxxx>
List-Help: <mailto:emacspeak-request@xxxxxxxxxxx?subject=help>
List-Post: <mailto:emacspeak@xxxxxxxxxxx>
List-Subscribe: <mailto:emacspeak-request@xxxxxxxxxxx?subject=subscribe>
List-Unsubscribe: <mailto:emacspeak-request@xxxxxxxxxxx?subject=unsubscribe>
Old-Return-Path: <tvraman@xxxxxxxxxxx>
Reply-To: raman@xxxxxxxxxxx
Resent-Date: Tue, 21 Feb 2006 09:19:39 -0500 (EST)
Resent-From: emacspeak@xxxxxxxxxxx
Resent-Message-ID: <nZ2dkB.A.mdE.7Fy-DB@xxxxxxxxxxx>
Resent-Sender: emacspeak-request@xxxxxxxxxxx

Aural CSS Settings Explained:

Here is how the four dimensions average-pitch, pitch-range,
stress and richness work (or are supposed to work)

First a bit about voices:

A speaking voice has a default pitch --- fundamental frequency
---
and this changes over the course of a sentence due to
inflection. Speakers also have the ability to "project" their
voice, or alternatively pitch it lower --- this is similar to
volume but not quite the same.

The ACSS Dimensions:

average-pitch: Basic voice pitch.
               In practice, speakers with smaller heads have
               higher pitched voices, so on formant TTS engines,
               you need to vary the head-size inversely with  the
               fundamental frequency -- see dectalk-voices.el and
               outloud-voices.el --- these are both formant
               engines.

Pitch-range: Determines "how excited" the speaker sounds.
If you look at the overall intonation contour, pitch-range
determines how high the peaks get and how deep   the valleys get.

Stress: This is indeed subtle.
Basically pitch-range is the overal intonation contour; stress
controls the individual peaks such as primary and secondary
stress. Just increasing pitch-range ends up with a very sing-song
effect; stress and pitch-range together often do better.

Richness: This is the "project your voice" setting. Its inverse
is "smoothness" which is why overlays like voice-smoothen set
richness to be low. The perceived effect is that the voice is
softer,  with higher values of richness, the voice gets
"brighter". If you look at the  spectogram, the "saw-tooth"
patterns you see are much sharper for higher richness values.

-- 
Best Regards,
--raman

      
Email:  raman@xxxxxxxxxxx
WWW:    http://emacspeak.sf.net/raman/
AIM:    emacspeak       GTalk: tv.raman.tv@xxxxxxxxxxx
PGP:    http://emacspeak.sf.net/raman/raman-almaden.asc
Google: tv+raman 

-----------------------------------------------------------------------------
To unsubscribe from the emacspeak list or change your address on the
emacspeak list send mail to "emacspeak-request@xxxxxxxxxxx" with a
subject of "unsubscribe" or "help"

References:
- [emacspeak The Complete Audio Desktop] Emacspeak And Voice Locking Using Aural CSS
  - From: "T. V. Raman" <noreply-comment@xxxxxxxxxxx>
- Re: [emacspeak The Complete Audio Desktop] Emacspeak And Voice Locking Using Aural CSS
  - From: "Robert D. Crawford" <rdc1x@xxxxxxxxxxx>

Prev by Date: Re: [emacspeak The Complete Audio Desktop] Emacspeak And Voice Locking Using Aural CSS
Next by Date: [emacspeak The Complete Audio Desktop] Emacspeak, SuDoKu And History
Prev by thread: Re: [emacspeak The Complete Audio Desktop] Emacspeak And Voice Locking Using Aural CSS
Next by thread: [emacspeak The Complete Audio Desktop] Emacspeak, SuDoKu And History
Index(es):
- Date
- Thread

Emacspeak Files | Subscribe | Unsubscribe | Search