|
A few comments
about this page:
With the continued interest in digital voice
communications in amateur radio, we decided to run a few tests using
D-Star radios to ascertain the behavior of the codec when subject to
sounds other than those of the human voice.
We felt it important to be able to understand how the voice of a communicator - either via an analog or digital transmission - might be affected in conditions that were less-than-ideal: Particularly in light of recent re-emphasis on the facility of using amateur radio in emergency communications systems, we wanted to provide some characterization of how they might behave in "less-than-ideal" situations - such as those where the speaker's voice may be in severe competition with other sounds. It should be noted that the intent of this analysis was to provide a reference for those who might consider the implementation of a digital radio system: It should come as no surprise that the digital voice system used in D-Star is somewhat more "fragile" than old-fashioned analog system. This not-unexpected result is a logical consequence of the "lossy" coding typical of low-rate speech-only codecs - the one used in D-Star being one of the better-performing codecs in this class. As can be seen from this page, the codec was subject to sounds that it was NOT intended to be able to handle (such as music) in order to observe how it would break down and provide some insight to how it responded when presented with other, more-realistic situations. In some ways, the results were rather surprising: Some listeners were, frankly, appalled at the results, but the opinion of many was along the lines of "That's better than I thought it would be..." To be sure, being able to make sense out of a degraded transmission - either digital or analog - is a skill acquired through experience, practice, and training. It was noted that so-called "skilled" operators (e.g. those that regularly work pileups on HF and are rather used to picking fragments of voice out of chaos) were generally able to make out the gist of what was being said on both the analog and digital transmissions, but that the analog transmissions were noticeably more "copyable." It was also noted that the un-skilled listener (a random person, unaccustomed to having to dig "speech fragments" out of such chaos or even a casual user of FM voice) had noticeably more difficulty deciphering the degraded digital speech than the analog. Again, such a result was not surprising, once again showing that experience and practice are of paramount importance in any critical traffic-handling situation. Brief mention has also been made on this page with reference to the degradation of signals due to the "digital cliff" - that is, the rather un-graceful drop-off in perceived quality that occurs when digital signals degrade below a certain point. Again, recognizing and knowing how to deal with these sorts of situations is another important facet of training and experience. Finally, links have been provided to observations made by various public-service entities related to digital (some of which use somewhat more sophisticated codecs at higher bit rates which are arguably more-resistant to the demonstrated degradations) and analog (both trunked and un-trunked) systems and how they perform in a number of environments. While these observations may not always be directly applicable to many amateur-radio situations (e.g. trunking versus non-trunked systems, and the ability of the experienced amateur to arbitrarily choose a frequency, mode and signal path as necessary) they are well-worth a read by any would-be system designer and emergency planner! While I'm sure that at least some of this has already been hashed and re-hashed, revisiting this and related topics is likely to be worthwhile. |
This page is not intended to "bash" D-Star or its codec, but rather to educate the user about peculiarities intrinsic to the codec used by D-Star.Some of the observations made on this page may not apply solely to D-Star, but to other digital voice codec/systems as well.
Analysis:
The "FM" (right channel) audio is provided mostly as a basis of
comparison: Aside from reduced frequency response and noted loss
of
fidelity - as would be expected from a "voice" channel - the original
program material sounds pretty much like the original:
In each case, there is the understandable difficulty in understanding
the voice when it is at roughly the same amplitude as the crowd noise,
but the trained ear can dig out most of the words.
The D-Star (left channel) audio, unlike the situation in which there
are
just two speakers competing for "codec time," the codec has trouble
when faced
with a (more-or-less) constant background noise - particularly in light
of the fact that this noise (the crowd) consists of many voices
overlaying each other. With this constant noise, the codec has
trouble finding "holes" in which clearly-audible syllables of the
speaker can be heard and the voice becomes largely
unintelligible. Note: Keep in mind that upon repeated
playbacks, one will likely become more-familiar with the text being
read and be able to understand more of what is being said that would be
the case for a first-time listener despite adverse conditions.
Through careful scrutiny, the clip also demonstrates another
property: When the speaker's voice is competing with that of the
crowd, one hears the speaker's voice only when its peak level exceeds
that of the noise of the crowd, effectively "capturing" the codec.
The lessons here should be obvious, no matter what sort of system (analog or digital) you are using: Do your best to make sure that your voice is the dominant one! Soft-spoken, mic-shy users may find themselves competing unfavorably with the crowd noise: If the sound-level of their voice is too quiet, the background sounds may simply override and "capture" the codec, removing too many traces of the speaker's voice to be audible.
There is another "gotcha", however: The D-Star codec includes
an
audio compander - that is, a built-in device that equalizes the audio
level such that either too high or too low audio gets attenuated or
amplified to maintain an overall, constant level. While this
feature is generally useful, it can be of detriment when other
noises are present. During pauses between words and syllables it
will happily increase the gain, bringing up the background and,
possibly, causing the sort of "codec confusion" that has been
demonstrated here.
|
What about "Narrow" FM?
Another mode available on D-Star capable
radios is "FM-N" or "Narrow"
FM. This is the same as "normal" FM used by amateurs on the VHF
and UHF bands for decades, except that the peak deviation is limited to
+-2.5 kHz instead of +-5 kHz.
The advantage of narrower deviation is that somewhat less bandwidth is required for a "Narrow" FM voice channel than a "Normal" FM voice channel - but it would be wrong to assume that it was half as wide! Why? A quick consultation of "Carson's Rule" provides the answer: While the deviation itself is half as wide, one still needs to modulate the same audio bandwidth as before. What this means is that, according to Carson's Rule, if your voice bandwidth extended to 2.5 kHz, you'd still need at least 5 kHz of bandwidth even if your deviation was set to zero! In practice, one could never set the deviation to zero: As one decreases deviation, the "FM Advantage" (that is, the "quieting" effect of the FM system) begins to diminish with narrower deviation. In fact, the use of "Narrow" FM suffers from this to an extent as compared to "Normal" FM, but the effect is not readily obvious to the causal observer. It turns out that using half the deviation in "Narrow" FM allows the system designer to tighten channel spacing from 20 kHz to closer to 15 kHz for geographically-adjacent systems, and down to 12.5 kHz (instead of 15 kHz for "normal" FM) for those systems with adequate geographical separation. It should go without saying that these benefits would not be possible unless all receivers used on these systems also incorporated correspondingly narrower filtering in their IF's: It is not enough to simply reduce transmit deviation! Also remember that Carson's Rule is an approximation: It cannot take into account all situations, such as the instantaneous wider bandwidth of an FM transmission that can occur during transient spectral peaks that may happen during modulation. In tests done by commercial and public-service entities, it seems that "Narrow" and "Normal" FM are pretty-much equal in their coverage and audio quality when properly implemented. It should also be noted that in many cases, "Narrow" FM radio systems include amplitude-companding techniques - using a compressor at the transmitter and a matching "de-compressor" at the receiver to maximize the signal/noise ratio across the link and to minimize effects of the S/N loss associated with the narrower deviation. It is important to note that not all implementations of "Narrow" FM, particularly those used in amateur radio, include the use of a "compander" system! |
This page is not intended to "bash" D-Star or its codec, but rather to educate the user about peculiarities intrinsic to the codec used by D-Star.It is worth reiterating that both experience and proper training can allow operators to recognize and remedy those situations in which communications may be compromised by extraneous noises, regardless of the medium over which the voice is conveyed!
Other Utah VHF Society links related to D-Star:
Misc. links related to D-Star:
This matter is open for discussion: If you have concerns or opinions one way or another, please make them known to the frequency coordinator at the email address below.
Questions, updates, or comments pertaining to this web page may
be
directed to the frequency
coordinator.
Return to the Utah VHF Society home page.
Updated 20081229