My Audio Setup

My Audio Setup

To start off, let me say that I was a touring soundman and a front of house mixing engineer in clubs full time for a number of years. The only reason I play with the equipment I do is simply because I enjoy the gear and tinkering with processing. 

There is absolutely no reason any ham should go down this rabbit hole unless they find it enjoyable. I've seen many people get frustrated in this area and spend money that they really should not have spent in the endeavor.

That said, I'll start with some fundamentals that any ham can follow to improve the quality of their audio and the cleanliness of the signal they are transmitting.

What is my secret?

There is no secret. What you hear is a combination of the hardware and equipment for sure, but this is very important, it is also the result of my experience in using all of it, the endless combinations possible and having spent hours playing with it...even with the experience I have.

Mic choice, use of processing and EQ and how to apply all of that simply takes time and experimentation. Wedging that result into a narrow 100Hz to 3kHz pass band for SSB just complicates things even more. The biggest improvement in my signal is the proper application of compression, downward expander (noise gate) and EQ, in that order. This is why my audio character stands out from a stock microphone.

This will take time and patience to learn.

The equipment I have kicking around

The very basics for most hams

The most common mistake people make is not properly adjusting the mic gain level on their rigs. Typically, people have it adjusted far too hot trying to squeeze every last watt from their 100W rig. This comes from a fundamental misunderstanding of how SSB works as a mode.

SSB mode, considering most rigs have average power meters will only indicate about 30W or so while speaking if the mic gain is properly configured. The mistake people make is trying to see 100W output on an average reading (as opposed to a peak reading) meter and they proceed to max the mic gain and processor drive levels to do it. 

The result of this is horrible sounding audio and a very dirty signal that will interfere with other users on adjacent frequencies, not to mention that it stresses the RF deck in the radios.

Step by step setup

If you made it this far you should be in very good shape for basic operation and sound pretty good. The ham radio world would be a better place if everyone just took a few minutes to understand this setup process.

A further comment on ALC. Some rigs will start to splatter with even a very small amount of ALC being indicated. I suspect poor quality control and improper bias voltages on pre-drivers and final transistors is to blame. Even if you follow the instructions in the manual, you can still have some IMD issues. The FT-991 series from Yaesu is an example of a rig that gets really dirty fast if you are hitting the ALC hard. The Icom 7300 in contrast, you have to be just smashing your ALC to have significant issues with IMD. Icom did a great job with the 7300.

While monitoring on a second receiver you will notice that each rig has a point with the mic gain where you can't hear yourself getting any louder as you increase the mic gain. You should stop increasing mic gain right about where you hear this threshold cross. This generally will prevent IMD issues, regardless of what your rig's ALC meter is telling you. On my 590 as an example, mic gain at 35 (with my external audio chain driving) is where this threshold exists. I can turn it to 99 without gaining any improvement over the air but it drastically increases IMD and splatter up and down the band from the actual transmitted audio.

Further complications...and improvements

Rigs started to have adjustable TX bandwidth back in the 90s or so. It was mostly the top end rigs that began to add the feature and now, even the entry level rigs have menu adjustments to set this. TX Bandwidth and also in turn RX Bandwidth controls determine the range of audio frequencies that you will transmit or receive.  

This is the overall width of your SSB signal on the spectrum and can make a huge difference in the characteristic sound that you transmit or hear while operating.

You will have to consult your manual to figure out if this is a feature your radio has and how to configure it.

I will use my Kenwood TS-590SG as an example. In my menus I have a high pass (this is the lowest speech frequency it will transmit before it is filtered out) and a low pass setting (this is the highest frequency of speech that it will transmit). I can set the high pass as low as 10Hz which will pass through a lot of rumbly bass output and as high as 3kHz which has the definition or "shine" on the sound of speech.  

I recommend that you set your high pass to 100Hz to avoid puffing sounds and wasted energy in your signal. Too much bass will make you sound muddy, waste transmit power and make you harder to understand. It is another layer of complexity to try and EQ and process the rumbly bass below 100Hz.

Properly managed, a low bottom end gives energy and a presence to your audio but you typically only want to spend transmit power there in strong signal and quiet band conditions and it's one of the trickiest things to get right. If you get it wrong you sound like you are talking through a pillow.

On the low pass (high frequency limit), in typical conditions you want this to be about 3kHz. In crowded bands like 20m you may want to turn it back to 2.8, 2.6 or even 2.4 to help punch your signal through. There are rigs that can do 4kHz and beyond, but the wider you are, the more distance you need from any adjacent conversations or you will interfere with them.  The higher frequencies add articulation, clarity and a more natural and open sound quality.

Unfortunately a lot of people are running 4 and even 5kHz wide while only being 3kHz away from other people having a conversation. They are actively interfering with the communications of others by overlapping their transmissions in the passband of the other users. If you take one thing away from this, please understand this concept and have some courtesy for other operators on the bands when you choose your operating parameters.

Wider sounds more natural when properly balanced, but it's also less efficient so it requires more power to get the same level of effectiveness from your signal. Go narrower to leave more room and to make a punchier and more effective signal if you have low power for the noise level of the propagation or band conditions.

Some rigs have very advanced equalizer capabilities now as well. Some have basic bass and treble controls that are fixed frequency (IC-7300 as an example), some have graphic EQs that you use software on a PC to configure, like the ARCP 590G software Kenwood provides and some have 11 band graphic EQs built into the rig itself. There are even rigs like several from Yaesu (991 as an example) that have parametric EQs in them.

To sum this section up

My favorite system to monitor myself

I put the radio I'm setting up on a multi-port switch.

I set the output of the switch to my dummy load.

I put the receiver I'm monitoring myself with on one of the grounded switch ports. This will give a good S9+20 signal to monitor with. My preference is an SDR Play using SDRConsole with the RX EQ disabled. SDRConsole allows for very easy recording as well.

Always take care that the receiver you are using has a wide enough, unconstrained frequency filter to hear the full signal you are testing. Disable all RX EQ and any noise reduction or other filters on the receiver you are using to test. You may need to experiment with the AGC on the receiver as well to be sure you are hearing things as they are.

Some audio file samples

A full range recording before any filtering is applied 

A 100-3000Hz bandpass filter, typical for SSB

300-2400 band-pass with heavy compression for DX 

You should notice the difference in low frequency and high frequency response. These files are not recorded after transmit, they are recorded on my PC. In transmit, through the radio, applying these filters will have a more audible and obvious effect.

Misc issues, other things to know

As you make your station more complex, you are without a doubt inviting more issues. Consider the notes here before you start adding complexity. As I've mentioned a bit, this is something that should be fun or interesting to pursue or it will result in a lot of possible frustrations.

I recently had the specter of RF in the shack pop up. In my case it was just a single band, either 40m with one antenna configuration or 160m with another. There are a number of causes for this, antenna imbalances (my likely cause using balanced feeder), equipment issues, grounding issues, improperly seated connectors, common mode currents, running power with the antenna too close to the shack (sitting in the RF field) as examples.

My fix in this case was to locate where the RF was getting in. In my case I changed microphones and the issue was exaggerated so it occurred to me to swap in other mics to narrow this down. After I realized it was one of my microphones being more sensitive to 160m RF, I decided to apply a few things to mitigate the issue.

First I picked up an assortment of FairRite snap on ferrites from Mouser. I picked Mix 31 compound to be most effective with 160m, some quite large snap on beads and some smaller. This way I could isolate my XLR audio cable right at the microphone and also the smaller ones for my headphones and other audio wires. I put 5 turns of the mic cable through the large toroid right at the back of the mic. I also added a Jensen ISO Max MI-XX isolation transformer placed right at the input of my Symetrix 528e processor on the mic input line. This damped the RFI enough to solve my issue and should serve me well as I change my configurations around. 

When troubleshooting RF in the shack issues, start by disconnecting everything you can, just a rig, a hand mic and an antenna/tuner.  This eliminates more possible entry points. If this works, start to add back your connections, CAT control, PTT links to amplifiers, audio outputs, external microphones, audio chain etc etc.  Check and re-check across multiple bands until you find when the RF issue appears again. Generally whatever you last added is the issue.

Before you go too crazy tearing apart the shack, do a connector inspection first.  Spin and re-seat any RCA, TRS, mic and coax connectors. Be sure they are all seated and making clean contact. Often a bit of corrosion from sitting or things getting tugged on will loosen things and this can easily introduce RFI issues.

You can address this by mitigating and filtering as I did or by correcting the actual cause which is generally an antenna system issue. The latter is preferred as ideally you don't have RF in the shack at all but sometimes it's simpler to use a band-aid fix or maybe you can't relocate your antenna.

Another tip with RF in the shack. If you run powered speakers, headphone amps and other complications, you will at some point get RF into all of that. It makes a big difference how you route your cabling. Visualize from left to right, all your receive audio gear on the far left, the transmitting RF emitting gear in the center and antennas and amplifers on the opposite side. This helps to keep higher power RF away from the receiving audio equipment. 

Keeping RF, audio and power cables somewhat bundled and separated will help you quite a bit. When it's all tangled in a spider's web, you are inviting issues. With all the tinkering I do, I have to take time about three or four times a year to unplug and re-route it all after things get all tangled from plugging and un plugging as I mess around.

More advanced use of processing concepts and gear...to be continued.

I'm running an Apache Labs Anan 7000DLE MKII SDR transceiver as well as a SunSDR2 DX from Expert Electronics. In both cases my audio is routed as mic -> outboard pre-amp processors -> USB sound interface (Motu M2) -> Windows 10 or Ubuntu -> SDR software via the ASIO audio or ALSA (linux) transport layer.

Both of these SDR rigs have a complete suite of processing tools, EQ, gate, compressors, clippers. They have everything you need to get almost any microphone dialed in to the best it can be. The two rigs differ in the tools, but they can achieve similar results in the end.

Using Voicemeeter Potato beta release with the built-in processing

A recent (11/2022) change I've been trying with great results is running the Voicemeeter Potato beta release stand-alone, no DAW involved. They added a six band parametric EQ, a compressor, a gate, a denoiser filter, a nice reverb and delay plugin. The only missing link is a de-esser. I'm very pleased with how well it all works.

The advantage is that it's just one less layer of software audio to glitch.

At this point, I run virtual audio through Voicemeeter for the Anan, the Sun and the TS-590sg as my normal daily routine.

Using Voicemeeter Potato and the Reaper DAW with plugins

So I got bored again and decided to give it another try running a DAW for audio processing.

A DAW or digital audio workstation is studio recording software that runs on a PC or Mac and takes the input typically of a USB soundcard interface (A Motu M2 in my case) and records and alters tracks for mixing music typically.

I use Voicemeeter Potato, which is a digital mixer. The point of Voicemeeter is to take my input devices, the signal from my Motu and mix and route the audio for that device to things like my radios or for use with chat and video conferencing apps. Voicemeeter also serves as a handful of virtual audio cables that allow interconnecting audio between applications. Running digi modes with fldigi as an example, I tie one of the busses in Voicemeeter to the SDR rig I'm running for sound input and output.

Once the audio is now into the PC and routable, the next component is the processing to apply. In my case I'm running the Reaper DAW, there are many others to look at like Audio Mulch, Pro Tools, VST Host etc. Most of the DAW packages include a number of VST or other type "plugins" and these plugins are basically the software equivalent of rack gear.

The complexity and learning curve of getting all of this installed, configured, tweaked and stabilized (relatively) is a bit daunting and it will go wonky fairly often requiring a dance of shut down and bring up of the software and generally gritting teeth until all the sample rates settle in and stabilize.

The major advantage is, this offers limitless processing capabilities and for vastly less money than using hardware rack gear.  I think, all in, Voicemeeter, Reaper and a few added plugins I paid for, I'm in for maybe $200.  It would be a few grand in physical gear to equal the capability I have.

A few notable plugins I'm using. 

Pros

Cons

Summary

I recommend doing this if you are patient and enjoy tinkering with software, but if you are not, just find another way.

A cool alternative

The Behringer Xair and the Midas sister to it is a hardware appliance that has digital processing built into it. This takes your PC out of the loop and the software/firmware runs more stably inside the hardware appliance.  If I get mad enough at my software DAW setup, I'll be trying one of the Xair 18 or Midas 18 models as my next experiment.

EQ some tips to make sense of it and how to learn to use one.

Equalizing audio takes time and patience to "master", there is no formula to follow and it has many variables.

Different types of EQ

How you learn where to start with an EQ

You need a way to listen to yourself. The worst option is the "monitor" function in most radios using headphones plugged into the jack. It is better than nothing but they aren't known to be very accurate. A much better option is a second receiver. Transmit into a dummy load through a switch, use one of the other non selected switch ports to feed a second receiver and listen with headphones on that receiver. You only need a few watts to get a good 20 over 9 signal generally.

I use a convoluted method of a dummy load with an RF coupler on the line to the load and feed the output of that coupler into an SDR receiver and use software that allows me to not only hear myself but record and play back as well.

A critical part to understand is that the receiver you use has a receive filter and you need to understand how wide that filter is, if your speakers or headphones reproduce things well etc. Many receivers do not open up down to 0Hz so you can have unintended spikes in low frequency and not realize it if testing with the wrong receiver or filter width. The same goes with higher frequencies. Icom rigs as an example are very limited, 100Hz to 2.9kHz is about all they allow in some cases so you won't have a good idea of anything that is sent below or above those limits.

Take your EQ and set it to zero all the way across.

Now while talking, one frequency at a time, take the frequency and boost the heck out of it. Take notice of where in your speech range that frequency has an effect and notice what that effect is.  Do this one frequency at a time, returning each one to zero as you go through and learn what part of your speech each frequency range affects.

What you want to listen for are "hot spots", there are frequencies that for each voice + mic + rig will be too hot and they "ring" when you find them by boosting the EQ. "Ringing" means they are much hotter when boosted than other ranges. Generally a hot spot is a problem frequency that is out of balance and could use some cutting or attenuation.  Hot spots make you sound unnatural.

Mud muddy muddier muddiest, you sound like you are talking through a pillow.

The number one issue are "mud frequencies", they sap efficiency and obscure clarity and make it hard to understand people. The cure for mud is choosing a high pass/low cut frequency that works for you. Typically I recommend starting with 100Hz for high pass to make your life simpler. Rolling off below 100Hz solves a lot of issues for people and makes your signal more power efficient as well.

Below 100Hz is rumble, vibration, an anchor energy.  A kick drum beat in music is 63-80Hz the punch you feel from it. For voice it's the semi-artificial Howard Stern DJ rumble range.  It uses a lot of power and most hams do not have speakers sufficient to hear it. You can wind up with far too much here because what you listen with or others aren't actually picking up how much energy you are wasting here.

The low-mid 100-200Hz depending on your voice is a back of throat, guttural sound like a low pitched "ooooooh" "new" "you", it's a warmth frequency but this is the most common range 120Hz-160Hz where most people need to cut quite a lot for radio use, like -5db and deeper. If you are too hot here, you will sound mushy.

You almost never want to boost the bottom end for radio. If your voice doesn't have bottom end, boosting there will just pull in room rumble and waste power. You almost always want to cut these frequencies, which in turn will make your signal sound like you boosted the highs.

If people say you need more highs, it almost always means you need to cut the low-mid a bit more.

Now if you find where the low-mid needs to be cut for your voice, you can bet that the next harmonic (if you find it's 140Hz, roughly 280Hz is the next harmonic) can use a dip too. I generally find that if you drop say 6db at 140, that 280 will be a bit less, like 4db-5db.  This range is warmth to boxiness. If you say the words "wuh" "muh" "wah" it's that lower part of the mid and start of boxiness that you hear. It makes things sound unnecessarily "thick".

All these ranges vary with voices, mics, rigs etc. there is no magic formula and you really can't share settings with anyone unless all conditions are identical for mic, rig and then the voices will vary between ops.

The next range is the harder boxiness range. 400Hz to about 800 or 900Hz. The words "box" "ox" "octo" "ock ock ock". This range gets aggravated with speech processors commonly or if you are hitting the ALC too hard. It just creates an unpleasant character. You want to beware that if you dip this too much, you will lose some important energy for understanding words. Slight dips here to level things are typical, like -3db or so and only if you need them.

This boxy range can be detected when esses sound like you are saying "ex", if it's too hot here it will interfere with hearing "x" vs "s" or if you say "x-ray" where the "ex" sound is exaggerated.

Up from there is the hard edge of the top of mid-range, the 1.2 or 1.4kHz area if you make sounds like "ink" "ank" "tank" it's the harder edge of those and similar to the last they tend to get hot with heavy compression, too hot mic gain, heavy processing.  Go easy here as this is the meat of articulation, but too hot here makes you sound like you are on an old telephone.

The upper mid to what are the "highs" for radio 2-3kHz are sounds like "eek" while pinching it in your throat "squeak", to exaggerate the eeeee part is about 2.5kHz. This puts a metallic harshness if boosted to far but this is the area that helps cut through noisy band condx too so cut here only in odd conditions.  Typically this area, a filter centered at 2.5-3kHz depending with a slight boost of 3-4db will put some "shine" on audio for most people.

99% of the time, if you clean up your high pass to 100Hz, the low-mid just above that and the next harmonic and you get that just right, people will start to comment about how nice your audio sounds. 

The Icom IC-7300 has the easiest setup of any rig I know of. Using the stock hand mic, set TBW to WID, set Bass to -2, Treble to +3, Compressor to 1 and mic gain around 30-40 depending on how loud you speak and you are just good to go. The simplest rig with the nicest baseline audio that has been put on the market.