Audio Basics 0 - Intro
Posted: 27 Jul 2014 07:18
I have been pondering this for a while. I thought I would write up a little series of blurbs on audio basics. Perhaps start with some info on cleaning up files and some of the tools that are available, with a focus on the free ones. None of this is cast in stone. I have been doing professional audio for many years now and thought something like this may be useful
I have spent a couple days walking around with parts of this in my head, so many thoughts, where to begin...
Perhaps the best place to begin is at the beginning. When we create a sound file...
Back in the old days sounds were recorded by an engineer, on tape. The engineer had an interesting job. If you were to record in a perfectly quiet room, you still got some noise. This came from many sources. Any electronic device has an inherent amount of self noise, this is noise that is not related to the sound you are recording. More often it is thermal noise and related to heat and the components. There is also induced noise. This is again not related to the audio you are recording. This time it may come from magnetic fields or radio frequency interference.
The self noise is not curable, but a lot of decent equipment today comes very close to having as little self noise as it theoretically possible. Better equipment will do better in this regard, but it is never totally curable.
Induced noise will always be around as well, but better cables are shielded from it better, as is better equipment.
So if there is a gem in that part, it is start with the best equipment you can.
The next source of noise is audio in nature. In a studio it can be a hiss or a rumble from the HVAC system. In our recordings it can be literally anything. A TV being on, cars going by, people talking, birds chirping. About the only things we can do here are get as close to the source of the sound as we practically can to make it proportionally louder than the extraneous sounds and we can also use a directional microphone to our advantage.
So what did our old sound engineer do back in the day, and what was the art? We have already spoken about the noises that take up the quiet end of the sonic spectrum. Obviously if a sound we want to record is fainter than the background noise, we are not going to be able to hear it. So one thing our engineer did was he kept is finger on the gain control, and made sure that the gain was turned up such that when was being recorded was louder than the ambient noise.
Now magnetic tape had another nasty little feature too. You could only magnetize it so much, once you reached that point it became saturated. Much like a sponge can only hold so much water before it becomes saturated. You can try to magnetize the tape more but once you have reached the saturation point that was it. In more practical words, you can only record so loud on the tape. If you try and record louder, it more or less just distorts.
So we have a lower limit, and an upper limit. There is a technical word for this, and it is called dynamic range. This is the range of sound levels that the device or medium is capable of recording or reproducing. Back in the tape days this may have ranged from perhaps 40 to 70 db or a bit more with electronic noise reduction and a good tape formula. With digital media it is not uncommon to see dynamic ranges in the 90-100 db range or greater.
So, our engineer had to sit there and turn up the gain on quiet passages so they would be louder than the hiss on the tape, and turn down the gain on the louder passages so they would not distort. The art was how they did it. And it was an art. A good engineer could let a very quiet passage start very very quiet and slowly ramp it up so you could comfortably hear it, and than quickly lower the gain for a loud part. The engineer had a brain and also knew the piece so he was also in a sense playing with the band.
Ok, a few notes here. If you look at dynamic range or signal to noise ratios on equipment make sure you are comparing apples to apples. Over the years many trade groups have come up with what they call "weighing's" . In some instances they even made sense. A weighing is filtering some frequencies out of the noise figure. For example if one manufacturer has put very large filter capacitors in their equipment to quell all of the 60 cycle hum, they may like an weighed reading, where another manufacturer would prefer a weighed reading on their that does not respond to the 60 cycle noise. If you compared a weighed and an unweighed reading you are not making an accurate comparison.
So, over the years on almost all tape machines and many digital recorders they have a thing called AGC or AVC. This is automatic gain control or automatic volume control This lets you make generally make pretty good recordings with out ever having to set any levels. It is a form of dynamic compression. That means in general it makes soft things louder and loud things softer. Compressors have time constants. How long something has to be amiss before they will "attack" the problem, and than they have another time constant for how long the problem has to be resolved for before they will "release". If you have ever turned an inexpensive recorder on in a quiet room you no doubt have heard this. At first you hear the click of it starting, than over the next few seconds you hear the hiss build up louder and louder. Than as soon as there is any sound being recorded the sound it over modulated for the first fraction of the second, and than it sounds good. What you heard with the ACG in action. With no sound it tried to make quiet things louder so it kept turning the gain up, until it was all the way up. Than as soon as there was a sound, bang, it turned the gain down to a level appropriate for recording that sound.
If you have a classical FM station that still plays classical LP records on occasion, listen sometime critically with headphones. When there is a quiet passage in the music and it is really quiet, the compressor at the station will crank the level up. If there happens to be a scratch in the record, the "pop" from the scratch will be very loud and it will cause the compressor to turn the gain down quickly, and it will "punch" a hole in the sound until the release time on the compressor allows it to turn the gain back up again. Once you hear it, it is easy to spot.
And that brings up a bit of a warning. Many audio tricks, once you understand how they work it makes them much less enjoyable for you as you will always be spotting artifacts of them. If you really just enjoy listening you may not want to peek under the veil.
And a note about listening. Often I prefer to listen to a wild recording. I think my brain makes a better filter most of the time than the electronics. But I will do a lot of manipulations. Often I can hear more what I want to hear but the recording is not as nice to listen to.
A good case in point is that glasgow recording here that I am so fond up. I love the wild recording. I also enjoy it in my winmap rotation with the EQ set and rocksteady, my compressor. It is still very listenable. I have versions processed through audacity that are very filtered and compressed. I can hear much more what was going on, but it is a much less enjoyable listening.
So, what tools do we have once we have a file. We have our brains. Often that is the nest tool of all for picking stuff out. Next we have about 3 categories of tools to play with. The first are filters, the second are dynamics, and the third are everything else.
Believe it or not our friends at Ma Bell have done tons and tons of research over the years and while we all love hi fi, the majority of the intelligence in audio is contained in one small little band, from 300Hz to 3000 Hz. Back in the old days you had to buy precision parts to build filters that had deep cuts, but now that the world has gone digital it is easy as pie. So, often times my very first thing to do to a file is snip odd everything outside of 300-3000Hz. In winamp I can use the EQ, ditto for VLC media player. You can get up to 15 DB of cut in winamp nativity, 30 DB of cut in VLC if you turn on the 2 stage, and I think 48 DB in Auditcy. Th later requires running it through once in HI PASS mode and putting the freq on 300 and setting it to the highest order, and than a second pass in low pass mode and setting it at 3000, and again the highest order.
This will not sound real good, but, and this is a big but, all the rumble will be gone as will most of the hiss, so you can crank the volume up and listen much more "closely" than you could before. But yes, it will sound like a telephone. Not hi fi at all.
Note, there is a physcoacoustic trick you can play on yourself if you want to. It is called perfuming. Another study, not bell this time as far as I know showed that adding some high frequency hiss to a file devoid of any high frequency information, made it sound "brighter". Apparently somehow in your brain the his gets modulated with the sound and you kind of synthesize the effect. So, now you can try cranking up the higher frequency faders on the graphic EQ and see for yourself. Sometimes just a little snort of noise actually makes a file sound a lot better, even if it makes it harder to "listen" to.
Cutting everything but 300-3Khz takes out all 60Hz remnants. If there is some other fixed sound, which is rare, but occasionally happens or if the sound "blares" at one frequency, you may be able to calm that with a parametric EQ application. A panoramic EQ is a cool tool. They generally only have a few bands, and the bands often times overlap. Unlike a graphic EQ where the center frequencies are fixed, and parametric EQ lets you move the center frequencies of the filters as well as the boost and cut and also the filters Q. The Q is the width of the filter. So you can boost or cut from a very narrow notch to a very wide band. For fun you c an crank the gain all the way up, crank the Q to the max, and sweep the frequency back and forth and listen to the wah wah effect on the audio, just like the guitar wah wah peddle.
The nice thing about doing all of this digitally is that the toys are free and you can try and tune it and try and tune it and try.
That's it for tonight. Next time I will revisit tweaking the dynamics, and some things that have helped me. Again, a lot of radical tweaks can often bring things out in a file, but there is a difference between bringing everything out in a file and having it sound good to you. Luckily you can make copies of both (smile)
I have spent a couple days walking around with parts of this in my head, so many thoughts, where to begin...
Perhaps the best place to begin is at the beginning. When we create a sound file...
Back in the old days sounds were recorded by an engineer, on tape. The engineer had an interesting job. If you were to record in a perfectly quiet room, you still got some noise. This came from many sources. Any electronic device has an inherent amount of self noise, this is noise that is not related to the sound you are recording. More often it is thermal noise and related to heat and the components. There is also induced noise. This is again not related to the audio you are recording. This time it may come from magnetic fields or radio frequency interference.
The self noise is not curable, but a lot of decent equipment today comes very close to having as little self noise as it theoretically possible. Better equipment will do better in this regard, but it is never totally curable.
Induced noise will always be around as well, but better cables are shielded from it better, as is better equipment.
So if there is a gem in that part, it is start with the best equipment you can.
The next source of noise is audio in nature. In a studio it can be a hiss or a rumble from the HVAC system. In our recordings it can be literally anything. A TV being on, cars going by, people talking, birds chirping. About the only things we can do here are get as close to the source of the sound as we practically can to make it proportionally louder than the extraneous sounds and we can also use a directional microphone to our advantage.
So what did our old sound engineer do back in the day, and what was the art? We have already spoken about the noises that take up the quiet end of the sonic spectrum. Obviously if a sound we want to record is fainter than the background noise, we are not going to be able to hear it. So one thing our engineer did was he kept is finger on the gain control, and made sure that the gain was turned up such that when was being recorded was louder than the ambient noise.
Now magnetic tape had another nasty little feature too. You could only magnetize it so much, once you reached that point it became saturated. Much like a sponge can only hold so much water before it becomes saturated. You can try to magnetize the tape more but once you have reached the saturation point that was it. In more practical words, you can only record so loud on the tape. If you try and record louder, it more or less just distorts.
So we have a lower limit, and an upper limit. There is a technical word for this, and it is called dynamic range. This is the range of sound levels that the device or medium is capable of recording or reproducing. Back in the tape days this may have ranged from perhaps 40 to 70 db or a bit more with electronic noise reduction and a good tape formula. With digital media it is not uncommon to see dynamic ranges in the 90-100 db range or greater.
So, our engineer had to sit there and turn up the gain on quiet passages so they would be louder than the hiss on the tape, and turn down the gain on the louder passages so they would not distort. The art was how they did it. And it was an art. A good engineer could let a very quiet passage start very very quiet and slowly ramp it up so you could comfortably hear it, and than quickly lower the gain for a loud part. The engineer had a brain and also knew the piece so he was also in a sense playing with the band.
Ok, a few notes here. If you look at dynamic range or signal to noise ratios on equipment make sure you are comparing apples to apples. Over the years many trade groups have come up with what they call "weighing's" . In some instances they even made sense. A weighing is filtering some frequencies out of the noise figure. For example if one manufacturer has put very large filter capacitors in their equipment to quell all of the 60 cycle hum, they may like an weighed reading, where another manufacturer would prefer a weighed reading on their that does not respond to the 60 cycle noise. If you compared a weighed and an unweighed reading you are not making an accurate comparison.
So, over the years on almost all tape machines and many digital recorders they have a thing called AGC or AVC. This is automatic gain control or automatic volume control This lets you make generally make pretty good recordings with out ever having to set any levels. It is a form of dynamic compression. That means in general it makes soft things louder and loud things softer. Compressors have time constants. How long something has to be amiss before they will "attack" the problem, and than they have another time constant for how long the problem has to be resolved for before they will "release". If you have ever turned an inexpensive recorder on in a quiet room you no doubt have heard this. At first you hear the click of it starting, than over the next few seconds you hear the hiss build up louder and louder. Than as soon as there is any sound being recorded the sound it over modulated for the first fraction of the second, and than it sounds good. What you heard with the ACG in action. With no sound it tried to make quiet things louder so it kept turning the gain up, until it was all the way up. Than as soon as there was a sound, bang, it turned the gain down to a level appropriate for recording that sound.
If you have a classical FM station that still plays classical LP records on occasion, listen sometime critically with headphones. When there is a quiet passage in the music and it is really quiet, the compressor at the station will crank the level up. If there happens to be a scratch in the record, the "pop" from the scratch will be very loud and it will cause the compressor to turn the gain down quickly, and it will "punch" a hole in the sound until the release time on the compressor allows it to turn the gain back up again. Once you hear it, it is easy to spot.
And that brings up a bit of a warning. Many audio tricks, once you understand how they work it makes them much less enjoyable for you as you will always be spotting artifacts of them. If you really just enjoy listening you may not want to peek under the veil.
And a note about listening. Often I prefer to listen to a wild recording. I think my brain makes a better filter most of the time than the electronics. But I will do a lot of manipulations. Often I can hear more what I want to hear but the recording is not as nice to listen to.
A good case in point is that glasgow recording here that I am so fond up. I love the wild recording. I also enjoy it in my winmap rotation with the EQ set and rocksteady, my compressor. It is still very listenable. I have versions processed through audacity that are very filtered and compressed. I can hear much more what was going on, but it is a much less enjoyable listening.
So, what tools do we have once we have a file. We have our brains. Often that is the nest tool of all for picking stuff out. Next we have about 3 categories of tools to play with. The first are filters, the second are dynamics, and the third are everything else.
Believe it or not our friends at Ma Bell have done tons and tons of research over the years and while we all love hi fi, the majority of the intelligence in audio is contained in one small little band, from 300Hz to 3000 Hz. Back in the old days you had to buy precision parts to build filters that had deep cuts, but now that the world has gone digital it is easy as pie. So, often times my very first thing to do to a file is snip odd everything outside of 300-3000Hz. In winamp I can use the EQ, ditto for VLC media player. You can get up to 15 DB of cut in winamp nativity, 30 DB of cut in VLC if you turn on the 2 stage, and I think 48 DB in Auditcy. Th later requires running it through once in HI PASS mode and putting the freq on 300 and setting it to the highest order, and than a second pass in low pass mode and setting it at 3000, and again the highest order.
This will not sound real good, but, and this is a big but, all the rumble will be gone as will most of the hiss, so you can crank the volume up and listen much more "closely" than you could before. But yes, it will sound like a telephone. Not hi fi at all.
Note, there is a physcoacoustic trick you can play on yourself if you want to. It is called perfuming. Another study, not bell this time as far as I know showed that adding some high frequency hiss to a file devoid of any high frequency information, made it sound "brighter". Apparently somehow in your brain the his gets modulated with the sound and you kind of synthesize the effect. So, now you can try cranking up the higher frequency faders on the graphic EQ and see for yourself. Sometimes just a little snort of noise actually makes a file sound a lot better, even if it makes it harder to "listen" to.
Cutting everything but 300-3Khz takes out all 60Hz remnants. If there is some other fixed sound, which is rare, but occasionally happens or if the sound "blares" at one frequency, you may be able to calm that with a parametric EQ application. A panoramic EQ is a cool tool. They generally only have a few bands, and the bands often times overlap. Unlike a graphic EQ where the center frequencies are fixed, and parametric EQ lets you move the center frequencies of the filters as well as the boost and cut and also the filters Q. The Q is the width of the filter. So you can boost or cut from a very narrow notch to a very wide band. For fun you c an crank the gain all the way up, crank the Q to the max, and sweep the frequency back and forth and listen to the wah wah effect on the audio, just like the guitar wah wah peddle.
The nice thing about doing all of this digitally is that the toys are free and you can try and tune it and try and tune it and try.
That's it for tonight. Next time I will revisit tweaking the dynamics, and some things that have helped me. Again, a lot of radical tweaks can often bring things out in a file, but there is a difference between bringing everything out in a file and having it sound good to you. Luckily you can make copies of both (smile)