Equipment Review Organization And Uniform Terms

Jim Shaw · Apr 4, 2022

Topic: More consistent review structure and glossary of terms.

I have suggested this to a couple of subjective reviewers and was given the sub-zero shoulder. They don't want to be pinned down, especially about the slang terminology they toss around. But, here is different.

I'm happy with ASR's format and dogma in presenting relevant data -- objectively. Bravo. But I kind of think ASR and its readers could do a little better with presenting subjective information. My thought is that, just because presented subjective listening tests are opinions, it doesn't mean that they should be presented totally in essay format. Yes, a short essay is necessary, but (like I was taught in my old composition classes in engineering college) I'd prefer if they followed a general outline. And it could help the subjective reviewer to try to touch on most/all of the salient points of interest to readers. Sort of like taking a shopping list to the supermarket.

And more importantly, I would hope that, somewhere, there'd be a glossary of subjective terms to help us avoid misunderstandings. For example, what does unusually low impedance mean in ohms? What does excessive distortion in various conditions mean? What is sub-bass? What are mid-lows? What does warmth mean? What does bright tend to sound like?

I know that reviewers should have considerable freedom, especially to be wrong. But in the science of audio, V-curved, bright, excessive bass, or comments about male/female vocals would be nice to narrow down. Can anyone define "warm"? Dry? Stage? Depth?

I ask because I'm pretty sure I don't know what many of us are talking about. And I'm pretty sure we have the talent and determination here to at least form an ever-improving glossary. Good heavens, the subjective pundits might catch on.

Now back to desperately trying to be the smartest and too wordiest folks in the room...

DWPress · Apr 4, 2022

If you can understand the measurements that @amirm presents or take the time to learn to understand them (and Amir has a couple posts that do explain them) then the subjective comments he offers are usually pretty easy to correlate. Building up a glossary of subjective terms to adhere to would open up a can of worms I'm afraid.

amirm · Apr 4, 2022

You can't standardize terminology that some reviewers use because they have no physical backing. Mine are reflection of the measurements (more or less) so you should be able to correlate them. Just answering one, sub-bass in my book is below 40 to 50 Hz.

Vini darko · Apr 4, 2022

I often refer to @solderdude map of subjective translation.

Vini darko · Apr 4, 2022

amirm said:
You can't standardize terminology that some reviewers use because they have no physical backing. Mine are reflection of the measurements (more or less) so you should be able to correlate them. Just answering one, sub-bass in my book is below 40 to 50 Hz.

Yeah trying to understand what john darko is saying is a good example.

DVDdoug · Apr 4, 2022

For example, what does unusually low impedance mean in ohms? What does excessive distortion in various conditions mean? What is sub-bass? What are mid-lows? What does warmth mean? What does bright tend to sound like?

From what I've seen here THAT IS the summary and then you can look into the measurements or graphs for the specifics.

"Bright" usually means boosted highs and "dull", attenuated highs.
"Warmth" usually means boosted mid-bass (maybe 100-200Hz?) but somewhere I read that it can mean slight (and probably "pleasing") distortion so I try not to say "warm" anymore.

Personally, I try to be more specific and avoid this kind of terminology. Or a sometimes I'll include an explanation like, "Many older records sound a little dull (rolled-off highs)".

Jim Shaw · Apr 4, 2022

amirm said:
You can't standardize terminology that some reviewers use because they have no physical backing. Mine are reflection of the measurements (more or less) so you should be able to correlate them. Just answering one, sub-bass in my book is below 40 to 50 Hz.

Yeah. That was Sean's and Steve's response. I'm sorry I asked. I'm sorry that I'm sorry I asked. Maybe I should delete the thread. Glossaries make no sense in audio. Never, ever, for all future time.

Midwest Blade · Apr 4, 2022

The non standardization keeps the subjectivist in business. Some I still find entertaining, dislike it when they try and poop on the objective use of measurements. No need to feel sorry about the idea, it would be nice if the audio world just remain civil about the whole thing.

solderdude · Apr 4, 2022

Jim Shaw said:
I would hope that, somewhere, there'd be a glossary of subjective terms to help us avoid misunderstandings.

Jim Shaw said:
What does warmth mean? What does bright tend to sound like?

Airy – Describes the sense of space and openness of the music. Best heard in cymbals where the shimmer is realistic. This mostly has a relation with treble extension. It should be mentioned that not all well extended headphones sound ‘airy’ and some not so well extended headphones can still sound sort of ‘airy’. The superlative (exaggerated) form af ‘airy’ sound is ‘ethereal‘ where the sound feels very ‘fragile’ and overly ‘delicate’.

Analytical – A term used to describe a product that produces a high level of detail in reference to the music being played back. In technical terms this means the tonal response is between ‘flat’ and slightly elevated above 3kHz.

Balance – This could mean balance between channels or sometimes people use the term to describe Tonal Balance.
Balance between channels means the stereo image is right in the middle. In some cases balance can be correct for a certain part of the frequency range and not for higher or lower frequencies. This can be spotted in measurements. In sound it can lead to a fuzzy position of a stereo image.
Also one sided seal loss with headphones can cause this when bass is less prominent in the leaky side. On measurements with perfect seal this won’t be obvious.

Bassy – This is the lower end frequency of the human hearing. Bass can be measured in quantity (heaviness) and quality. Other bass descriptors are muddy, punchy, rubbery, boomy and fat.
The bass area is between 25Hz and 300Hz but harmonics can go as high as 5kHz. Peaks and dips in the frequency range between 20Hz and 500Hz can make the bass sound different from realistic. For descriptors and the frequency range they fall into see the chart above.
The opposite of bassy is Lean

Bass (Lows) – can be divided 4 ranges: sub-bass (10Hz-25Hz) this is not audible as tones or notes but deep rumbles in movies are an example. When it is absent most people won’t miss it. That is once you heard what bass extension does it can add to the musical experience. For instance with Piano music where pedals are used this lowest part can make the piano sound more real.
Lower bass (25Hz-50Hz) is the part of the bass where the deepest musical notes are. A small and gentle roll-off usually isn’t very audible. A substantial roll-off (-10dB) is quite audible.
Bass (50Hz-120Hz) is the part where most of the ‘body’ of bass instruments is. Sharp dips or peaks and emphasis or subdued parts here can color the bass, make it sound boomy, exaggerated, full, tight, dis-attached or muddy. The best sounding headphones/speakers have a flat response in the area. Some folks prefer substantially more bass than others. When traveling a bit more bass usually is welcome. High-bass (120Hz-250Hz) is where the harmonics of most bass sound producing instruments are. A small peak there may make bass sound tighter. Too much (>5dB) can make bass sound muddy and boomy. A dip in this area can make bass instruments sound ‘wrong’ or ‘disattached’. This is where bass appears to be ‘apart’ from the music. Some people actually prefer a dip, certainly when the lower frequencies are elevated.

Bloat – Bloat is present usually in the mid bass. When frequencies below 400Hz are lifted opposite the mids and highs the bass can sound ‘bloated’ and overdone. When there is a dip around 150Hz this is the opposite of bloated and the bass sounds disattached.
Bloated bass often also causes ‘bloom‘ in mens voices. In this case the male voice sounds unnaturally bassy/full and lacks clarity.
When only the part below 150Hz is boosted the bass doesn’t sound ‘bloated’.

Bright/Brightness – A small emphasis of upper mids. The frequency band between 1kHz and 7kHz. Brightness is a wider bandwidth form of Clarity + Detail, when exaggerated even more it becomes Shrill.

Clarity – A few dB emphasis between 1kHz and 3kHz can add some clarity for voices and instruments. The opposite (a dip) of clarity (seen a lot in headphones) could be described as Laid back, Sweet, lacking Attack. An exaggerated form is shrill, cold, hard.

Coarse – is another word for Grainy or Gritty and describes uneven treble response with deep dips and peaks. Note that measurements made with measurement gear using a fake Pinna can show substantial peaks and dips above 5kHz caused by the fake Pinna but may not indicate roughness nor should one try to EQ this out.

Cold – a general upward tilt in the frequency range between 300Hz and 3kHz. The opposite is warmth.

Congestion – Something sounds congested when it the sound is muffled and not very dynamic. It usually is a combination of emphasis in warmth and laid back sound / lacking in clarity.

Compressed – This can mean 2 things. There is compressed data and compressed music. Compressed data means that a file has been shrunk in size. Most people have heard of ZIP and RAR where one or more files are smaller in size so they can be sent as an attachment. This is lossless compression where the file can be reconstructed to its original size and then can be used. In Audio there are many lossless formats. The biggest difference with something like ZIP is that a ZIP file needs to be ‘unpacked’ before one can use the file (song or album) where audio files are de-compressed while the song is being played back. One can also compress files by leaving out data which can not be recovered later. The most well known lossy compression files are JPG (for pictures) and MP3 for music but there are many more formats. During the compression (encoding) one can usually select how much one throws away. More compression = more loss of fidelity.
When music is compressed the louder parts in a recording and lowered in volume and/or softer parts of a recording are amplified. Usually this is done in studios to get a more even sound. There is a fine line between this being done craft-fully (usually for each instrument its own compression) and it being overdone in dramatic ways. This sadly is very common for most popular recordings. Search for ‘loudness wars’.
When a headphone sounds compressed it usually is bit bloated sounding and lacks realism/clarity/dynamics.

Crisp – is another description of having clarity, an exaggerated form is shrill. The opposite is laid back.

Dark/Darkness – Describes a gradual downward slope of frequencies above 3kHz often combined with an emphasis on bass/lower mids. An exaggerated form is Muffled. The opposite is Shrill

Decay – How a sound/note/resonance fades away over time. This is not always visible in the frequency response but could be visible in Cumulative Spectral Decay (Waterfall) plots. When there is some clarity and no mid bass emphasis most people describe the decay as fast.

Depth – How far away the instruments spacing is from back to front. In headphones this is not always perceived as such. Sometimes a gradual downward slope between 300Hz and 10kHz can also give a sense of depth imitating a greater distance from an actual music performance.

Detail – When something sounds detailed the frequency band between 1kHz and 3kHz is usually audibly flat or neutral and is more or less flat between 3kHz and 10kHz or has some mild emphasis (a few dB) or a peak in that part of the frequency range. An exaggerated form is Edgy, Sharpness, Sibliance.

Dynamic – When a headphone sounds dynamic it sounds ‘lively’ and instruments and voices ‘pop-out’ of clearly. It is the opposite of a relaxed, muffled or compressed sound. When a recording is truly dynamic there is a substantial difference between soft sounds and peaks.
You need to play the music loud to hear small details. Funnily enough when you can hear small details already at lower playback levels the sound is actually compressed yet many say, incorrectly, this is a dynamic sound.
The word dynamic is also used to describe a type of headphone driver (the most common type) where a magnet and voice-coil is used to create sound.
Then there is also dynamic range in an electrical sense. This describes how many difference in decibel there is between the signal and the noise and distortion products.
And finally Dynamic Range as in DR-rating. This describes how the difference between the average levels and peak levels in a recording.
Usually the recordings with DR ratings between 10 and 20 sound more realistic than recordings between 2 and 5.

Edgy or Etched – Between 3kHz and 7kHz, as a subset of brightness and Detailed, an exaggerated form of brightness/detail is Edgy, Etched, Sharpness, Sibliance. The opposite (a dip) can be described as lacking Finesse, dis-attached Treble.

Ethereal – Describes a slight treble elevated sound that feels ‘fragile’ and slightly unnatural but otherwise neutral and pleasant.

Finesse – This describes fine and delicate details heard in music. When one can hear small details clearly, realistic and well defined as in not ‘coarse’.
The opposite of finesse is grainy.

Forward – When voices and instruments pop-out. This usually is caused by a mild peak in the 1kHz to 2kHz range. A small peak is usually not very detrimental to the sound at all. The opposite of forward is ‘laid-back’.

Flat – When the frequency response does not deviate in the audible range something is considered flat.
When this word is used with headphones or speakers one usually finds that it doesn’t color the sound (too much). Strangely enough not everyone that describes a sound as flat it actually is flat. So one man’s ‘flat’ can be another ones bassy, boring, midrangy, or sharp sound.

Fun – This often describes a sound with elevated treble (>5kHz) and bass (<100Hz), also called ‘U shaped’ or ‘bathtub shaped’ sound.

Grainy – Also called ‘gritty’ and describes a sound that lacks finesse. The finer details are rendered a bit coarse. Often this is caused by lots of sharp peaks and deep dips in the treble area above 6kHz.

Harsh – This describes a sound similar as grainy but more concentrated in the midrange from 1kHz to 5kHz rather than in the treble range. Often distortion levels >1% in the midrange and or sharp peaks in that frequency range are the cause.

Highs – Also called treble, these are the upper frequencies above 5kHz.

Imaging – When a headphone or speaker images well it is very easy to pinpoint where instruments are and that image is stable and ‘sharp’. This requires a flat frequency response that is also matched between the two drivers, and detailed recordings. Slightly angled headphone drivers can have better ‘imaging’ as they make use of the ear-shape.

Laid back – describes a relaxed sound signature and is caused by an audible and rather wide dip around 2-4kHz. It is the opposite of forward and dynamic. Upper mids and lower treble is under-emphasized and is found to be pleasant for longer listening sessions when the dip is just slight (between 5 and 10dB).

Lush – is similar to Laid back. It describes a rich, warm-pillowy sound usually with slightly elevated lower frequencies and rather wide dip around 2-4kHz with a good treble quality that is a bit higher in level than when said to be laid back.

Microphonics – sensitivity to touch. It could be present in some electronic components but in the case of headphones it refers to mechanically ‘conducted’ sounds of the cable. When one touches the earcups of a headphone one can hear this quite clearly. Depending on how a cable is coupled to the cup touching the cable does a similar thing but attenuated compared to touching the cups directly.
Cloth covered, thick and rigid cables usually are worse in this aspect then softer and thinner smooth surface cables. When microphonics are bad you can hear a cable rubbing against clothes when moving when music isn’t even playing that loud. Some are annoyed others simply don’t mind.

Mids/Midrange – Frequencies between roughly 250Hz and 4kHz. One can divide this in lower mids (250Hz – 500Hz) where the ‘body’ of voices and instrumens is. The mids (500Hz – 1500Hz) and upper mids (1.5kHz to 4kHz). In the upper mids the ‘clarity’ of voices and instruments as well as ‘attack’ of musical instruments can be found.

Muddy – Describes the sound of elevated lower mids/upper bass where the bass is not clearly defined and ‘blends in’ with the rest of the music in an unnaturally elevated way.

Muffled – sound is when it would appear as though the sound is coming from a speaker with a wool blanket draped over it. Lacking in clarity and treble. Frequencies above 2kHz are subdued and or rolled-off.

Natural – When a headphone sounds natural all instruments and voices are reproduced in a realistic manner without coloration, a correct stereo-image and with a dynamic sound.

Neutral – and neutral are closely the same but neutral is more about the reproduction of sound not being colored. All frequencies are reproduced at the proper level.
There can be concensus about how a headphone should measure to be considered neutral.
However, there are many people stating a headphone or speaker sounds neutral to them but in reality is quite colored. The term neutral used in subjective reviews thus can be considered a meaningless term unless comparisons are made to known neutral headphones.

Openness – Has little to do with a headphone being open or closed. It usually refers to a detailed and neutral sounding headphone where instruments have a clear separation. Headphones with a good clarity are often described as being ‘open’ sounding.

Punch – powerful bass and drum hits. A small emphasis around 100-150Hz can relatively boost harmonics of bass drum/bass.

Rolled-off – Sound can be rolled-off in the lows and in the treble or both. Rolled-off bass is lacking in lower bass and can have slightly subdued bass as well. When the treble is rolled off frequencies above 10kHz usually are low in amplitude. When this happens usually the sound is not considered airy and lacks sparkle.

Sharpness – An emphasis between 2kHz and 6kHz can give instruments and voices the impression of being ‘highlighted’ and sharply detailed.

Shrill – An exaggerated form of sharpness usually with an emphasis between 4kHz and 10kHz and is unpleasant, shreaky.

Sibilant / sibilance – A sharp peak in the frequency band between 5kHz and 8kHz can emphasize the ‘s’ sounds in words. A high peak (>5dB) can sound piercing and sharp and is usually fatiguing and unpleasant during longer listening sessions. A lower peak between 3dB and 5dB can create an illusion of ‘detail’ in music and ‘sharply’ defined instruments. In the long run also fatiguing.

Signature – the overall tonal balance of a headphone. One could describe a signature in many ways as found in this article.

Soft – can be used to describe the loudness (amplitude as in how loud the sound is) or the ‘texture’ of the treble. For the latter it can describe the amplitude opposite the bass and mids or define treble quality. When treble is not splashy elevated, sibilant or coarse but ‘sweet’ and pleasant it is often described as soft. When the frequency response is quite ‘flat’ between 6kHz and 15kHz without sharp peaks and dips the treble quality usually is good and depending on the relative level can be said to be soft or sweet.

Soundstage – Describes in 3d terms (height, width and depth) where one perceives recorded instruments. For headphones it is very rare to hear instruments ‘projected’ clearly meters in front of you without special recordings or software. In the vast majority of cases sounds are between left and right ear only. For headphones instead of soundstage the word headstage is used.
This describes how well instruments are defined between left and right and how ‘sharp’ they can be pinpointed. One has to realize the stereo image is created in the studio during mixing/mastering process and is artificial in most cases. For headphones there is crossfeed which makes some recordings easier to listen to but this too is just a ‘trick’.

Sparkle – Usually caused by a slight emphasis in the (upper) treble and describes vibrant treble.

Spatial – describes how sounds can be heard all around you. Requires digital trickery to pull this of with headphones. Usually to get spatial sound multiple speakers are required placed around the listening position. Related terms: crossfeed and soundstage/headstage.

Sweet – Corresponds with ‘lush’. It describes a ‘soft’ sound signature usually with slightly elevated lows and soft (not grainy or splashy) highs.

Timbre – The tone of a note from an instrument. Timbre is determined by the ratio between the fundamental tone and its harmonics. Headphones with a ‘flat’ tonal response should have the proper timbre. When the tonal balance is not flat timbre can be affected and changed. This can sometime be for the better or worse and depends on the recording. Do note that in studios timbre of each individual instrument is often adjusted to fit in the recording. This can’t be undone.

Tonal Balance – describes how ‘flat’ a headphone is. When the bass is boosted the tonal balance is bassy, when treble is boosted the tonal balance is ‘bright’. When a headphone is described as tonally balanced no specific frequency bands are popping out or are subdued.
The real snag here is that some owners may feel a headphone is tonally balanced but in reality it may be bassy, bass-shy, warm, cold, midrangy or bright for instance. Tonally balanced = realistic sounding with no emphasis on anything.
Objectively tonal balance is difficult to prove headphone measurements can differ substantially between test-rigs.

Transparent – Similar to clarity it is a clean clear open and detailed quality.

Warm/warmth – a general downward tilt in the frequency range between 300Hz and 3kHz. The opposite is cold.

Jim Shaw said:
For example, what does unusually low impedance mean in ohms?

For headphones I would say anything below 16 ohm
For speakers I would say impedances dropping below 3 ohm

Jim Shaw said:
What does excessive distortion in various conditions mean?

When you can hear the sound being degraded it is excessive. You cannot pin a number on it because there are several kinds of distortion and music can mask distortion.

Jim Shaw said:
What is sub-bass?

see post # 4

Jim Shaw said:
What are mid-lows?

see post #4

Jim Shaw said:
Can anyone define Dry? Stage? Depth?

Dry: lack of 'live sound', artificial studio sound.

Stage: Good stage = one can easily pinpoint where instruments vocals are from left to right.
Sound stage for speakers, headstage for headphones

Depth: Not the same experience for everyone ... is is brain and equipment dependent.. with speakers you can get a sense of sounds coming from behind other sounds. Often micing, mixing, reverb, hall effect related which allows the brain to 'guess' from which distance sounds appear to come from.

There are also lots of people giving a different meaning or interpretation to all above mentioned 'sound descriptions'.

Jim Shaw · Apr 5, 2022

solderdude said:
Airy – Describes the sense of space and openness of the music. Best heard in cymbals where the shimmer is realistic. This mostly has a relation with treble extension. It should be mentioned that not all well extended headphones sound ‘airy’ and some not so well extended headphones can still sound sort of ‘airy’. The superlative (exaggerated) form af ‘airy’ sound is ‘ethereal‘ where the sound feels very ‘fragile’ and overly ‘delicate’.

Analytical – A term used to describe a product that produces a high level of detail in reference to the music being played back. In technical terms this means the tonal response is between ‘flat’ and slightly elevated above 3kHz.

Balance – This could mean balance between channels or sometimes people use the term to describe Tonal Balance.
Balance between channels means the stereo image is right in the middle. In some cases balance can be correct for a certain part of the frequency range and not for higher or lower frequencies. This can be spotted in measurements. In sound it can lead to a fuzzy position of a stereo image.
Also one sided seal loss with headphones can cause this when bass is less prominent in the leaky side. On measurements with perfect seal this won’t be obvious.

Bassy – This is the lower end frequency of the human hearing. Bass can be measured in quantity (heaviness) and quality. Other bass descriptors are muddy, punchy, rubbery, boomy and fat.
The bass area is between 25Hz and 300Hz but harmonics can go as high as 5kHz. Peaks and dips in the frequency range between 20Hz and 500Hz can make the bass sound different from realistic. For descriptors and the frequency range they fall into see the chart above.
The opposite of bassy is Lean

Bass (Lows) – can be divided 4 ranges: sub-bass (10Hz-25Hz) this is not audible as tones or notes but deep rumbles in movies are an example. When it is absent most people won’t miss it. That is once you heard what bass extension does it can add to the musical experience. For instance with Piano music where pedals are used this lowest part can make the piano sound more real.
Lower bass (25Hz-50Hz) is the part of the bass where the deepest musical notes are. A small and gentle roll-off usually isn’t very audible. A substantial roll-off (-10dB) is quite audible.
Bass (50Hz-120Hz) is the part where most of the ‘body’ of bass instruments is. Sharp dips or peaks and emphasis or subdued parts here can color the bass, make it sound boomy, exaggerated, full, tight, dis-attached or muddy. The best sounding headphones/speakers have a flat response in the area. Some folks prefer substantially more bass than others. When traveling a bit more bass usually is welcome. High-bass (120Hz-250Hz) is where the harmonics of most bass sound producing instruments are. A small peak there may make bass sound tighter. Too much (>5dB) can make bass sound muddy and boomy. A dip in this area can make bass instruments sound ‘wrong’ or ‘disattached’. This is where bass appears to be ‘apart’ from the music. Some people actually prefer a dip, certainly when the lower frequencies are elevated.

Bloat – Bloat is present usually in the mid bass. When frequencies below 400Hz are lifted opposite the mids and highs the bass can sound ‘bloated’ and overdone. When there is a dip around 150Hz this is the opposite of bloated and the bass sounds disattached.
Bloated bass often also causes ‘bloom‘ in mens voices. In this case the male voice sounds unnaturally bassy/full and lacks clarity.
When only the part below 150Hz is boosted the bass doesn’t sound ‘bloated’.

Bright/Brightness – A small emphasis of upper mids. The frequency band between 1kHz and 7kHz. Brightness is a wider bandwidth form of Clarity + Detail, when exaggerated even more it becomes Shrill.

Clarity – A few dB emphasis between 1kHz and 3kHz can add some clarity for voices and instruments. The opposite (a dip) of clarity (seen a lot in headphones) could be described as Laid back, Sweet, lacking Attack. An exaggerated form is shrill, cold, hard.

Coarse – is another word for Grainy or Gritty and describes uneven treble response with deep dips and peaks. Note that measurements made with measurement gear using a fake Pinna can show substantial peaks and dips above 5kHz caused by the fake Pinna but may not indicate roughness nor should one try to EQ this out.

Cold – a general upward tilt in the frequency range between 300Hz and 3kHz. The opposite is warmth.

Congestion – Something sounds congested when it the sound is muffled and not very dynamic. It usually is a combination of emphasis in warmth and laid back sound / lacking in clarity.

Compressed – This can mean 2 things. There is compressed data and compressed music. Compressed data means that a file has been shrunk in size. Most people have heard of ZIP and RAR where one or more files are smaller in size so they can be sent as an attachment. This is lossless compression where the file can be reconstructed to its original size and then can be used. In Audio there are many lossless formats. The biggest difference with something like ZIP is that a ZIP file needs to be ‘unpacked’ before one can use the file (song or album) where audio files are de-compressed while the song is being played back. One can also compress files by leaving out data which can not be recovered later. The most well known lossy compression files are JPG (for pictures) and MP3 for music but there are many more formats. During the compression (encoding) one can usually select how much one throws away. More compression = more loss of fidelity.
When music is compressed the louder parts in a recording and lowered in volume and/or softer parts of a recording are amplified. Usually this is done in studios to get a more even sound. There is a fine line between this being done craft-fully (usually for each instrument its own compression) and it being overdone in dramatic ways. This sadly is very common for most popular recordings. Search for ‘loudness wars’.
When a headphone sounds compressed it usually is bit bloated sounding and lacks realism/clarity/dynamics.

Crisp – is another description of having clarity, an exaggerated form is shrill. The opposite is laid back.

Dark/Darkness – Describes a gradual downward slope of frequencies above 3kHz often combined with an emphasis on bass/lower mids. An exaggerated form is Muffled. The opposite is Shrill

Decay – How a sound/note/resonance fades away over time. This is not always visible in the frequency response but could be visible in Cumulative Spectral Decay (Waterfall) plots. When there is some clarity and no mid bass emphasis most people describe the decay as fast.

Depth – How far away the instruments spacing is from back to front. In headphones this is not always perceived as such. Sometimes a gradual downward slope between 300Hz and 10kHz can also give a sense of depth imitating a greater distance from an actual music performance.

Detail – When something sounds detailed the frequency band between 1kHz and 3kHz is usually audibly flat or neutral and is more or less flat between 3kHz and 10kHz or has some mild emphasis (a few dB) or a peak in that part of the frequency range. An exaggerated form is Edgy, Sharpness, Sibliance.

Dynamic – When a headphone sounds dynamic it sounds ‘lively’ and instruments and voices ‘pop-out’ of clearly. It is the opposite of a relaxed, muffled or compressed sound. When a recording is truly dynamic there is a substantial difference between soft sounds and peaks.
You need to play the music loud to hear small details. Funnily enough when you can hear small details already at lower playback levels the sound is actually compressed yet many say, incorrectly, this is a dynamic sound.
The word dynamic is also used to describe a type of headphone driver (the most common type) where a magnet and voice-coil is used to create sound.
Then there is also dynamic range in an electrical sense. This describes how many difference in decibel there is between the signal and the noise and distortion products.
And finally Dynamic Range as in DR-rating. This describes how the difference between the average levels and peak levels in a recording.
Usually the recordings with DR ratings between 10 and 20 sound more realistic than recordings between 2 and 5.

Edgy or Etched – Between 3kHz and 7kHz, as a subset of brightness and Detailed, an exaggerated form of brightness/detail is Edgy, Etched, Sharpness, Sibliance. The opposite (a dip) can be described as lacking Finesse, dis-attached Treble.

Ethereal – Describes a slight treble elevated sound that feels ‘fragile’ and slightly unnatural but otherwise neutral and pleasant.

Finesse – This describes fine and delicate details heard in music. When one can hear small details clearly, realistic and well defined as in not ‘coarse’.
The opposite of finesse is grainy.

Forward – When voices and instruments pop-out. This usually is caused by a mild peak in the 1kHz to 2kHz range. A small peak is usually not very detrimental to the sound at all. The opposite of forward is ‘laid-back’.

Flat – When the frequency response does not deviate in the audible range something is considered flat.
When this word is used with headphones or speakers one usually finds that it doesn’t color the sound (too much). Strangely enough not everyone that describes a sound as flat it actually is flat. So one man’s ‘flat’ can be another ones bassy, boring, midrangy, or sharp sound.

Fun – This often describes a sound with elevated treble (>5kHz) and bass (<100Hz), also called ‘U shaped’ or ‘bathtub shaped’ sound.

Grainy – Also called ‘gritty’ and describes a sound that lacks finesse. The finer details are rendered a bit coarse. Often this is caused by lots of sharp peaks and deep dips in the treble area above 6kHz.

Harsh – This describes a sound similar as grainy but more concentrated in the midrange from 1kHz to 5kHz rather than in the treble range. Often distortion levels >1% in the midrange and or sharp peaks in that frequency range are the cause.

Highs – Also called treble, these are the upper frequencies above 5kHz.

Imaging – When a headphone or speaker images well it is very easy to pinpoint where instruments are and that image is stable and ‘sharp’. This requires a flat frequency response that is also matched between the two drivers, and detailed recordings. Slightly angled headphone drivers can have better ‘imaging’ as they make use of the ear-shape.

Laid back – describes a relaxed sound signature and is caused by an audible and rather wide dip around 2-4kHz. It is the opposite of forward and dynamic. Upper mids and lower treble is under-emphasized and is found to be pleasant for longer listening sessions when the dip is just slight (between 5 and 10dB).

Lush – is similar to Laid back. It describes a rich, warm-pillowy sound usually with slightly elevated lower frequencies and rather wide dip around 2-4kHz with a good treble quality that is a bit higher in level than when said to be laid back.

Microphonics – sensitivity to touch. It could be present in some electronic components but in the case of headphones it refers to mechanically ‘conducted’ sounds of the cable. When one touches the earcups of a headphone one can hear this quite clearly. Depending on how a cable is coupled to the cup touching the cable does a similar thing but attenuated compared to touching the cups directly.
Cloth covered, thick and rigid cables usually are worse in this aspect then softer and thinner smooth surface cables. When microphonics are bad you can hear a cable rubbing against clothes when moving when music isn’t even playing that loud. Some are annoyed others simply don’t mind.

Mids/Midrange – Frequencies between roughly 250Hz and 4kHz. One can divide this in lower mids (250Hz – 500Hz) where the ‘body’ of voices and instrumens is. The mids (500Hz – 1500Hz) and upper mids (1.5kHz to 4kHz). In the upper mids the ‘clarity’ of voices and instruments as well as ‘attack’ of musical instruments can be found.

Muddy – Describes the sound of elevated lower mids/upper bass where the bass is not clearly defined and ‘blends in’ with the rest of the music in an unnaturally elevated way.

Muffled – sound is when it would appear as though the sound is coming from a speaker with a wool blanket draped over it. Lacking in clarity and treble. Frequencies above 2kHz are subdued and or rolled-off.

Natural – When a headphone sounds natural all instruments and voices are reproduced in a realistic manner without coloration, a correct stereo-image and with a dynamic sound.

Neutral – and neutral are closely the same but neutral is more about the reproduction of sound not being colored. All frequencies are reproduced at the proper level.
There can be concensus about how a headphone should measure to be considered neutral.
However, there are many people stating a headphone or speaker sounds neutral to them but in reality is quite colored. The term neutral used in subjective reviews thus can be considered a meaningless term unless comparisons are made to known neutral headphones.

Openness – Has little to do with a headphone being open or closed. It usually refers to a detailed and neutral sounding headphone where instruments have a clear separation. Headphones with a good clarity are often described as being ‘open’ sounding.

Punch – powerful bass and drum hits. A small emphasis around 100-150Hz can relatively boost harmonics of bass drum/bass.

Rolled-off – Sound can be rolled-off in the lows and in the treble or both. Rolled-off bass is lacking in lower bass and can have slightly subdued bass as well. When the treble is rolled off frequencies above 10kHz usually are low in amplitude. When this happens usually the sound is not considered airy and lacks sparkle.

Sharpness – An emphasis between 2kHz and 6kHz can give instruments and voices the impression of being ‘highlighted’ and sharply detailed.

Shrill – An exaggerated form of sharpness usually with an emphasis between 4kHz and 10kHz and is unpleasant, shreaky.

Sibilant / sibilance – A sharp peak in the frequency band between 5kHz and 8kHz can emphasize the ‘s’ sounds in words. A high peak (>5dB) can sound piercing and sharp and is usually fatiguing and unpleasant during longer listening sessions. A lower peak between 3dB and 5dB can create an illusion of ‘detail’ in music and ‘sharply’ defined instruments. In the long run also fatiguing.

Signature – the overall tonal balance of a headphone. One could describe a signature in many ways as found in this article.

Soft – can be used to describe the loudness (amplitude as in how loud the sound is) or the ‘texture’ of the treble. For the latter it can describe the amplitude opposite the bass and mids or define treble quality. When treble is not splashy elevated, sibilant or coarse but ‘sweet’ and pleasant it is often described as soft. When the frequency response is quite ‘flat’ between 6kHz and 15kHz without sharp peaks and dips the treble quality usually is good and depending on the relative level can be said to be soft or sweet.

Soundstage – Describes in 3d terms (height, width and depth) where one perceives recorded instruments. For headphones it is very rare to hear instruments ‘projected’ clearly meters in front of you without special recordings or software. In the vast majority of cases sounds are between left and right ear only. For headphones instead of soundstage the word headstage is used.
This describes how well instruments are defined between left and right and how ‘sharp’ they can be pinpointed. One has to realize the stereo image is created in the studio during mixing/mastering process and is artificial in most cases. For headphones there is crossfeed which makes some recordings easier to listen to but this too is just a ‘trick’.

Sparkle – Usually caused by a slight emphasis in the (upper) treble and describes vibrant treble.

Spatial – describes how sounds can be heard all around you. Requires digital trickery to pull this of with headphones. Usually to get spatial sound multiple speakers are required placed around the listening position. Related terms: crossfeed and soundstage/headstage.

Sweet – Corresponds with ‘lush’. It describes a ‘soft’ sound signature usually with slightly elevated lows and soft (not grainy or splashy) highs.

Timbre – The tone of a note from an instrument. Timbre is determined by the ratio between the fundamental tone and its harmonics. Headphones with a ‘flat’ tonal response should have the proper timbre. When the tonal balance is not flat timbre can be affected and changed. This can sometime be for the better or worse and depends on the recording. Do note that in studios timbre of each individual instrument is often adjusted to fit in the recording. This can’t be undone.

Tonal Balance – describes how ‘flat’ a headphone is. When the bass is boosted the tonal balance is bassy, when treble is boosted the tonal balance is ‘bright’. When a headphone is described as tonally balanced no specific frequency bands are popping out or are subdued.
The real snag here is that some owners may feel a headphone is tonally balanced but in reality it may be bassy, bass-shy, warm, cold, midrangy or bright for instance. Tonally balanced = realistic sounding with no emphasis on anything.
Objectively tonal balance is difficult to prove headphone measurements can differ substantially between test-rigs.

Transparent – Similar to clarity it is a clean clear open and detailed quality.

Warm/warmth – a general downward tilt in the frequency range between 300Hz and 3kHz. The opposite is cold.

For headphones I would say anything below 16 ohm
For speakers I would say impedances dropping below 3 ohm

When you can hear the sound being degraded it is excessive. You cannot pin a number on it because there are several kinds of distortion and music can mask distortion.

see post # 4

see post #4

Dry: lack of 'live sound', artificial studio sound.

Stage: Good stage = one can easily pinpoint where instruments vocals are from left to right.
Sound stage for speakers, headstage for headphones

Depth: Not the same experience for everyone ... is is brain and equipment dependent.. with speakers you can get a sense of sounds coming from behind other sounds. Often micing, mixing, reverb, hall effect related which allows the brain to 'guess' from which distance sounds appear to come from.

There are also lots of people giving a different meaning or interpretation to all above mentioned 'sound descriptions'.

See, this is what I was hoping for. Sort of a minimal Wiki of audio terms. Bravo. This and the reply in comment #4 deserve places in such a Wiki. Thanks for providing these. This is what I was requesting. I can now look away when sysads scoff at the value of this so they can simply wax eloquent on expressing subjective information.

Additionally, it would be rational to attempt to follow a certain order prescription, for example, most important first in a review. Sort of how a scientific approach -- even to subjective observations -- might be conducted.

Thanks again. I wonder if there might be some useful effect on the 'subjective reviewers' if this became well known to all? I know they will at first react negatively.

Kal Rubinson · Apr 5, 2022

Jim Shaw said:
Thanks again. I wonder if there might be some useful effect on the 'subjective reviewers' if this became well known to all?

I think most 'subjective reviewers' do know this. The issue is that many will say that it constrains their ability to communicate although I think it quite the opposite.

Jim Shaw · Apr 5, 2022

Jim Shaw said:
See, this is what I was hoping for. Sort of a minimal Wiki of audio terms. Bravo. This and the reply in comment #4 deserve places in such a Wiki. Thanks for providing these. This is what I was requesting. I can now look away when sysads scoff at the value of this so they can simply wax eloquent on expressing subjective information.

Additionally, it would be rational to attempt to follow a certain order prescription, for example, most important first in a review. Sort of how a scientific approach -- even to subjective observations -- might be conducted.

Thanks again. I wonder if there might be some useful effect on the 'subjective reviewers' if this became well known to all? I know they will at first react negatively.

I liked to responses so well that I reproduce them below. (I ran them through Grammarly to fix some spellings and maybe improve punctuation. I tried to include credits for the information provided. I take no credit for the information provided, just some copying and pasting.

Perhaps this could be a working document capable of continued use and extension.

-------
From a response by Solderdude thru ASR:
Airy – Describes the sense of space and openness of the music. Best heard in cymbals where the shimmer is realistic. This mostly has a relation with treble extension. It should be mentioned that not all well-extended headphones sound ‘airy’ and some not so well extended headphones can still sound sort of ‘airy’. The superlative (exaggerated) form of an ‘airy’ sound is ‘ethereal‘ where the sound feels very ‘fragile’ and overly ‘delicate’.

Analytical – A term used to describe a product that produces a high level of detail in reference to the music being played back. In technical terms, this means the tonal response is between ‘flat’ and slightly elevated above 3kHz.

Balance – This could mean balance between channels or sometimes people use the term to describe Tonal Balance.
Balance between channels means the stereo image is right in the middle. In some cases, balance can be correct for a certain part of the frequency range and not for higher or lower frequencies. This can be spotted in measurements. In sound, it can lead to a fuzzy position of a stereo image.
Also, one-sided seal loss with headphones can cause this when bass is less prominent in the leaky side. On measurements with a perfect seal this won’t be obvious.

Bassy – This is the lower end frequency of human hearing. Bass can be measured in quantity (heaviness) and quality. Other bass descriptors are muddy, punchy, rubbery, boomy, and fat.
The bass area is between 25Hz and 300Hz but harmonics can go as high as 5kHz. Peaks and dips in the frequency range between 20Hz and 500Hz can make the bass sound different from realistic. For descriptors and the frequency range, they fall into see the chart above.
The opposite of bassy is Lean

Bass (Lows) – can be divided into 4 ranges: sub-bass (10Hz-25Hz) this is not audible as tones or notes but deep rumbles in movies are an example. When it is absent most people won’t miss it. That is once you heard what bass extension does it can add to the musical experience. For instance, with Piano music where pedals are used this lowest part can make the piano sound more real.
Lower bass (25Hz-50Hz) is the part of the bass where the deepest musical notes are. A small and gentle roll-off usually isn’t very audible. A substantial roll-off (-10dB) is quite audible.
Bass (50Hz-120Hz) is the part where most of the ‘body’ of bass instruments is. Sharp dips or peaks and emphasis or subdued parts here can color the bass, make it sound boomy, exaggerated, full, tight, dis-attached or muddy. The best sounding headphones/speakers have a flat response in the area. Some folks prefer substantially more bass than others. When traveling a bit more bass usually is welcome. High-bass (120Hz-250Hz) is where the harmonics of most bass sound-producing instruments are. A small peak there may make bass sound tighter. Too much (>5dB) can make bass sound muddy and boomy. A dip in this area can make bass instruments sound ‘wrong’ or ‘disattached’. This is where bass appears to be ‘apart’ from the music. Some people actually prefer a dip, certainly when the lower frequencies are elevated.

Bloat – Bloat is present usually in the mid-bass. When frequencies below 400Hz are lifted opposite the mids and highs the bass can sound ‘bloated’ and overdone. When there is a dip around 150Hz this is the opposite of bloated and the bass sounds disattached.
Bloated bass often also causes ‘bloom‘ in men's voices. In this case, the male voice sounds unnaturally bassy/full and lacks clarity. When only the part below 150Hz is boosted the bass doesn’t sound ‘bloated’.
Bright/Brightness – A small emphasis of upper mids. The frequency band between 1kHz and 7kHz. Brightness is a wider bandwidth form of Clarity + Detail, when exaggerated even more it becomes Shrill.

Clarity – A few dB emphasis between 1kHz and 3kHz can add some clarity for voices and instruments. The opposite (a dip) of clarity (seen a lot in headphones) could be described as Laidback, Sweet, lacking Attack. An exaggerated form is shrill, cold, hard.

Coarse – is another word for Grainy or Gritty and describes uneven treble response with deep dips and peaks. Note that measurements made with measurement gear using a fake Pinna can show substantial peaks and dips above 5kHz caused by the fake Pinna but may not indicate roughness nor should one try to EQ this out.

Cold – a general upward tilt in the frequency range between 300Hz and 3kHz. The opposite is warmth.

Congestion – Something sounds congested when it the sound is muffled and not very dynamic. It usually is a combination of emphasis on warmth and laid-back sound / lacking in clarity.

Compressed – This can mean 2 things. There is compressed data and compressed music. Compressed data means that a file has been shrunk in size. Most people have heard of ZIP and RAR where one or more files are smaller in size so they can be sent as an attachment. This is lossless compression where the file can be reconstructed to its original size and then can be used. In Audio there are many lossless formats. The biggest difference with something like ZIP is that a ZIP file needs to be ‘unpacked’ before one can use the file (song or album) where audio files are de-compressed while the song is being played back. One can also compress files by leaving out data that can not be recovered later. The most well known lossy compression files are JPG (for pictures) and MP3 for music but there are many more formats. During the compression (encoding) one can usually select how much one throws away. More compression = more loss of fidelity.
When music is compressed the louder parts in a recording and lowered in volume and/or softer parts of a recording are amplified. Usually, this is done in studios to get a more even sound. There is a fine line between this being done craft-fully (usually for each instrument its own compression) and it being overdone in dramatic ways. This sadly is very common for most popular recordings. Search for ‘loudness wars’.
When a headphone sounds compressed it usually is a bit bloated sounding and lacks realism/clarity/dynamics.

Crisp – is another description of having clarity, an exaggerated form is shrill. The opposite is laid back.

Dark/Darkness – Describes a gradual downward slope of frequencies above 3kHz often combined with an emphasis on bass/lower mids. An exaggerated form is Muffled. The opposite is Shrill

Decay – How a sound/note/resonance fades away over time. This is not always visible in the frequency response but could be visible in Cumulative Spectral Decay (Waterfall) plots. When there is some clarity and no mid-bass emphasis most people describe the decay as fast.

Depth – How far away the instruments spacing is from back to front. In headphones, this is not always perceived as such. Sometimes a gradual downward slope between 300Hz and 10kHz can also give a sense of depth imitating a greater distance from an actual music performance.

Detail – When something sounds detailed the frequency band between 1kHz and 3kHz is usually audibly flat or neutral and is more or less flat between 3kHz and 10kHz or has some mild emphasis (a few dB) or a peak in that part of the frequency range. An exaggerated form is Edgy, Sharpness, Sibilance.

Dynamic – When a headphone sounds dynamic it sounds ‘lively’ and instruments and voices ‘ pop out’ of clearly. It is the opposite of a relaxed, muffled, or compressed sound. When a recording is truly dynamic there is a substantial difference between soft sounds and peaks.
You need to play the music loud to hear small details. Funnily enough, when you can hear small details already at lower playback levels the sound is actually compressed yet many say, incorrectly, this is a dynamic sound.
The word dynamic is also used to describe a type of headphone driver (the most common type) where a magnet and voice-coil are used to create sound.
Then there is also dynamic range in an electrical sense. This describes how many differences in decibel there is between the signal and the noise and distortion products.
And finally Dynamic Range as in DR-rating. This describes how the difference between the average levels and peak levels in a recording.
Usually the recordings with DR ratings between 10 and 20 sound more realistic than recordings between 2 and 5.

Edgy or Etched – Between 3kHz and 7kHz, as a subset of brightness and Detailed, an exaggerated form of brightness/detail is Edgy, Etched, Sharpness, Sibilance. The opposite (a dip) can be described as lacking Finesse, dis-attached Treble.

Ethereal – Describes a slight treble elevated sound that feels ‘fragile’ and slightly unnatural but otherwise neutral and pleasant.

Finesse – This describes fine and delicate details heard in music. When one can hear small details clearly, realistic and well defined as in not ‘coarse’.
The opposite of finesse is grainy.

Forward – When voices and instruments pop out. This usually is caused by a mild peak in the 1kHz to 2kHz range. A small peak is usually not very detrimental to the sound at all. The opposite of forward is ‘laid-back’.

Flat – When the frequency response does not deviate in the audible range something is considered flat.
When this word is used with headphones or speakers one usually finds that it doesn’t color the sound (too much). Strangely enough, not everyone that describes a sound as flat is is it actually is flat. So one man’s ‘flat’ can be another one's bassy, boring, midrangy, or sharp sound.

Fun – This often describes a sound with elevated treble (>5kHz) and bass (<100Hz), also called ‘U shaped’ or ‘bathtub shaped’ sound.

Grainy – Also called ‘gritty’ and describes a sound that lacks finesse. The finer details are rendered a bit coarse. Often this is caused by lots of sharp peaks and deep dips in the treble area above 6kHz.

Harsh – This describes a sound similar as grainy but more concentrated in the midrange from 1kHz to 5kHz rather than in the treble range. Often distortion levels >1% in the midrange and or sharp peaks in that frequency range are the cause.

Highs – Also called treble, these are the upper frequencies above 5kHz.

Imaging – When a headphone or speaker images well it is very easy to pinpoint where instruments are and that image is stable and ‘sharp’. This requires a flat frequency response that is also matched between the two drivers, and detailed recordings. Slightly angled headphone drivers can have better ‘imaging’ as they make use of the ear shape.

Laidback – describes a relaxed sound signature and is caused by an audible and rather wide dip around 2-4kHz. It is the opposite of forward and dynamic. Upper mids and lower treble is under-emphasized and is found to be pleasant for longer listening sessions when the dip is just slight (between 5 and 10dB).

Lush – is similar to Laid back. It describes a rich, warm-pillowy sound usually with slightly elevated lower frequencies and rather a wide dip around 2-4kHz with a good treble quality that is a bit higher in level than when said to be laid back.

Microphonics – sensitivity to touch. It could be present in some electronic components but in the case of headphones, it refers to mechanically ‘conducted’ sounds of the cable. When one touches the earcups of a headphone one can hear this quite clearly. Depending on how a cable is coupled to the cup touching the cable does a similar thing but is attenuated compared to touching the cups directly.
Cloth-covered, thick and rigid cables usually are worse in this aspect than softer and thinner smooth surface cables. When microphonics are bad you can hear a cable rubbing against clothes when moving when music isn’t even playing that loud. Some are annoying, others simply don’t mind.

Mids/Midrange – Frequencies between roughly 250Hz and 4kHz. One can divide this in lower mids (250Hz – 500Hz) where the ‘body’ of voices and instruments is. The mids (500Hz – 1500Hz) and upper mids (1.5kHz to 4kHz). In the upper mids the ‘clarity’ of voices and instruments, as well as the ‘attack’ of musical instruments, can be found.

Muddy – Describes the sound of elevated lower mids/upper bass where the bass is not clearly defined and ‘blends in’ with the rest of the music in an unnaturally elevated way.

Muffled – sound is when it would appear as though the sound is coming from a speaker with a wool blanket draped over it. Lacking in clarity and treble. Frequencies above 2kHz are subdued and or rolled off.

Natural – When a headphone sounds natural all instruments and voices are reproduced in a realistic manner without coloration, a correct stereo image, and with a dynamic sound.

Neutral – and neutral are closely the same but neutral is more about the reproduction of sound not being colored. All frequencies are reproduced at the proper level.
There can be consensus about how a headphone should measure to be considered neutral.
However, there are many people stating a headphone or speaker sounds neutral to them but in reality, is quite colored. The term neutral used in subjective reviews thus can be considered a meaningless term unless comparisons are made to known neutral headphones.

Openness – Has little to do with a headphone being open or closed. It usually refers to a detailed and neutral-sounding headphone where instruments have a clear separation. Headphones with good clarity are often described as being ‘open’ sounding.

Punch – powerful bass and drum hits. A small emphasis around 100-150Hz can relatively boost the harmonics of bass drum/bass.

Rolled-off – Sound can be rolled-off in the lows and in the treble or both. Rolled-off bass is lacking in lower bass and can have slightly subdued bass as well. When the treble is rolled off frequencies above 10kHz usually are low in amplitude. When this happens usually the sound is not considered airy and lacks sparkle.

Sharpness – An emphasis between 2kHz and 6kHz can give instruments and voices the impression of being ‘highlighted’ and sharply detailed.

Shrill – An exaggerated form of sharpness usually with an emphasis between 4kHz and 10kHz and is unpleasant, shreaky.

Sibilant/sibilance – A sharp peak in the frequency band between 5kHz and 8kHz can emphasize the ‘s’ sounds in words. A high peak (>5dB) can sound piercing and sharp and is usually fatiguing and unpleasant during longer listening sessions. A lower peak between 3dB and 5dB can create an illusion of ‘detail’ in music and ‘sharply’ defined instruments. In the long run also fatiguing.

Signature – the overall tonal balance of a headphone. One could describe a signature in many ways as found in this article.

Soft – can be used to describe the loudness (amplitude as in how loud the sound is) or the ‘texture’ of the treble. For the latter, it can describe the amplitude opposite the bass and mids or define treble quality. When treble is not splashy elevated, sibilant or coarse but ‘sweet’ and pleasant it is often described as soft. When the frequency response is quite ‘flat’ between 6kHz and 15kHz without sharp peaks and dips the treble quality usually is good and depending on the relative level can be said to be soft or sweet.

Soundstage – Describes in 3d terms (height, width, and depth) where one perceives recorded instruments. For headphones, it is very rare to hear instruments ‘projected’ clearly meters in front of you without special recordings or software. In the vast majority of cases, sounds are between the left and right ear only. For headphones instead of soundstage, the word headstage is used.
This describes how well instruments are defined between left and right and how ‘sharp’ they can be pinpointed. One has to realize the stereo image is created in the studio during the mixing/mastering process and is artificial in most cases. For headphones, there is a crossfeed which makes some recordings easier to listen to but this too is just a ‘trick’.

Sparkle – Usually caused by a slight emphasis in the (upper) treble and describes vibrant treble.

Spatial – describes how sounds can be heard all around you. Requires digital trickery to pull this of with headphones. Usually to get spatial sound multiple speakers are required placed around the listening position. Related terms: crossfeed and soundstage/headstage.

Sweet – Corresponds with ‘lush’. It describes a ‘soft’ sound signature usually with slightly elevated lows and soft (not grainy or splashy) highs.

Timbre – The tone of a note from an instrument. Timbre is determined by the ratio between the fundamental tone and its harmonics. Headphones with a ‘flat’ tonal response should have the proper timbre. When the tonal balance is not flat timbre can be affected and changed. This can sometimes be for the better or worse and depends on the recording. Do note that in studios timbre of each individual instrument is often adjusted to fit in the recording. This can’t be undone.

Tonal Balance – describes how ‘flat’ a headphone is. When the bass is boosted the tonal balance is bassy, when treble is boosted the tonal balance is ‘bright’. When a headphone is described as tonally balanced no specific frequency bands are popping out or are subdued.
The real snag here is that some owners may feel a headphone is tonally balanced but in reality, it may be bassy, bass-shy, warm, cold, midrangy or bright for instance. Tonally balanced = realistic sounding with no emphasis on anything.
Objectively tonal balance is difficult to prove headphone measurements can differ substantially between test rigs.

Transparent – Similar to clarity it is a clean clear open and detailed quality.

Warm/warmth – a general downward tilt in the frequency range between 300Hz and 3kHz. The opposite is cold.

Low Impedances
For headphones, I would say anything below 16 ohm
For speakers, I would say impedances dropping below 3 ohm

Audible Distortion
When you can hear the sound being degraded it is excessive. You cannot pin a number on it because there are several kinds of distortion and music can mask distortion.

I know that reviewers should have considerable freedom, especially to be wrong. But in the science of audio, V-curved, bright, excessive bass, or comments about male/female vocals would be nice to narrow down. Can anyone define "warm"? Dry? Stage? Depth?

Warm: see post # 4

Dry: lack of 'live sound', artificial studio sound.

Stage: Good stage = one can easily pinpoint where instruments and/or vocals are from left to right.
Sound stage for speakers, headstage for headphones

Depth: Not the same experience for everyone ... is brain and equipment dependent.. with speakers you can get a sense of sounds coming from behind other sounds. Often micing, mixing, reverb, hall effect are related which allows the brain to 'guess' from which distance sounds appear to come from.

There are also lots of people giving a different meaning or interpretation to all above mentioned 'sound descriptions'.

—----------------------------

Comment 4 reply
from Solderdude via ASR

aJ_v6VP9befFQn0NpCn0QlxaceKKvFE3WQS_J1MX5kgA6bMCsVM0-NTEbJ-w3mtzwGJN9ENtSddntdMzjMdZb3mb6U-0HmJfHGU2XaJPZMOFb-DIfYfPqqghQis0tLnItFM4tkn3

diddley · Apr 5, 2022

solderdude said:
Airy – Describes the sense of space and openness of the music. Best heard in cymbals where the shimmer is realistic. This mostly has a relation with treble extension. It should be mentioned that not all well extended headphones sound ‘airy’ and some not so well extended headphones can still sound sort of ‘airy’. The superlative (exaggerated) form af ‘airy’ sound is ‘ethereal‘ where the sound feels very ‘fragile’ and overly ‘delicate’.

Analytical – A term used to describe a product that produces a high level of detail in reference to the music being played back. In technical terms this means the tonal response is between ‘flat’ and slightly elevated above 3kHz.

Balance – This could mean balance between channels or sometimes people use the term to describe Tonal Balance.
Balance between channels means the stereo image is right in the middle. In some cases balance can be correct for a certain part of the frequency range and not for higher or lower frequencies. This can be spotted in measurements. In sound it can lead to a fuzzy position of a stereo image.
Also one sided seal loss with headphones can cause this when bass is less prominent in the leaky side. On measurements with perfect seal this won’t be obvious.

Bassy – This is the lower end frequency of the human hearing. Bass can be measured in quantity (heaviness) and quality. Other bass descriptors are muddy, punchy, rubbery, boomy and fat.
The bass area is between 25Hz and 300Hz but harmonics can go as high as 5kHz. Peaks and dips in the frequency range between 20Hz and 500Hz can make the bass sound different from realistic. For descriptors and the frequency range they fall into see the chart above.
The opposite of bassy is Lean

Bass (Lows) – can be divided 4 ranges: sub-bass (10Hz-25Hz) this is not audible as tones or notes but deep rumbles in movies are an example. When it is absent most people won’t miss it. That is once you heard what bass extension does it can add to the musical experience. For instance with Piano music where pedals are used this lowest part can make the piano sound more real.
Lower bass (25Hz-50Hz) is the part of the bass where the deepest musical notes are. A small and gentle roll-off usually isn’t very audible. A substantial roll-off (-10dB) is quite audible.
Bass (50Hz-120Hz) is the part where most of the ‘body’ of bass instruments is. Sharp dips or peaks and emphasis or subdued parts here can color the bass, make it sound boomy, exaggerated, full, tight, dis-attached or muddy. The best sounding headphones/speakers have a flat response in the area. Some folks prefer substantially more bass than others. When traveling a bit more bass usually is welcome. High-bass (120Hz-250Hz) is where the harmonics of most bass sound producing instruments are. A small peak there may make bass sound tighter. Too much (>5dB) can make bass sound muddy and boomy. A dip in this area can make bass instruments sound ‘wrong’ or ‘disattached’. This is where bass appears to be ‘apart’ from the music. Some people actually prefer a dip, certainly when the lower frequencies are elevated.

Bloat – Bloat is present usually in the mid bass. When frequencies below 400Hz are lifted opposite the mids and highs the bass can sound ‘bloated’ and overdone. When there is a dip around 150Hz this is the opposite of bloated and the bass sounds disattached.
Bloated bass often also causes ‘bloom‘ in mens voices. In this case the male voice sounds unnaturally bassy/full and lacks clarity.
When only the part below 150Hz is boosted the bass doesn’t sound ‘bloated’.

Bright/Brightness – A small emphasis of upper mids. The frequency band between 1kHz and 7kHz. Brightness is a wider bandwidth form of Clarity + Detail, when exaggerated even more it becomes Shrill.

Clarity – A few dB emphasis between 1kHz and 3kHz can add some clarity for voices and instruments. The opposite (a dip) of clarity (seen a lot in headphones) could be described as Laid back, Sweet, lacking Attack. An exaggerated form is shrill, cold, hard.

Coarse – is another word for Grainy or Gritty and describes uneven treble response with deep dips and peaks. Note that measurements made with measurement gear using a fake Pinna can show substantial peaks and dips above 5kHz caused by the fake Pinna but may not indicate roughness nor should one try to EQ this out.

Cold – a general upward tilt in the frequency range between 300Hz and 3kHz. The opposite is warmth.

Congestion – Something sounds congested when it the sound is muffled and not very dynamic. It usually is a combination of emphasis in warmth and laid back sound / lacking in clarity.

Compressed – This can mean 2 things. There is compressed data and compressed music. Compressed data means that a file has been shrunk in size. Most people have heard of ZIP and RAR where one or more files are smaller in size so they can be sent as an attachment. This is lossless compression where the file can be reconstructed to its original size and then can be used. In Audio there are many lossless formats. The biggest difference with something like ZIP is that a ZIP file needs to be ‘unpacked’ before one can use the file (song or album) where audio files are de-compressed while the song is being played back. One can also compress files by leaving out data which can not be recovered later. The most well known lossy compression files are JPG (for pictures) and MP3 for music but there are many more formats. During the compression (encoding) one can usually select how much one throws away. More compression = more loss of fidelity.
When music is compressed the louder parts in a recording and lowered in volume and/or softer parts of a recording are amplified. Usually this is done in studios to get a more even sound. There is a fine line between this being done craft-fully (usually for each instrument its own compression) and it being overdone in dramatic ways. This sadly is very common for most popular recordings. Search for ‘loudness wars’.
When a headphone sounds compressed it usually is bit bloated sounding and lacks realism/clarity/dynamics.

Crisp – is another description of having clarity, an exaggerated form is shrill. The opposite is laid back.

Dark/Darkness – Describes a gradual downward slope of frequencies above 3kHz often combined with an emphasis on bass/lower mids. An exaggerated form is Muffled. The opposite is Shrill

Decay – How a sound/note/resonance fades away over time. This is not always visible in the frequency response but could be visible in Cumulative Spectral Decay (Waterfall) plots. When there is some clarity and no mid bass emphasis most people describe the decay as fast.

Depth – How far away the instruments spacing is from back to front. In headphones this is not always perceived as such. Sometimes a gradual downward slope between 300Hz and 10kHz can also give a sense of depth imitating a greater distance from an actual music performance.

Detail – When something sounds detailed the frequency band between 1kHz and 3kHz is usually audibly flat or neutral and is more or less flat between 3kHz and 10kHz or has some mild emphasis (a few dB) or a peak in that part of the frequency range. An exaggerated form is Edgy, Sharpness, Sibliance.

Dynamic – When a headphone sounds dynamic it sounds ‘lively’ and instruments and voices ‘pop-out’ of clearly. It is the opposite of a relaxed, muffled or compressed sound. When a recording is truly dynamic there is a substantial difference between soft sounds and peaks.
You need to play the music loud to hear small details. Funnily enough when you can hear small details already at lower playback levels the sound is actually compressed yet many say, incorrectly, this is a dynamic sound.
The word dynamic is also used to describe a type of headphone driver (the most common type) where a magnet and voice-coil is used to create sound.
Then there is also dynamic range in an electrical sense. This describes how many difference in decibel there is between the signal and the noise and distortion products.
And finally Dynamic Range as in DR-rating. This describes how the difference between the average levels and peak levels in a recording.
Usually the recordings with DR ratings between 10 and 20 sound more realistic than recordings between 2 and 5.

Edgy or Etched – Between 3kHz and 7kHz, as a subset of brightness and Detailed, an exaggerated form of brightness/detail is Edgy, Etched, Sharpness, Sibliance. The opposite (a dip) can be described as lacking Finesse, dis-attached Treble.

Ethereal – Describes a slight treble elevated sound that feels ‘fragile’ and slightly unnatural but otherwise neutral and pleasant.

Finesse – This describes fine and delicate details heard in music. When one can hear small details clearly, realistic and well defined as in not ‘coarse’.
The opposite of finesse is grainy.

Forward – When voices and instruments pop-out. This usually is caused by a mild peak in the 1kHz to 2kHz range. A small peak is usually not very detrimental to the sound at all. The opposite of forward is ‘laid-back’.

Flat – When the frequency response does not deviate in the audible range something is considered flat.
When this word is used with headphones or speakers one usually finds that it doesn’t color the sound (too much). Strangely enough not everyone that describes a sound as flat it actually is flat. So one man’s ‘flat’ can be another ones bassy, boring, midrangy, or sharp sound.

Fun – This often describes a sound with elevated treble (>5kHz) and bass (<100Hz), also called ‘U shaped’ or ‘bathtub shaped’ sound.

Grainy – Also called ‘gritty’ and describes a sound that lacks finesse. The finer details are rendered a bit coarse. Often this is caused by lots of sharp peaks and deep dips in the treble area above 6kHz.

Harsh – This describes a sound similar as grainy but more concentrated in the midrange from 1kHz to 5kHz rather than in the treble range. Often distortion levels >1% in the midrange and or sharp peaks in that frequency range are the cause.

Highs – Also called treble, these are the upper frequencies above 5kHz.

Imaging – When a headphone or speaker images well it is very easy to pinpoint where instruments are and that image is stable and ‘sharp’. This requires a flat frequency response that is also matched between the two drivers, and detailed recordings. Slightly angled headphone drivers can have better ‘imaging’ as they make use of the ear-shape.

Laid back – describes a relaxed sound signature and is caused by an audible and rather wide dip around 2-4kHz. It is the opposite of forward and dynamic. Upper mids and lower treble is under-emphasized and is found to be pleasant for longer listening sessions when the dip is just slight (between 5 and 10dB).

Lush – is similar to Laid back. It describes a rich, warm-pillowy sound usually with slightly elevated lower frequencies and rather wide dip around 2-4kHz with a good treble quality that is a bit higher in level than when said to be laid back.

Microphonics – sensitivity to touch. It could be present in some electronic components but in the case of headphones it refers to mechanically ‘conducted’ sounds of the cable. When one touches the earcups of a headphone one can hear this quite clearly. Depending on how a cable is coupled to the cup touching the cable does a similar thing but attenuated compared to touching the cups directly.
Cloth covered, thick and rigid cables usually are worse in this aspect then softer and thinner smooth surface cables. When microphonics are bad you can hear a cable rubbing against clothes when moving when music isn’t even playing that loud. Some are annoyed others simply don’t mind.

Mids/Midrange – Frequencies between roughly 250Hz and 4kHz. One can divide this in lower mids (250Hz – 500Hz) where the ‘body’ of voices and instrumens is. The mids (500Hz – 1500Hz) and upper mids (1.5kHz to 4kHz). In the upper mids the ‘clarity’ of voices and instruments as well as ‘attack’ of musical instruments can be found.

Muddy – Describes the sound of elevated lower mids/upper bass where the bass is not clearly defined and ‘blends in’ with the rest of the music in an unnaturally elevated way.

Muffled – sound is when it would appear as though the sound is coming from a speaker with a wool blanket draped over it. Lacking in clarity and treble. Frequencies above 2kHz are subdued and or rolled-off.

Natural – When a headphone sounds natural all instruments and voices are reproduced in a realistic manner without coloration, a correct stereo-image and with a dynamic sound.

Neutral – and neutral are closely the same but neutral is more about the reproduction of sound not being colored. All frequencies are reproduced at the proper level.
There can be concensus about how a headphone should measure to be considered neutral.
However, there are many people stating a headphone or speaker sounds neutral to them but in reality is quite colored. The term neutral used in subjective reviews thus can be considered a meaningless term unless comparisons are made to known neutral headphones.

Openness – Has little to do with a headphone being open or closed. It usually refers to a detailed and neutral sounding headphone where instruments have a clear separation. Headphones with a good clarity are often described as being ‘open’ sounding.

Punch – powerful bass and drum hits. A small emphasis around 100-150Hz can relatively boost harmonics of bass drum/bass.

Rolled-off – Sound can be rolled-off in the lows and in the treble or both. Rolled-off bass is lacking in lower bass and can have slightly subdued bass as well. When the treble is rolled off frequencies above 10kHz usually are low in amplitude. When this happens usually the sound is not considered airy and lacks sparkle.

Sharpness – An emphasis between 2kHz and 6kHz can give instruments and voices the impression of being ‘highlighted’ and sharply detailed.

Shrill – An exaggerated form of sharpness usually with an emphasis between 4kHz and 10kHz and is unpleasant, shreaky.

Sibilant / sibilance – A sharp peak in the frequency band between 5kHz and 8kHz can emphasize the ‘s’ sounds in words. A high peak (>5dB) can sound piercing and sharp and is usually fatiguing and unpleasant during longer listening sessions. A lower peak between 3dB and 5dB can create an illusion of ‘detail’ in music and ‘sharply’ defined instruments. In the long run also fatiguing.

Signature – the overall tonal balance of a headphone. One could describe a signature in many ways as found in this article.

Soft – can be used to describe the loudness (amplitude as in how loud the sound is) or the ‘texture’ of the treble. For the latter it can describe the amplitude opposite the bass and mids or define treble quality. When treble is not splashy elevated, sibilant or coarse but ‘sweet’ and pleasant it is often described as soft. When the frequency response is quite ‘flat’ between 6kHz and 15kHz without sharp peaks and dips the treble quality usually is good and depending on the relative level can be said to be soft or sweet.

Soundstage – Describes in 3d terms (height, width and depth) where one perceives recorded instruments. For headphones it is very rare to hear instruments ‘projected’ clearly meters in front of you without special recordings or software. In the vast majority of cases sounds are between left and right ear only. For headphones instead of soundstage the word headstage is used.
This describes how well instruments are defined between left and right and how ‘sharp’ they can be pinpointed. One has to realize the stereo image is created in the studio during mixing/mastering process and is artificial in most cases. For headphones there is crossfeed which makes some recordings easier to listen to but this too is just a ‘trick’.

Sparkle – Usually caused by a slight emphasis in the (upper) treble and describes vibrant treble.

Spatial – describes how sounds can be heard all around you. Requires digital trickery to pull this of with headphones. Usually to get spatial sound multiple speakers are required placed around the listening position. Related terms: crossfeed and soundstage/headstage.

Sweet – Corresponds with ‘lush’. It describes a ‘soft’ sound signature usually with slightly elevated lows and soft (not grainy or splashy) highs.

Timbre – The tone of a note from an instrument. Timbre is determined by the ratio between the fundamental tone and its harmonics. Headphones with a ‘flat’ tonal response should have the proper timbre. When the tonal balance is not flat timbre can be affected and changed. This can sometime be for the better or worse and depends on the recording. Do note that in studios timbre of each individual instrument is often adjusted to fit in the recording. This can’t be undone.

Tonal Balance – describes how ‘flat’ a headphone is. When the bass is boosted the tonal balance is bassy, when treble is boosted the tonal balance is ‘bright’. When a headphone is described as tonally balanced no specific frequency bands are popping out or are subdued.
The real snag here is that some owners may feel a headphone is tonally balanced but in reality it may be bassy, bass-shy, warm, cold, midrangy or bright for instance. Tonally balanced = realistic sounding with no emphasis on anything.
Objectively tonal balance is difficult to prove headphone measurements can differ substantially between test-rigs.

Transparent – Similar to clarity it is a clean clear open and detailed quality.

Warm/warmth – a general downward tilt in the frequency range between 300Hz and 3kHz. The opposite is cold.

For headphones I would say anything below 16 ohm
For speakers I would say impedances dropping below 3 ohm

When you can hear the sound being degraded it is excessive. You cannot pin a number on it because there are several kinds of distortion and music can mask distortion.

see post # 4

see post #4

Dry: lack of 'live sound', artificial studio sound.

Stage: Good stage = one can easily pinpoint where instruments vocals are from left to right.
Sound stage for speakers, headstage for headphones

Depth: Not the same experience for everyone ... is is brain and equipment dependent.. with speakers you can get a sense of sounds coming from behind other sounds. Often micing, mixing, reverb, hall effect related which allows the brain to 'guess' from which distance sounds appear to come from.

There are also lots of people giving a different meaning or interpretation to all above mentioned 'sound descriptions'.

@solderdude: where would we be without you, thanks!

Equipment Review Organization And Uniform Terms

Jim Shaw

Addicted to Fun and Learning

DWPress

Major Contributor

amirm

Founder/Admin

Vini darko

Major Contributor

Vini darko

Major Contributor

DVDdoug

Major Contributor

Jim Shaw

Addicted to Fun and Learning

Midwest Blade

Senior Member

solderdude

Grand Contributor

Jim Shaw

Addicted to Fun and Learning

Kal Rubinson

Master Contributor

Jim Shaw

Addicted to Fun and Learning

diddley

Addicted to Fun and Learning

Similar threads