Pitfalls of AB testing

The pitfalls of AB testing are centered around how our brain works. During the normal day to day operation of the brain it has to map stimuli to a labeled branch of knowledge that means it collapses similar branches to one branch. If it has already heard a sound and then hears a distorted version of that sound it will map the distorted sound to the cleaner sound. That means when AB testing amplifiers that are different, the difference may only be heard the first time. On the second run the amplifiers will sound similar on the same sound track. Does that mean AB testing is impossible and unnecessary? As can be seen in these results here, these amplifiers are different in performance but the brain will correct up to 15% error so that they sound similar, the more correction applied the more pleasant as the brain is replaying the recorded familiar sound rather than the live amplifier sound.
 
  • Like
Reactions: head_unit
I would start at the very beginning. A key test for audio is the connection of channel-separated power supplies, "double mono". The 'differences gradually become smaller or inaudible', there is no such thing here.
I also recommend taking a short break from listening, because differences then become contrastively clearer: the whole organism hears, and it is an swing system that has to swing in and - at least somewhat - out.
And as always: do it first, test statements, then answer. I have enough A-B. Works great. Switching while listening: the smaller the differences, the more difficult it is to perceive them.

Soon I will post a video here in which I let you hear transistors of one type but of different batches, A-B tests;-)))

Aside: Almost all of the amplifiers "measured" above are not audio amplifiers - strictly. They are many stage complementary transistors pp amplifiers, many of them also half-wave unsymmetric. They do not produce a single clean tone. If you compare them with each other, you are comparing the same with the same.
And: THD is not a statement regarding sound.
 
Higher IMD values of 1% to 20% are heavily coveted as these are more meaty and fun as opposed to fast accurate and lean. Its just like using compression in mastering to add meat to the bass. This is more to do with what the brain does with stimulus it receives.
combined_dut_comparison_grid.png
 
The pitfalls of AB testing are centered around how our brain works. During the normal day to day operation of the brain it has to map stimuli to a labeled branch of knowledge that means it collapses similar branches to one branch. If it has already heard a sound and then hears a distorted version of that sound it will map the distorted sound to the cleaner sound.
Interesting. Do you have a reference for this or is this just speculation on your part? Your argument seems to hinge on this, so it would be nice to know if this is an established effect or not.

Tom
 
  • Like
Reactions: aslepekis
Yet skilled listeners can hear very small differences (so small they can't describe how they are different).
Sometimes these small differences are in overlooked uncontrolled variables. Which makes the test meaningless.
I think if differences are that small they become fairly irrelevant. It's also possible to think there was a difference when there wasn't because of the way we focus on the music each time. Instantaneous switching is the only way. A break relies on memory which is not reliable.
 
I disagree, I've experienced this phenomena. I hear something that one set up does that another set up doesn't do so well and I can't put my finger on it too well but the one system becomes irritating once I've heard the 'better' one. It's repeatable and not time-limited. I have gotten better at being able to describe it by reading audio reviews in which others with better language skills have found ways to do so.
 
  • Like
Reactions: Miller-8 and rayma
It's repeatable in the way that once a sound doesn't seem 'right' anymore because I heard a different system, it retains that impression when I listen again at future times. It's "sighted" testing of course but in full disclosure, I have no interest in blind testing as my goals don't require it, but no malice against those who pursue it.
 
It's repeatable in the way that once a sound doesn't seem 'right' anymore because I heard a different system, it retains that impression when I listen again at future times. It's "sighted" testing of course but in full disclosure, I have no interest in blind testing as my goals don't require it, but no malice against those who pursue it.
As long as it's sighted it's open to placebo, confirmation bias, expectation bias. It's therefore meaningless in the grand scheme of things. Audiophiles hate talking about these psychological effects and many try to hide from them.