• Welcome to ASR. There are many reviews of audio hardware and expert members to help answer your questions. Click here to have your audio equipment measured for free!

Mechano23: Klippel, VituixCAD, and LoudspeakerLab comparison and real measurements v. spec sheets

wigginjs

Member
Joined
Mar 26, 2026
Messages
6
Likes
30
Mechano23 is an unusually good DIY validation case. XMechanik published an excellent open-source speaker design and shared the driver measurements and VituixCAD project files in the original Mechano23 AudioScienceReview thread. Amir later measured the finished speaker on the Klippel NFS and shared images and measurement data in the Mechano23 AudioScienceReview review thread. That gives us a rare chance to compare the full chain: raw driver measurements, design software predictions (VituixCAD, LoudspeakerLab), and real-world loudspeaker performance. Design meausrement data from LoudspeakerLab is attached.

The goal of this post is to explor how close modeled designs get to real-world results and how does the completness of the meausrement dataset affect the accuracy. Also, how does LoudspeakerLab compare to the current DIY reference workflow in VituixCAD? Finally, how much do we give up if we design from manufacturer spec-sheet data instead of in-cabinet measurements?
on_axis_overlay.png

cta_overlay.png

polar_directivity_slices.png



h-contour-compare.jpg


v-contour-compare.jpg

impedance_overlay.png

crossover_transfer_overlay.png

klippel_error_heatmap.png

Compared Sources

  1. Klippel NFS measurement from Amir's Mechano23 AudioScienceReview review, treated here as ground truth.
  2. VituixCAD from XMechanik's original Mechano23 AudioScienceReview post, using exported frequency response, impedance, crossover-transfer, CTA, and directivity data from the shared VituixCAD project.
  3. Mechano23 LoudspeakerLab, using the in-cabinet driver measurements.
  4. Mechano23 LoudspeakerLab spec sheet, using manufacturer spec-sheet FR/ZMA data.

Reference Notes And Caveats

  • Klippel is treated as ground truth for acoustic response.
  • Klippel On-Axis, Listening Window, Early Reflections, and polar comparisons use Amir's provided horizontal and vertical SPL data.
  • Klippel Sound Power and Directivity Index use the best available extraction from the CEA2034 image, so those conclusions are lower confidence than the On-Axis, Listening Window, Early Reflections, and polar-shape conclusions.
  • Klippel impedance/phase are digitized from the supplied impedance image. This is good enough to compare the main impedance shape and minima, but lower precision than source text data.
  • Crossover-transfer errors are referenced to VituixCAD exported crossover-transfer data because Klippel does not provide electrical transfer-function data.
  • LoudspeakerLab data comes from the two public designs linked above, using their generated frequency response, impedance, CTA, crossover-transfer, and polar outputs.
  • The VituixCAD preference score shown in the shared VituixCAD materials is 8.139, using the VituixCAD default of omitting the low-frequency extension score; LoudspeakerLab's Preference Score is based more closely on Olive standard and includes the low-frequency extension penalty by default, but a "w/ Sub" Preference Score is also calculated which omits the low-frequency extension and produces a result more comparible to the defaul VituixCAD score.
Error cells are median / p95 / max dB over 100 Hz-16 kHz. SPL curves are level-aligned over 300 Hz-1 kHz before acoustic shape comparisons; DI is not level-aligned.

Input Data


Source

Input data

Angular coverage used by LL

VituixCAD

Shared Mechano23 VituixCAD project exports from in-cabinet driver measurements

As exported by VituixCAD

LoudspeakerLab

Same in-cabinet driver measurement family, public LL design

H 10..180 plus signed V -170..180 for both drivers

LoudspeakerLab (spec sheet)

Manufacturer FR/ZMA and sparse manufacturer horizontal polars

H30/H60 only; LL estimates the missing vertical and rear surface

Core Metrics Vs Klippel

This shows the delta (p95 error) between the Klippel data from Amir and the values from VituixCAD, LoudspeakerLab, and LoudspeakerLab using spec sheet. Lower is better on all scored except Preference Rating. p95 error is in dB over 100 Hz-16 kHz after level-aligning SPL curves over 300 Hz-1 kHz. Directivity Index is not level-aligned.


Source

On-Axis
Listening Window
Early Reflections
Sound Power
Predicted In-Room
Directivity Index
Preference Rating (no LF)
VituixCAD
2.04
2.07
2.14
2.22
2.12
1.38
8.148
LoudspeakerLab
1.39
1.34
1.15
1.13
1.05
1.74
8.081
LoudspeakerLab (spec sheet)
3.18
2.34
1.56
2.64
1.62
4.00
7.122

Klippel's VXC-style score from the extracted reference curves is 7.920. The shared VituixCAD materials report 8.139, which is close to the recomputed value from these curves.

Supporting Electrical Checks

Source
Minimum impedance
Impedance p95 vs Klippel
Crossover transfer p95 vs VituixCAD
VituixCAD
4.13 ohm @ 219 Hz
1.56 ohm
reference
LoudspeakerLab
4.13 ohm @ 217 Hz
1.48 ohm
W 0.07 dB / T 0.01 dB
LoudspeakerLab (spec sheet)
1.42 ohm @ 40 Hz
3.08 ohm
W 1.48 dB / T 0.35 dB

Conclusions

1. Modeled speakers can match the real speaker surprisingly well when the input data is good

The in-cabinet models are close enough to the Klippel curves to be useful design tools rather than rough sketches. VituixCAD lands at 2.04 dB p95 on-axis error and 2.07 dB Listening Window p95 error after level alignment. LoudspeakerLab gives lower error at 1.39 dB and 1.34 dB on the same metrics.

The practical takeaway is that robust in-cabinet measurements remain the high-confidence path. They already contain the real baffle, mounting, diffraction, grille-less driver integration, sample variation, and low-frequency loading behavior. The software still has to sum drivers, apply offsets, apply crossover transfer functions, and calculate CTA curves, but it is no longer being asked to invent the loudspeaker from generic driver curves.

One key difference between LoudspeakerLab and similar speaker modeling tools is it's ability to "unload" and "re-load" the box and baffle from measurements if those measurements were taken in a cabinet versus on a large measurement baffle. This cabinet/baffle unload-reload path exists for the core purpose of measurement re-use. An in-cabinet driver measurement is not just the driver; it also contains the measurement box, baffle, mounting, and low-frequency loading. VituixCAD's classic workflow works when those measurements are already from the final cabinet. LL's unload/re-load process makes the same driver profile reusable in other designs by estimating measured in cabinet A -> remove cabinet A/baffle A -> apply cabinet B/baffle B. The higher agreement here is best read as a useful by-product of that architecture, incorporating accurate box and baffle models based on T/S parameters to help estimate anechoic speaker behavior.


2. Why LoudspeakerLab and VituixCAD differ with the same input data

On the directly measured response curves, LoudspeakerLab is lower-error on On-Axis, Listening Window, and Early Reflections, at least for Mechano23. VituixCAD is lower-error on Directivity Index (1.38 dB p95 for VituixCAD versus 1.74 dB for LoudspeakerLab). That is the main place where LL trails in the headline graphic.

I studied this to try to better understand why, since they use the same source measures and crossover, and the strongest clue is the cabinet/baffle ablation. When I used the same LL in-cabinet FRDs directly and by-passed LL's measurement-cabinet/baffle unload and target-cabinet/baffle reload step, the main errors become VituixCAD-like: On-Axis 2.05 dB, Listening Window 2.06 dB, and Early Reflections 2.10 dB. With the normal LL process, those are ~30-40% lower: 1.39, 1.34, and 1.15 dB. That suggests the unload/reload process is likely a real contributor to LL's stronger front-curve agreement in this Mechano23 case.

The electrical transfer overlay is the strongest sanity check: LoudspeakerLab's in-cabinet crossover transfer differs from VituixCAD by only about 0.07 dB p95 on the woofer and 0.01 dB p95 on the tweeter, essentially the same. So it's acoustic modeling, not electrical, that produces this lower error.


3. Spec-sheet modeling is useful, but it is not equivalent to in-cabinet measurement

The spec-sheet model gives a view into the widely accessible speaker design use case. Making in-cabinet spherical measurements requires a lot of expertise, expense, and effort. You have to buy the drivers, build the cabinet, and then actually take the 72 measurements correctly. Alternatively, you could use manufacturer FR/ZMA files or scraped spec sheet data to get sparse manufacturer horizontal polars, then asks LL to predict the cabinet/baffle transformation, vertical behavior, rear radiation, acoustic offsets, and system integration. This is not as accurate as the in-cabinet spherical mesuring process, but the quantification of the gap is interesting. On-Axis p95 error is 3.18 dB, Listening Window is 2.34 dB, and Directivity Index is 4.00 dB, higher, but not unusable to create a high-quality speaker design.

The electrical side points in the same direction. The spec-sheet model's impedance mismatch is much larger than the in-cabinet models, especially in the low-frequency region where box alignment and driver parameters dominate. That is a reminder that manufacturer ZMA/T/S data can be perfectly legitimate for its fixture and still be a poor stand-in for the exact driver/box/crossover combination being built.

That does not make the spec-sheet workflow worthless. It can get a plausible design into the right neighborhood, especially for early crossover exploration and enclosure sizing when no measurements exist. This can be helpful in making driver purchase decision and later making spherical measurements, or accepting the lower accuracy design from spec sheet as final. But this comparison argues against treating it as interchangeable with in-cabinet data. Manufacturer curves are measured on standardized baffles and fixtures, often on different driver samples, with different boundary conditions than the finished speaker. LL can model the transformation, but it cannot recover information that was never present in the input data.


4. The Preference Rating differences are real, but they are not a single-number verdict

The Klippel-derived VXC-equation score is 7.920. VituixCAD computes to 8.148, LL in-cabinet to 8.081, and the spec-sheet LL model to 7.122. Those numbers move because the score is sensitive to smoothness, directivity, and bass extension. A model can be close on on-axis response and still diverge in score if Sound Power, DI, or low-frequency extension shifts.

For DIY design work, the score is best treated as a useful summary statistic, not a substitute for looking at the curves. The score is especially vulnerable when Sound Power and DI are based on reconstructed or sparse angular data. That is exactly the region where this study finds the largest remaining LL/VXC/Klippel disagreement.


5. Where the agreement is strongest in modeled speakers

  • The in-cabinet LL and VXC models both broadly reproduce the real-speaker on-axis and listening-window shape.
  • Electrical impedance for the in-cabinet designs tracks the digitized Klippel impedance shape much better than the spec-sheet design.
  • Crossover transfer functions between LL and VXC are close enough that transfer math is unlikely to be the dominant explanation for acoustic differences.
  • Horizontal polar behavior is much more constrained for the in-cabinet LL design because measured H data extends to 180 degrees.

Bottom Line

If you have good in-cabinet driver measurements, both VituixCAD and LoudspeakerLab can produce a model that is meaningfully close to a real Klippel-measured speaker. If you only have manufacturer data, both tools can also still be useful, but will be limited by the input data. This study says to keep expectations realistic: the spec-sheet path is good for narrowing design space, not for proving final performance. The healthiest conclusion is boring in the best way: better input measurements beat cleverer modeling. The encouraging part is that when the input data is comparable, the output is broadly comparable too.

And here are my Mechano23's. I use them nearfield on my desk everyday. They really are excellent.

unnamed.jpg
 

Attachments

  • h-contour-compare.jpg
    h-contour-compare.jpg
    129.8 KB · Views: 350
  • v-contour-compare.jpg
    v-contour-compare.jpg
    144.4 KB · Views: 350
  • mechano23_ll_share_dataset.zip
    mechano23_ll_share_dataset.zip
    984.2 KB · Views: 23
Last edited:
The VituixCAD preference score shown in the shared VituixCAD materials is 8.139, using the VituixCAD default of omitting the low-frequency extension score; LoudspeakerLab's Preference Score is based more closely on Olive standard and includes the low-frequency extension penalty by default, but a "w/ Sub" Preference Score is also calculated which omits the low-frequency extension and produces a result more comparible to the defaul VituixCAD score.
In VituixCAD you can use the Olive equation by pressing Ctrl plus clicking on "Full space". Doing so with the Mechano23 data from Amirm's review provides a score of 6.079 and using Xmechanik's data 6.398 (likely due to the lower bass extension in Xmechanik's measurements). Kimmo has pointed out that VituixCAD provides a score about 0.3 higher (I have found about 0.5 higher when comparing ASR/Spinorama scores to VituixCAD's) due to different averaging methods and resolution (points per octave (PPO)). ASR/Spinorama Tonality for Mechano23 is 5.5.

Most accurate bass measurement is ground plane if you have the space to do it correctly, but careful farfield + nearfield can be very accurate (especially for a small 2-way) but getting it right takes a good amount of work.
 
Also, for your directivity plots to be comparable, you should change VituixCAD's polar plot to be +/- 180 degrees and set the color profile to Klippel palette.
 
In VituixCAD you can use the Olive equation by pressing Ctrl plus clicking on "Full space". Doing so with the Mechano23 data from Amirm's review provides a score of 6.079 and using Xmechanik's data 6.398 (likely due to the lower bass extension in Xmechanik's measurements). Kimmo has pointed out that VituixCAD provides a score about 0.3 higher (I have found about 0.5 higher when comparing ASR/Spinorama scores to VituixCAD's) due to different averaging methods and resolution (points per octave (PPO)). ASR/Spinorama Tonality for Mechano23 is 5.5.

Most accurate bass measurement is ground plane if you have the space to do it correctly, but careful farfield + nearfield can be very accurate (especially for a small 2-way) but getting it right takes a good amount of work.
Thanks, I did not know you could do that in VituixCAD. The comparable default LoudspeakerLab Preference Rating is 5.8 using Xmechanik's measurements. So, I suppose the LF extension included Preference Rating comparison is:
ASR/Spinorama w/ Klippel data = 5.5
LoudspeakerLab (Xmechanik data) = 5.8
VituixCAD (Xmechanik data) = 6.4
 
Also, for your directivity plots to be comparable, you should change VituixCAD's polar plot to be +/- 180 degrees and set the color profile to Klippel palette.
Yes, I struggled with this a bit. Today, LoudspeakerLab only allows you to view H and V contours +/- 90 degrees, so that's what I set VituixCAD to. But, Amir's original Klippel plots are +/- 180. So you are correct that they all contain +/- 90, but the Klippel image is broader.
 
Yes, I struggled with this a bit. Today, LoudspeakerLab only allows you to view H and V contours +/- 90 degrees, so that's what I set VituixCAD to. But, Amir's original Klippel plots are +/- 180. So you are correct that they all contain +/- 90, but the Klippel image is broader.
I brought it up because a lot of people get confused with the polar settings in VituixCAD. When you right click the mouse on the directivity pane, it gives you choices for +/- 90 degrees and +/- 45 degrees so many people ask how to display the +/- 180. But you just unselect both of those and the default is +/- 180.
 
Yes, I struggled with this a bit. Today, LoudspeakerLab only allows you to view H and V contours +/- 90 degrees, so that's what I set VituixCAD to. But, Amir's original Klippel plots are +/- 180. So you are correct that they all contain +/- 90, but the Klippel image is broader.
This causes a lot of confusion when comparing power and DI charts when the measurement space is not the same. +/- 90 is a half space measurement and should be computed with a 3dB higher overall DI as the DI cannot go below 3dB for a half space condition.

Vituix has an option setting to choose half or quarter space when the measurement set is restricted. Just restricting the polar display to +/-90 does not change anything in the background other than cropping the graph. So it is important to know if the measurement space is the same as if it is not there can not be a valid comparison of Soundpower and DI. If there is only +/-90 data for one available, all the others would need to have their data truncated at source.
 
@fluid, @a4eaudio Great suggestions. I updated the H and V contour comparison to scale Amir's data to just +/- 90 and changed the VituixCAD palette to Klippel so that the three images are more directly visually comparable.
 
Back
Top Bottom