Why then, don't they do controlled lab test of their theories instead of depending on simulation outcomes?
Because it's too complicated, and because it's unnecessary.
If you have read NIST NCSTAR1-6C, you will note that they had to travel to Canada to find a 35-foot furnace suitable for testing. Full-size fire tests simply aren't done. The tests are expensive and the facilities are rare. Another example, one I cite frequently, is the
Cardington series of fire tests. You will note the extreme complexity and expense of such a test.
The test in NCSTAR1-6C, as I have already noted, was conducted to test the
intact fire rating of the construction. It was not intended to test any specific case, nor could it, for reasons below.
A couple things:
1. Why didn't they run the tests to complete failure?
2. How does NIST know what to compare the tests to?
3. How can they verify the extent of fireproofing that was removed from the impact?
4. 35 ft assemblies were used in two of the tests--which conforms exactly to the shorter lengths of the actual WTC floor assemblies.
It's true that the tests were run with fireproofing on, but that just raises the question of why they didn't take the SFRM off for some of the tests. That just sounds like a poorly designed test.
That's not correct.
The tests were not run to failure because that would destroy the test cell. The furnaces are not designed to have multi-ton steel structures collapse upon them. Cleanup after such a test is hazardous. Remember, this is not a tiny benchtop experiment we're talking about -- each of the short-span full-scale truss tests involved roasting a structure
bigger than my house.
There's no need to run the tests further, anyway. Elementary structural mechanics will tell you what will happen next to a high degree of accuracy. The difference between the stopping point in the UL furnace tests and what happened in the WTC Towers is whether or not the diagonal elements buckled. Up until that point, the NIST models can be verified against the truss tests in terms of displacement as a function of steel temperature, and they were, and the fit was excellent. But after that, there's no point. You're now talking about a dynamic structure, one with much larger uncertainties about condition, but the gross order behavior is completely predictable. Running this test, besides being dangerous, doesn't really tell us anything that we don't already know.
Regarding the scale of tests, the half-scale and full-scale short span tests revealed that contrary to their assumptions, they could not simply "scale up" the design and assume the same fire rating. For that reason, the difference between the short and large spans cannot be estimated without doing an experiment. However, this experiment can be done digitally, and was.
All of the computer models created were verified (that means "tested against known cases") against simple test cases, such as the NCSTAR1-6C tests. They were also tested for sensitivity and validated (that means "tested to show the correct behavior within a bounding envelope of conditions") independently, by varying each of the input conditions and verifying the effect of each, giving us a way to estimate the error for any given test case. The structural models, in total, are simply not all that sensitive to the kinds of errors you suppose. Minor variations in heating or even in displacement are not significant to the overall structure.
Regarding the fireproofing, that's a totally separate issue. It is difficult to be certain of the fireproofing damage. This is why NIST used conservative estimates of the damage, and also ran two fire cases for each tower, verifying the results against the many observations including extent of "hanging objects," appearance and progression of inward bowing at exterior walls, and measured lean of the tower superstructures. What the results show is that there is a minimum of fireproofing damage expected, but that within a wide envelope of performance, the fine details really make little difference.
It seems they have to assume temperatures based on the conclusion that the heat initiated the collapse--not the actual forensic evidence recovered.
That's part of my beef with the investigation. We are to take their word that the temperature assumptions are correct--even though the physical evidence they gathered doesn't comply.
The physical evidence they gathered
does comply. You're wrong about that. Nobody is asking you to take anything as gospel.
So far, all of the difficulties you've had with the report stem from either not reading the report, or from fundamental confusions about the role of simulation in engineering. This is perfectly understandable if you do not come from a technical background. However, at this point you should at the very least give the NIST scientists the benefit of the doubt, trusting that perhaps it is you who has misinterpreted or misunderstood the process.
As always, feel free to ask questions.