Expand Cut Tags

No cut tags
latentbird: (Default)
[personal profile] latentbird
The main conclusion is to finish the random files issue.
TODO: Make a number of shuffling series of 10 shufflings each. Choose a word from the top of occurrences list. The means for that word should not differ more than 5% and the standard deviations shoiuld not differ more that 15%. It it is not true, increase a number of shuffles in each series to 15 and so forth.

TEN recognizes two different measures of standard deviation - the standard deviation of a series (he calls it an individual measurement dispersion) and a standard deviation of averages. This notation is new to me, although I do inderstand tbat given K smapling series, the standard deviation between means of those series will be very low. I just don't understand why we need such a parameter and when can it be used. TEN argues that sigma_averages = sigma_sample/sqrt(N), where N is a number of samples in each series.
This account has disabled anonymous posting.
If you don't have an account you can create one now.
HTML doesn't work in the subject.
More info about formatting

August 2017

S M T W T F S
  12345
6789101112
1314151617 1819
2021222324 2526
2728293031  

Style Credit

Page generated Feb. 27th, 2026 08:46 am
Powered by Dreamwidth Studios