Questions tagged [statistics]
152 questions
4
votes
2 answers
What empirical evidence do we have for or against a correlation between fault density and LOC?
LOC = lines of code
KLOC = Thousand lines of code
Fault (or defect) density = number of reported bugs per line of code.
Software artifact = function, class, module
Reading research papers on fault density and fault prediction, it seems a bit hard to…

Olle Härstedt
- 375
- 1
- 9
3
votes
0 answers
Statistical anomaly detection in time series
I'm looking for algorithms that detect statistical anomalies in time series.
For example, Google Trends automatically detects peaks in a specific search query over time and associates those peaks with the most popular news stories at that time.
See…

neuron
- 131
- 1
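One common baseline for the kind of peak detection this question asks about is a trailing-window z-score. The sketch below is only an illustration under stated assumptions: the function name `rolling_zscore_anomalies`, the window of 7 points, the threshold of 3 standard deviations, and the `counts` series are all made up for the example, and nothing here is claimed to be how Google Trends works.

```python
from statistics import mean, pstdev

def rolling_zscore_anomalies(series, window=7, threshold=3.0):
    """Return indices whose value deviates from the trailing-window mean
    by more than `threshold` trailing-window standard deviations.
    (Hypothetical helper; window and threshold are illustrative defaults.)"""
    anomalies = []
    for i in range(window, len(series)):
        history = series[i - window:i]   # the `window` points before index i
        mu = mean(history)
        sigma = pstdev(history, mu)      # population standard deviation
        if sigma == 0:
            # Flat history: treat any change at all as a deviation.
            if series[i] != mu:
                anomalies.append(i)
            continue
        if abs(series[i] - mu) / sigma > threshold:
            anomalies.append(i)
    return anomalies

# Made-up daily counts with one obvious spike at index 8.
counts = [10, 11, 9, 10, 12, 10, 11, 10, 48, 11, 10, 9]
print(rolling_zscore_anomalies(counts))  # [8]
```

Note that the spike inflates the mean and standard deviation of the following windows, so later anomalies can be masked. Median-based statistics are a common way to reduce that effect.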
1
vote
0 answers
What is the reason behind getting the same result from the maximum likelihood estimate and smoothing?
I know that Good-Turing smoothing helps us to trim a bit of probability from some more frequent events and give it to the events we've never seen. Thus it keeps our model from assigning zero probabilities to unobserved events. But while dealing with…

Tripoli
- 11
- 1
1
vote
1 answer
Chebyshev inequality comparison
Given $n$ distinct values $(x_i)_{i=1}^{n}$ with mean $\mu$ and standard deviation $s$, for all $i$, we have $|x_i - \mu| \le s \sqrt{n - 1}$.
How does this inequality compare with Chebyshev's inequality as $n$ increases?
user126050
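A quick numeric check can make the comparison in this question concrete. The sketch assumes $s$ is the population standard deviation (dividing by $n$), the convention under which the quoted bound $s\sqrt{n-1}$ holds. The Chebyshev-style bound comes from applying Chebyshev's inequality to the empirical distribution: a single point carries mass $1/n$, so $1/n \le s^2/(x_i-\mu)^2$, which gives $|x_i-\mu| \le s\sqrt{n}$. The function name and the sample data are made up for illustration.

```python
from math import sqrt
from statistics import mean, pstdev

def max_deviation_bounds(xs):
    """Return (observed max deviation, quoted bound, Chebyshev-derived bound).
    Assumes s is the population standard deviation (divide by n)."""
    n = len(xs)
    mu = mean(xs)
    s = pstdev(xs, mu)
    observed = max(abs(x - mu) for x in xs)
    quoted_bound = s * sqrt(n - 1)    # bound stated in the question
    chebyshev_bound = s * sqrt(n)     # from P(|X - mu| >= k*s) <= 1/k**2
    return observed, quoted_bound, chebyshev_bound

# Illustrative data with one extreme value.
observed, quoted, chebyshev = max_deviation_bounds([1, 2, 3, 4, 100])
print(observed, quoted, chebyshev)
```

Under these assumptions the two bounds differ by the factor $\sqrt{(n-1)/n}$, which tends to 1 as $n$ grows, so the quoted bound is slightly tighter for small $n$ and the gap shrinks for large $n$.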
0
votes
2 answers
Estimating run time in the absence of knowledge
I’ve always wondered how to make a best guess in the absence of any knowledge.
Let’s say you have a computation running, and you don’t know when it will complete (just that it will). Can you make any estimate when it will be finished just based on…

Julian
- 103
- 2
0
votes
0 answers
Can't import dataset on RStudio
I can't import datasets into RStudio, either on my computer or on Posit Cloud. I think my laptop is too old to install the packages needed to import datasets, but I'm not sure what the problem is on Posit Cloud. Someone else imported it for me in a lab so I…

Melanie
- 1
0
votes
0 answers
Kullback-Leibler Divergence
For $P(x)=N(\mu,\sigma^2)$ and $Q(x)=N(0,1)$ I am supposed to calculate $KL(P(x)||Q(x))$. Here is what I did:
\begin{align*}
KL(P(x)||Q(x)) & = \int P(x) \cdot \log\left(\frac{P(x)}{Q(x)}\right) dx \\
& = \int P(x) \cdot…

LRS25
- 11
- 2
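The asker's derivation is cut off in the excerpt, so the following is not a reconstruction of it. As a sanity check for this kind of calculation, the standard closed form $KL\big(N(\mu,\sigma^2)\,\|\,N(0,1)\big) = \tfrac{1}{2}\left(\sigma^2 + \mu^2 - 1 - \ln\sigma^2\right)$ can be compared against a direct numerical integration. The function names, the integration range of 12 standard deviations, and the test values $\mu=1$, $\sigma=2$ are assumptions chosen for illustration.

```python
from math import exp, log, pi

def normal_logpdf(x, mu, sigma):
    """Log-density of N(mu, sigma^2) at x."""
    return -0.5 * ((x - mu) / sigma) ** 2 - log(sigma) - 0.5 * log(2 * pi)

def kl_closed_form(mu, sigma):
    """KL(N(mu, sigma^2) || N(0, 1)) from the standard closed form."""
    return 0.5 * (sigma ** 2 + mu ** 2 - 1.0 - log(sigma ** 2))

def kl_numeric(mu, sigma, half_width=12.0, steps=100_000):
    """Midpoint-rule approximation of the integral of P * log(P / Q)."""
    lo = mu - half_width * sigma
    hi = mu + half_width * sigma
    dx = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * dx
        log_p = normal_logpdf(x, mu, sigma)
        log_q = normal_logpdf(x, 0.0, 1.0)
        total += exp(log_p) * (log_p - log_q) * dx
    return total

print(kl_closed_form(1.0, 2.0))  # closed form
print(kl_numeric(1.0, 2.0))      # numerical check, should agree closely
```

Working with log-densities avoids underflow in the tails, where both densities are extremely small.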
-1
votes
2 answers
Finding normality from a set of sample means
I have a set of 1000 samples. Each sample represents the mean response time of X transactions.
Now I have a running transaction. I know its current response time, but I want to know whether this particular transaction's elapsed time is normal relative…

deb
- 9
- 2