Questions tagged [statistics]

152 questions
4
votes
2 answers

What empirical evidence do we have for or against a correlation between fault density and LOC?

LOC = lines of code KLOC = Thousand lines of code Fault (or defect) density = number of reported bugs per line of code. Software artifact = function, class, module Reading research papers on fault density and fault prediction, it seems a bit hard to…
Olle Härstedt
  • 375
  • 1
  • 9
3
votes
0 answers

Statistical anomaly detection in time series

I'm looking for some algorithms that detect statistical anomaly in time series. For example, Google Trend automatically detects peaks of a specific search query in time, and associates those peaks with most popular news stories at that time. See…
neuron
  • 131
  • 1
1
vote
0 answers

What is the reason behind getting same result by maximum likelihood estimate and smoothing?

I know that Good-Turing smoothing helps us to trim a bit of probability from some more frequent events and give it to the events we've never seen. Thus it keeps our model from assigning zero probabilities to unobserved events. But while dealing with…
Tripoli
  • 11
  • 1
1
vote
1 answer

Chebyshev inequality comparison

Given $n$ distinct values $(x_i)_{i=1}^{n}$ with mean $\mu$ and standard deviation $s$, for all $i$, we have $|x_i−\mu| ≤ s \sqrt{n − 1}$. How does this inequality compare with Chebyshev inequality as $n$ increases?
user126050
0
votes
2 answers

Estimating run time in the absence of knowledge

I’ve always wondered how to make a best guess in the absence of any knowledge. Let’s say you have a computation running, and you don’t know when it will complete (just that it will). Can you make any estimate when it will be finished just based on…
Julian
  • 103
  • 2
0
votes
0 answers

Can't import dataset on RStudio

I can't import datasets on RStudio on either my computer or Posit Cloud. I think it's because my laptop is too old to install packages necessary to import datasets on it, and not sure for Posit Cloud. Someone else imported it for me in a lab so I…
0
votes
0 answers

Kullback-Liebler Divergence

For $P(x)=N(\mu,\sigma^2)$ and $Q(x)=N(0,1)$ I am supposed to calculate $KL(P(x)||Q(x))$, here is what I did \begin{align*} KL(P(x)||Q(x)) & = \int P(x) \cdot \log\left(\frac{P(x)}{Q(x)}\right) dx \\ & = \int P(x) \cdot…
LRS25
  • 11
  • 2
-1
votes
2 answers

finding normality from a set of samples of MEAN

I have set of 1000 samples. each sample represents MEAN of X amount transactions response time. Now I have a running transaction , I know it's current response time but I want to know if this particular transactions elapsed time is normal relative…
deb
  • 9
  • 2