Solved

Find the best distribution

Posted on 2013-12-04
9
196 Views
Last Modified: 2013-12-14
Hi experts,

I have a data which represent the request comes into development center. I need to know the distribution of the arrival data into that center. I have calculate the inter-arrival time and calculate the frequency of the inter-arrival time. So, how can I know the fit probability distribution for this data?

* Please find a sample of the data file (an example)
test-data.xlsx
0
Comment
Question by:amq10
  • 4
  • 2
  • 2
  • +1
9 Comments
 
LVL 37

Assisted Solution

by:TommySzalapski
TommySzalapski earned 167 total points
ID: 39696821
The most common way I see that type of problem represented is the Poisson distribution. You could start there.

Of course, in reality, the time of day or day of the week can have a pretty big impact on the number of requests so it depends on how sophisticated of a model you want.
0
 
LVL 27

Assisted Solution

by:d-glitch
d-glitch earned 333 total points
ID: 39696842
If you fix up your data so it is all in date format, you change it to integers to see the Julian day number.

Then you can find the interarrival times and find the average.

I don't know if this could be modeled as a Poisson process or not.
0
 
LVL 27

Assisted Solution

by:d-glitch
d-glitch earned 333 total points
ID: 39696870
Issue Type	CREATED				
New Feature	11/10/2011     	40857			
New Feature	2/1/2012	40940	83		
New Feature	4/11/2012	41010	70		
New Feature	4/14/2012	41013	3		
New Feature	5/2/2012	41031	18		
New Feature	7/10/2012	41100	69		
New Feature	7/31/2012	41121	21		
					
			264	 / 6  ==>	44.00

Open in new window

0
 
LVL 27

Assisted Solution

by:d-glitch
d-glitch earned 333 total points
ID: 39696875
The inter arrival times vary between 3 and 83 days.
Is there any reason to think this is a random process?
0
Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

 
LVL 84

Expert Comment

by:ozo
ID: 39696894
7 data points are not enough to determine a model much more sophisticated than Poisson.
It's easy enough to find more complicated models that fit the data, but you'd probably just be fitting idiosyncrasies in the particular data set in a way that won't generalize to any other data.
0
 

Author Comment

by:amq10
ID: 39696999
Thanks for all,

Actually, it is a real data and more than 2000 record. I have calculate the inter arrival time and calculate the frequency to plot it and see which distribution can use. I have attached what I have got from plotting the frequency . The problem is I have problem to scale the data. SO, I plot it in minutes as shown in attached file.
test-data1.xlsx
0
 
LVL 37

Assisted Solution

by:TommySzalapski
TommySzalapski earned 167 total points
ID: 39697008
Try fitting it to a Poisson model and see if that gives you what you need. That's really the go-to distribution for random arrival.
0
 

Author Comment

by:amq10
ID: 39712433
Hi

Still stuck with this problem. I have no idea where is the problem. to simplify the problem,  I have the attached data as a time of arrival tasks to a center, I need to find the fit distribution to set the simulation. So, I calculate the inter-arrival time between coming tasks and try to plot the frequency in histogram to compare it with Poisson distribution.

The question is that: does my steps correct to find the fitness distribution for my arrival point?

How can I calculate the Poisson distribution for this data or how to find the best distribution?
test3.xlsx
0
 
LVL 27

Accepted Solution

by:
d-glitch earned 333 total points
ID: 39714623
You need to cleanup your data.  

Are all of your data in the same date/time format?  It doesn't appear to be.

Most of the interarrival times are less than 1.00, but then there are runs of integers and zeros.
What does a zero value mean?  Do you have multiple events happening at the exact same time?

Looking at the data by eye:
   The events/day for the first seven days are 20, 12, 25, 31, 11, 32, 24.
   This is great.  It is analyzable.  You might even be able to model it as a Poisson process.

But then you have a twenty day gap from JAN-11 thru FEB-2.  
What is going on?  Do you understand why this gap occurs?
You have a number of these gaps in your data.
 
What do you hope to learn from your analysis?
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Improved? Move/Copy Add-in Replacement - How to avoid the annoying, “A formula or sheet you want to move or copy contains the name XXX, which already exists on the destination worksheet.” David Miller (dlmille)  It was one of those days… I wa…
How to quickly and accurately populate Word documents with Excel data, charts and images (including Automated Bookmark generation) David Miller (dlmille) Synopsis In this article you’ll learn how to use ExcelToWord! to copy data,charts, shapes …
The viewer will learn how to create a normally distributed random variable in Excel, use a normal distribution to simulate the return on an investment over a period of years, Create a Monte Carlo simulation using a normal random variable, and calcul…
Many functions in Excel can make decisions. The most simple of these is the IF function: it returns a value depending on whether a condition you describe is true or false. Once you get the hang of using the IF function, you will find it easier to us…

895 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

17 Experts available now in Live!

Get 1:1 Help Now