Wednesday, 18 September 2013

LAW OF PROBABLITY - By Tejas A. Udeshi (2013051)

Probability


A manager frequently faces situations in which neither classical nor empirical
probabilities are useful. For example, in a one-shot situation such as the launch of a
unique product, the probability of success can neither be calculated nor estimated from
repeated trials. However, the manager may make an educated guess of the
probability. This subjective probability can be thought of as a person's degree of
confidence that the event will occur. In absence of better information upon which to rely,
subjective probability may be used to make logically consistent decisions, but the
quality of those decisions depends on the accuracy of the subjective estimate.



1) Law of Addition

Consider the following Venn diagram in which each of the 25 dots represents an
outcome and each of the two circles represents an event.




In the above diagram, Event A is considered to have occurred if an experiment's
Outcome, represented by one of the dots, falls within the bounds of the left circle.
Similarly, Event B is considered to have occurred if an experiment's outcome falls
within the bounds of the right circle. If the outcome falls within the overlapping region of
the two circles, then both Event A and Event B are considered to have occurred.
There are 5 outcomes that fall in the definition of Event A and 6 outcomes that fall in the
definition of Event B. Assuming that each outcome represented by a dot occurs with
equal probability, the probability if Event A is 5/25 or 1/5, and the probability of Event B
is 6/25. The probability of Event A or Event B would be the total number of outcomes in
the orange area divided by the total number of possible outcomes. The probability of
Event A or Event B then is 9/25.


Note that this result is not simply the sum of the probabilities of each event, which
would be equal to 11/25. Since there are two outcomes in the overlapping area, these
outcomes are counted twice if we simply sum the probabilities of the two events. To
prevent this double counting of the outcomes common to both events, we need to
subtract the probability of those two outcomes so that they are counted only once. The
result is the law of addition, which states that the probability of Event A or Event B (or
both) occurring is given by:
P(A or B) = P(A) + P(B) - P(A and B)
This addition rule is useful for determining the probability that at least one event will
occur. Note that for mutually exclusive events there is no overlap of the two events so:
P(A and B) = 0
and the law of addition reduces to:
P(A or B) = P(A) + P(B)


2) Conditional Probability

Sometimes it is useful to know the probability that an event will occur given that another
event occurred. Given two possible events, if we know that one event occurred we can
apply this information in calculating the other event's probability. Consider the Venn
diagram of the previous section with the two overlapping circles. If we know that Event
B occurred, then the effective sample space is reduced to those outcomes associated
with Event B, and the Venn diagram can be simplified as shown:




The probability that Event A also has occurred is the probability of Events A and B
relative to the probability of Event B. Assuming equal probability outcomes, given two
outcomes in the overlapping area and six outcomes in B, the probability that Event A
occurred would be 2/6. More generally,
P(A given B) =             P(A and B)
                                     ----------------------------
                                          P(B)



3) Law of Multiplication

The probability of both events occurring can be calculated by rearranging the terms in
the expression of conditional probability. Solving for P(A and B), we get:

P(A and B) = P(A given B) x P(B)

For independent events, the probability of Event A is not affected by the occurance of
Event B, so P(A given B) = P(A), and

P(A and B) = P(A) x P(B)

Tuesday, 17 September 2013

BENFORD'S LAW - EXPLAINED EASILY - by Viraj S. Modi (2013053)

Benford's Law
If you’ve not heard about Benford’s Law before, you’re in for a real treat with this post. 


 Before we get into the theory, however, indulge with me in a little thought experiment:
 Imagine I have a database of randomly occurring measurements (for instance I just happen to have a database of the altitude of the top 122,000 populated towns in the World). If I were to plot a frequency histogram of this data based on the leftmost digit of each altitude (in feet), 

what do you think the shape of the graph would be?





Something like this perhaps? 
 With sufficient data, you'd expect the first digit should be reasonably randomly represented, right?
Wrong!
That's not the shape we get …


ACTUAL SHAPE: 

In fact, if you plot the actual histogram of first digits, you get the above chart. 
 Yes, that's right, altitudes where the first digit is the number 1 occur significantly more often than the number 2 which, in turn, occurs more frequently than the number 3 … all the way down to the number 9. In truth, the number of times when the first digit is a 1 is almost 30% of the time; Six and a half times as often as it in the number 9 (which occurs less than 5% of the time)!

SURPRISED????
It’s not just altitude of places, I can repeat this exercise with other data sources, such as stock market volume, distances to stars in the Universe … and I'll get comparable distribution patterns. In all these examples, the leading digit is the number 1 approximately 30% of the time, and distribution of the other digits falls off the same way. :-

HISTORY:
This phenomenon is called Benford's Law after physicist Frank Benford, though it was first documented by Simon Newcomb, a Canadian mathematician, much earlier, in 1881.

Benford's formula states that the probability of the leading digit being of a certain value can be described by the following function:


HOW DOES IT WORK??:

I'll try to explain it with another experiment:

Try to imagine a pencil of one unit length (it does not matter what 'one unit' means to you).

Now imagine that pencil slowing growing in length. It grows and it grows. For a long time, it will be of length 1.x units long. In fact, it will have to double in length (100% change) before the leading digit changes from 1 to a 2. However, if it had a leading digit of 2, then it would only need to change in length 50% to change the leading digit from a 2 to a 3.




Look at the logarithmic scale above, you can see that as we move along the scale, there's a shorter distance between each subsequent mark until the next decade is achieved. At the edge of a decade, to change from a leading digits from a 9, requires only an 11% change in the value of the number. 
 We can see that the percentage of time that the leading digit is a 1 occurs approximately 30% of the time (the areas shaded red)

The probability of each digit being represented in the data is proportional to the area of the corresponding regions in the logarithmic chart. I've color coded them in the picture below.



The width of each colored segment is proportianal to log10(d+1) – log10(d).

INTERESTING APPLICATIONS:

1.Accounting fraud detection: 
In 1972, Hal Varian suggested that the law could be used to detect possible fraud in lists of socio-economic data submitted in support of public planning decisions. Based on the plausible assumption that people who make up figures tend to distribute their digits fairly uniformly, a simple comparison of first-digit frequency distribution from the data with the expected distribution according to Benford's Law ought to show up any anomalous results. Following this idea, Mark Nigrini showed that Benford's Law could be used in forensic accounting and auditing as an indicator of accounting and expenses fraud. In practice, applications of Benford's Law for fraud detection routinely use more than the first digit. 

2.Legal status: 
In the United States, evidence based on Benford's Law has been admitted in criminal cases at the federal, state, and local level. 

3.Election data:
Benford's Law has been invoked as evidence of fraud in the 2009 Iranian elections, and also used to analyze other election results. However, other experts consider Benford's Law essentially useless as a statistical indicator of election fraud in general. 

4.Macroeconomic data:
Similarly, the macroeconomic data the Greek government reported to the European Union before entering the Euro Zone was shown to be probably fraudulent using Benford's Law, albeit years after the country joined. 

5.Genome data: 
The number of open reading frames and their relationship to genome size differs between eukaryotes and prokaryotes with the former showing a log-linear relationship and the latter a linear relationship. Benford's Law has been used to test this observation with an excellent fit to the data in both cases. 

6.Scientific fraud detection: 
A test of regression coefficients in published papers showed agreement with Benford's law. As a comparison group subjects were asked to fabricate statistical estimates. The fabricated results failed to obey Benford's law.

SOURCE:  http://www.datagenetics.com/blog/march52012/
TRICKS AND TIPS WE CAN USE ON GOOGLE

 

Get movie times




 


Search on a movie name or just movie to see theater locations and showtimes in your area.

Find recipes




 


After searching for a specific recipe or ingredients, select "Recipes" in the left panel to see recipe result u wil see it.

 

Get the time






To see what time it is anywhere in the world, search time and the city or country.

 

Search for an address




 


To map any location, type in the city name or zip code followed by the word map and you'll see a map of that location. .

Get number conversions






 


Convert any measurement like miles to kilometers or any other by typing in the number and unit of measurement.

 

Get definitions






 


Put define in front of any word to get its definition

 

Check the weather






 


Search weather followed by a U.S. zip code or the name of any city in the world to get the current weather and forecast. Enter weather by itself to get the weather report for your current location.

 

 

Calculate anything



 

Enter any math equation into the search box and we'll calculate your answer.


Source: www.google.com

Monday, 16 September 2013

Group 8 Analysis : By Swapnil Shinde - P.G.D.M 2013050


Box Plot :-

A Box Plot is called a line chart or a bar chart which lets you graph severeal ranges of numbers using the same axis .

We can illustrate the information which is not easily explained by a traditional chart using Box Plot .

Excel database software does it for you . you just have to type in the labels and numbers and the Box Plot is ready on a click .

source of information - www.ehow.com


Group 8 Analysis : By Steffi Mascarenhas P.G.D.M   2013048

 
Stem and Leaf :-
 
Microsoft excel helps u to do regular analysis of statistics . you can work with numbers to find patterns, trends within your data.
 
Stem and Leaf shows frequency in which certain class of value occurs and you can create one using just two columns in your excel sheet.

 source of information - www.ehow.com