Monday, 29 October 2012

Analysis of Variance


1-way ANOVA
-          Analysis of variance is used to compare 3 0r more means w/c contains only 1 variable.
2-way ANOVA
-          ANOVA that involves 2 variables.
Reasons why the T-test should not be used on 3 or more populations
1.       When 1 is comparing 2 means at a time, the means of the rest of the study are ignored. W/ F-test, all the means are tested simultaneously.
2.       When 1 is comparing 2 means at a time, the probability of rejecting the null hypothesis when it is true increased, since more T-tests are conducted, the greater is the likelihood of getting significant differences by chance alone.
3.       The more means are to compare, 3 the mote T-tests are needed.
Assumptions for the F-test in comparing three of more means
1.       The populations in w/c the samples were obtained must be normally distributed or approximately normally distributed.
2.       The sample must be independent for each.
3.       Te variances of the population must be equal.
With the f-test, 2 different estimates of the population variances are made. The 1st estimate is called the between group variance or mean square of the between group that involves finding the variance of the means. The 2nd estimate, the w/in group variance or mean square of the w/in group. This is made by computing all the variances using the data and is not affected by the difference of the means.
No difference in the means.
·         The between group variance estimate is approximately equal to the w/in group variance.
·         F-test value will be approximately equal to 1.
·         The null hypothesis will not be rejected.
Means differ significantly.
·         The between group variance is much larger than the w/in group variance.
·         F-test will be significantly greater then 1.
·         The null hypothesis will be rejected.

Test for Independence


Test Using Contingency Tables
When data can be tabulated in the table from in terms of frewuencies, several hypothesis can be tested by using the chi-square test. Two such tests are the independence of variables test and the homogenety of proportions test.

·         The test of independence is used to determine wether 2 variables are independent of or related to each other when a single sample is selected.
Steps:
·         State the hypotheses
·         Find the degree of freedom
·         Find the expected value
·         Find the test value
·         Find the critical value
·         Make a decision
·         Summarize

Test for homogenety of proportions
·         It is used ti determine wether the proportions for variables are equal when several samples are selected from different populations.
Steps:
·         State the hypotheses
·         Find the critical value
·         Find the test value
·         Make a decision
·         Summarize

Friday, 26 October 2012


                                              Mobile/Smartphone statistics
  • Mobile now accounts for 10% of internet usage worldwide (this has more than doubled over last 18months)
  • 1.08 of the world’s 4 billion mobile phones are smartphones
  • Apple and Android represent more than 75% of the smartphone market
  • 7.96% of all web traffic in the U.S. is mobile traffic. That number skyrockets to 14.85% in Africa, and 17.84% in Asia — up 192.5% since 2010
  • 29% of mobile users are open to scanning a mobile tag to get coupons
  • 39% of instances where a consumer walks out of a store without buying were influenced by smartphones
  • 91% of mobile internet access is for social activities, versus just 79% on desktops 
  • Over 1/3 of Facebook’s users access Facebook Mobile; 50% of Twitter’s users use Twitter Mobile
  • QR code scans increased 300% in 2011 compared to 2010
  • The average tablet user spends 13.9 hours per week with the device
  • 73% of smartphone owners access social networks through apps at least once per day 
  • There was 103% growth in website traffic from smartphones from 2011-2012
  • US consumers spend almost 1 in every 10 ecommerce dollars using a mobile device 
  • There are currently 6 Billion mobile subscribers worldwide
  • This equals 87% of the world’s population
  • China and India account for 30% of this growth
  • There are over 1.2 Billion people accessing the web from their mobiles
  • Over 300,000 apps have been developed in the past 3 year
  • Google earns 2.5 Billion in mobile ad revenue annually


J.Santillan

Wednesday, 24 October 2012

Analysis of Variance (ANOVA)


Analysis of Variance (ANOVA)
Analysis of Variance (ANOVA) is a collection of statistical models, and their associated procedures, in which the observed variance in a particular variable is partitioned into components attributable to different sources of variation. In its simplest form, ANOVA provides a statistical test of whether or not the means of several groups are all equal, and therefore generalizes t-test to more than two groups. Doing multiple two-sample t-tests would result in an increased chance of committing a type I error. For this reason, ANOVAs are useful in comparing two, three, or more means.




POSTED BY: VON ERJUN SABUSAP

Difference Between Means
Hypothesis Testing of the Difference Between Two Means
Do employees perform better at work with music playing.  The music was turned on during the working hours of a business with 45 employees.  There productivity level averaged 5.2 with a standard deviation of 2.4.  On a different day the music was turned off and there were 40 workers.  The workers' productivity level averaged 4.8 with a standard deviation of 1.2.  What can we conclude at the .05 level?
Solution
We first develop the hypotheses
        H0 m1 - m2  =  0       
        H1 m1 - m2  >  0
Next we need to find the standard deviation.  Recall from before, we had that the mean of the difference is 
        mx  =  m1 - m2 
and the standard deviation is 
 sx  =      


We can substitute the sample means and sample standard deviations for a point estimate of the population means and standard deviations.  We have

        
and 
Now we can calculate the t-score.  We have
                    0.4
        t  =                       =  0.988
                   0.405

To calculate the degrees of freedom, we can take the smaller of the two numbers n1 - 1 and n2 - 1.  So in this example we use 39 degrees of freedom.  The t-table gives a value of 1.690 for the t.95 value.  Notice that 0.988 is still smaller than 1.690 and the result is the same.  Since the t-score is smaller than 1.690, we fail to reject the null hypothesis and state that there is insufficient evidence to make a conclusion about employees performing better at work with music playing. 




POSTED BY: VON ERJUN SABUSAP



Tuesday, 23 October 2012


                                       ANOVA for simple linear regression

• Total sum of squared deviations is divided into model (regression) and error
(residual) sums of squares

• Their ratio is the coefficient of determination R2

• These are each divided by their degrees of freedom to obtain the mean SS

• Their ratio is distributed as F and can be tested for significance



J.Santillan
                                         Analysis of Variance (ANOVA)

• Partition the total variance in a population into the model and residual

• If the model has more than one term, also partition the model variance into
components due to each term

• Can be applied to any linear additive design specified by a model

• Each component can be tested for signficance vs. the null hypothesis that it
does not contribute to the model fit



J.Santillan

STATISTICAL FACTS ABOUT TEENAGERS

Every 2 hours a youth is murdered.

In the next twenty four hours, 1,439 Teens will attempt suicide. 

A 1995 study by Children Now and Kaiser Permanente found 40% of teen women know someone who was in an abusive relationship. 

In the next twenty four hours, 2,795 Teenage girls will become pregnant.

Every 4 minutes a youth is arrested for an alcohol related crime.

In the next twenty four hours, 3,506 Teens will run away.

Every 7 minutes a youth is arrested for a drug crime.
















G.K. Elio

Wednesday, 10 October 2012

Analysis of Variance

   The z and t tests should not be used when three orb more means are compared, instead, F-test can be used to compare three or more means. This technique is called analysis of variance or ANOVA .For three groups, the F-test can only show whether or not a difference exists among the three means. It cannot reveal where the difference lies. If the F test indicates that there is a difference among the means, other statistical test are used to find where the difference exists. The most commonly used tests are the Scheffe test and the Tukey test.

One-Way Analysis of Variance
- Analysis of variance used to compare three or more means which contains only one variable.

Two-Way Analysis of Variance
-ANOVA that involves two variables.

Reasons why the t test should not be used on three or more populations:
1. When one is comparing two means at a time, the rest of the means under study are ignored. With the f test, all the means are compared simultaneously.
2. When one is comparing two means at a time, the probability of rejecting the null hypothesis when it is true increased, since more t test are conducted, the greater is the likelihood of getting significant differences by chance alone.
3. The more means there are to compare, 3 the more t tests are needed.

Assumptions for the F test for comparing Three or more means
1. The populations from which the samples were obtained must be normally or approximately normally distributed.
2. The samples must be independent for each.
3. The variances of the population must be equal.


J.Santillan
Procedure in Finding F test Value for the Analysis of Variance

Step 1: Find the mean and variance for each sample.

Step 2: Find the grand mean.

Step 3: Find between- group variance.

Step 4: Find within- group variance.

Step 5: Find the F test value

The degrees of freedom are:
       d.f. N= k-1   where k is the number of groups, and
       d.f D= N-k   where N is the sum of the sample sizes of the groups



J.Santillan

Sunday, 7 October 2012


Test Using Contingency Tables

When data can be tabulated in the table from in terms of frewuencies, several hypothesis can be tested by using the chi-square test. Two such tests are the independence of variables test and the homogenety of proportions test.

Test for Independence
·         The test of independence is used to determine whether 2 variables are independent of or related to each other when a single sample is selected.
Steps:
·      1.   State the hypotheses
·      2.  Find the degree of freedom
·     3.   Find the expected value
·     4.   Find the test value
·    5.     Find the critical value
·    6.     Make a decision
·     7.   Summarize

Test for homogenety of proportions
·         It is used ti determine whether the proportions for variables are equal when several samples are selected from different populations.
Steps:
·        1.  State the hypotheses
·         2. Find the critical value
·      3.   Find the test value
·       4.   Make a decision
·       5.   Summarize



C.Ordoyo

Thursday, 4 October 2012

Majority of establishments is engaged in Water collection, treatment and supply

           The 2010 Annual Survey of Philippine Business and Industry (ASPBI) covered a total of 255 Water supply; Sewerage, Waste Management and Remediation Activities establishments with total employment of 20 and over.
            Among industries, water collection, treatment and supply recorded the highest number of establishments at 232 or 91.0 percent of the total. Waste collection followed a far second with 11 establishments (4.3%). Waste treatment and disposal ranked third with 7 establishments (2.7%) while materials recovery had the lowest number with 5 establishments (2.0%). Figure 1 illustrates the percentage distribution of Water Supply; Sewerage, Waste Management and Remediation Activities establishments by industry group in 2010.

           At the regional level, Central Luzon accounted for the highest number of establishments at 52 or 20.4 percent of the total. This was followed by CALABARZON with 45 establishments (17.6%) and Western Visayas with 20 establishments or 7.8 percent.


 J.Santillan

Statistical Facts About

Google+

                   It is expected that Google’s new social network Google+ will hit 400 million users by the end of 2012. It needs to be kept in mind that Google+ is more about being “core” to Google’s social web strategy as it aims to embed social signals into its whole web and search strategy as it aims to continue to be relevant in an increasingly social and mobile web.
Google makes it mandatory to join Google+ when you register for a Gmail account which really amounts to it being a forced membership. This non organic strategy means that its engagement levels are extremely low at 3 minutes per month compared to Facebook at 405 minutes, in fact they are less than MySpace.
It must be also kept in mind that Google+ is the glue that connects Google’s other social and web properties such as YouTube, Android and of course search.

Some Google+ Facts and Figures

  • It was launched on June 28, 2011
  • Google+ reached 10 million users by July 14, 2011
  • 67% of Google+ users are male
  • Google “+1″ button is served more than 5 billion times daily
  • It is gaining 625,000 users per day
  • In less than one day the Google+ iPhone app became the most popular free application in the Apple App store
Google+ Facts Figures Statistics Infographic

J.Santillan

Correlation and Regression

Relationship
1.Simple
-maybe positive
-maybe negative

2.Multiple

Regression- making prediction
*The more stronger the regression the more accurate.

Scatter plot
(x) Independent variable- being controlled
(y) Dependent variable- affected by other variable


J.Santillan

YouTube

Communication and engagement is much more than talking or writing and the popularity of  YouTube is evidence of that. The availability of cost effective high speed internet access is making it easy for people to express themselves via video. Brands have seized on its power to be a viral media that augments traditional advertising media such as TV. YouTube’s advantage is it is always available and searchable. (it is the worlds second largest search engine after Google.
This capability has been displayed by the success of the Old Spice  marketing campaign among many others.

Some facts and figures:

  • 3rd most visited website according to Alexa
  • 2 billion views per day
  • It handles 10% of the internet’s traffic
  • Average YouTube user spends 900 seconds per day
  • 44% of YouTube’s users are aged between 12 and 34
  • Over 829,000 videos are uploaded every day
  • Average video duration is 2 minutes 46 seconds
YouTube Facts Figures and statistics Infographic 2012

J.Santillan

Monday, 1 October 2012

FACEBOOK STATISTICS

With over 500 million users, Facebook is now used by 1 in every 13 people on earth, with over 250 million of them (over 50%) who log in every day. The average user still has about 130 friends, but that should expand in 2011.
48% of 18-34 year old check Facebook when they wake up, with 28% doing so before even getting out of bed. The 35+ demographic is growing rapidly, now with over 30% of the entire Facebook user base. The core 18-24 year old segment is now growing the fastest at 74% year on year. Almost 72% of all US internet users are on now Facebook, while 70% of the entire user base is located outside of the US.
Over 700 Billion minutes a month are spent on Facebook, 20 million applications are installed per day and over 250 million people interact with Facebook from outside the official website on a monthly basis, across 2 million websites. Over 200 million people access Facebook via their mobile phone. 48% of young people said they now get their news through Facebook. Meanwhile, in just 20 minutes on Facebook over 1 million links are shared, 2 million friend requests are accepted and almost 3 million messages are sent.

 G.K. Elio