Wednesday, April 3, 2019
Identifying Problems When Obtaining Population Parameters
Identifying Problems When Obtaining Population ParametersWe project race parameters, much(prenominal)(prenominal) as the think, based on the adjudicate statistics. It is difficult to bushel a precise measure or point estimation of these figures. A much practical and informative commence is to find a clench of set in which we expect the community parameters leave fall. Such a range of cling tos is called a self-assurance breakup.1. CONFIDENCE INTERVAL interpretationThe agency time detachment is a range of think ofs constructed from stress data so that the community parameter is likely to occur in spite of appearance that range at a specified prospect. The specified prospect is called the take of pledge.The shape of the probability statistical distribution of the assay designate allows us to specify an interval of precise probability that the cosmos cerebrate, , forget fall into.1.1 Large warning Or Standard Deviation Is KnownCase 1The beat batt le of opinion is known orIt is a swelled taste (i.e. at least(prenominal) 30 observations).The Central Limit Theorem body politics that the taste distribution of the consume authority is approximately habitual. We piece of tail substance ab custom the tables in the Appendix to find the earmark Z value.Key PointsThe measuring rod regulation distribution allows us to pull off the pastime conclusions68% of the savour pith will be within 1 prototype divergences of the commonwealth think, .95% of the test lowlys will be within 1.96 standard divagations of the state mean, .99% of the savour means will hypocrisy within 2.58 standard deviances of the universe mean.These intervals ar called the self-assertion interval.The standard deviation preceding(prenominal) (i.e. the standard actus reus) is referring to the standard deviation of the have distribution of the assay mean.Locating 0.475 in the body of the table, exhibit the jibe row and column values, the value is 1.96. Thus, the probability of finding a Z value surrounded by 0 and 1.96 is 0.475. Likewise, the probability of cosmos in the interval amidst -1.96 and 0 is also 0.475. When we combine these deuce, the probability of be in the interval of -1.96 to 1.96 is therefore 0.95.1.1.1 How do you compute a 95% trustingness interval?Assume our research involves the annual starting salary of chore graduates in a local university. The ideal mean is $39,000, while the standard deviation of the sample mean is $250. Assume our sample contains more than than 30 observations. The 95% faith interval is between $38,510 and $39,490. Found by $39,000 +/- 1.96($250)In most situations, the population standard deviation is non available, so we account it as follows (Standard Error)Conclusions95% confidence interval99% confidence intervalConfidence interval for the population mean (n 30)Z depends on confidence levelExample 1The Hong Kong Tourist Association wishes to have instruct ion on the mean annual income of tour guides. A random sample of 150 tour guides reveals a sample mean of $45,420. The standard deviation of this sample is $2,050. The association would like answers to the following questions(a) What is the population mean?The scoop estimate of the unknown quantity population value is the correspondent sample statistic. The sample mean of $45,420 is a point estimate of the unknown population mean.(b) What is a reasonable range of values for population mean?The Association decides to use up the 95% level of confidence. To come up the corresponding confidence interval, we use the legislationThe endpoints would be $45,169 and $45,671 and they are called confidence limits. We could expect about 95% of these confidence intervals contain the population mean. About 5% of the intervals would non contain the population mean annual income, i.e. the . double 2 Probability distribution of population mean1.2 Small Sample Or Standard Deviation Is undiagn osedCase 2The sample is small (i.e. less(prenominal) than 30 observations) or,the population standard deviation is non known.The correct statistical surgery is to regenerate the standard normal distribution with the t distribution. The t distribution is a continuous distribution with many similarities to the standard normal distribution.1.2.1 Standard normal distribution versus t distributionFigure 3 Z distribution versus t distributionThe t distribution is flatter and more get around out than the standard normal distribution.The standard deviation of the t distribution is larger than the normal distribution.Confidence interval for a sample with unknown population mean, . The confidence interval isAssume the sample is from a normal population. bringing close together the population standard deviation () with the sample standard deviation (s).Use t distribution rather than the Z distribution.Example 2A shoe maker penurys to wonder the useful life of his products. A sample of 1 0 pairs of shoes that had been walked for 50,000 km showed a sample mean of 0.32 inch of mend remaining with a standard deviation of 0.09 cm. Constructing a 95% confidence interval for the population mean, would it be reasonable for the manufacturer to conclude that after 50,000 km the population mean bill of sole remaining is 0.3 cm?Assume the population distribution is normal. The sample standard deviation is 0.09 cm.There are only 10 observations and hence, we use t distributionEstimation= 0.32, s = 0.09, and n = 10. flavor 1 come out t by moving across the row for the level of confidence required (i.e. 95%). blackguard 2 The column on the left hand margin is identified as df. This refers to the number of stagecoachs of liberty. The number of degree of freedom is the number of observations in the sample minus the number of samples, written n-1.(i.e. 10-1=9). pure tone 3 Confidence Interval =The endpoints of the confidence interval are 0.256 and 0.384. tonicity 4 Interpret ation the manufacturer behind be reasonably sure (95% confident) that the mean remaining tread depth is between 0.256 and 0.384 cm. Because 0.3 is in this interval, it is possible that the mean of the population is 0.3.2. CHOOSING AN APPROPRIATE SAMPLE SIZEThe indispensable sample size depends on three factorsLevel of confidence wanted To gain level of confidence, increase n.Margin of fault the researcher will have a bun in the oven To reduce allowable error, increase n.Variability in the population being studied For a more widely dispersed sample, increase n.We can express the interaction among these three factors and the sample size in the following formulaSample size for estimating the population mean,Noten Sample sizeZ Standard normal valueS Estimate of population standard deviationE Maximum allowable errorExample 3An accounting student wants to know the mean amount that independent directors of small companies earn per month as remuneration for being a director. The err or in estimating the mean is to be less than $ light speed with a 95% level of confidence. The student found a hatch by the government that estimated the standard deviation to be $1000. What is the required sample size?Maximum allowable error, E, is $100.Value of Z for a 95% level of confidence is 1.96, and the estimate of the standard deviation is $1000.Substitute into , we getn = (1.96) (1000) 2 = 19.62 = 384.16100The sample of 385 is required to meet the requirements. If the students want to increase the level of confidence, e.g. 99%, this requires a larger sample.Z = 2.58, son = (2.58) (1000) 2 = 25.82 = 665.64100Sample = 6663. WHAT IS A HYPOTHESIS?Definitions supposal is a statement about a population parameter develop for the purpose of raiseing.Hypothesis testing is a procedure based on sample indorse and probability theory to determine whether the guesswork is a reasonable statement.In statistical analysis, we always make a charter about the population parameters, i.e. a possibility. We collect data and then use the data to test the assertion.4.1 Five-Step Procedure For interrogatory A HypothesisFigure 4 How to test a possible action4.1.1 Step 1 give in slide fastener possibleness (H0) and alternative conjecture (H1)The first step is to state the hypothesis being tested. It is called the cipher hypothesis. We either pass up or soften to lower the cryptograph hypothesis. Failing to reject the cipher hypothesis does not prove that H0 is square(a).The null hypothesis is a statement that is not jilted unless our sample data provide convincing evidence that it is false.The alternative hypothesis is a statement that is accepted if the sample data provide fitting evidence that the null hypothesis is false.Example 4A journal has disclosed that the mean age of commercial helicopters is 15 divisions. A statistical test of this statement would first want to determine the null and the refilling hypotheses.The null hypothesis represents th e current or reported condition. It is written H0 = 15.The pass over hypothesis is that the statement is not truthful, i.e. H1 15.4.1.2 Step 2 grant a level of significanceThe level of significance is the probability of rejecting the null hypothesis when it is true.A decision is do to use the 5% level, 1% level, 10% level or any other level between 0 and 1. We must decide on the level of significance before formulating a decision rule and collecting sample data. fictitious character I error Rejecting the null hypothesis, H0, when it is true.Type II error Accepting the null hypothesis when it is false.Example 5 hazard AA Watch Ltd has informed bracelet suppliers to press for bring down on the supply of a large amount of bracelets. Suppliers with the terminal bid will be awarded a sizable contract.Suppose the contract specifies that the watch producers quality-assurance department will take samples of the freightage.H0 The expedition of bracelet contains 6% or less wan ting(p) bracelets.H1 More than 6% of the boards are defective.A sample of 50 bracelets received August 2 from BB Metals Ltd revealed that four bracelets, or 8%, were substandard. The payload was rejected because it exceeded the maximum of 6% substandard bracelets. If the shipment was actually substandard, the decision to return the bracelets to the supplier was correct.However, suppose the four substandard bracelets selected in the sample of 50 were the only substandard bracelets in the shipment of 4,000 bracelets. Then only 1/10 of 1% were defective (4/4000 = 0.001). In that case, less than 6% of the entire shipment was substandard and rejecting the shipment was an error.We may have rejected the null hypothesis that the shipment was not substandard when we should have accepted the null hypothesis.By rejecting a true null hypothesis, we committed a Type I error.AA Watch Ltd would commit a Type II error if, unknown to the order an incoming shipment of bracelet from BB Metals Ltd c ontained 15% substandard bracelets, yet the shipment was accepted. How could this happen?Suppose both out of the 50 bracelets in the sample (4%) tested were substandard, and 48 out of the 50 were good bracelets. As the sample contained less than 6% substandard bracelets, the shipment was accepted but it could be purely by chance that the 48 good bracelets selected in the sample were the only agreeable ones in the entire shipment.In conclusionNull HypothesisAccepts H0Rejects H0H0 is trueCorrect decisionType I errorH0 is falseType II errorCorrect decision4.1.3 Step 3 Select the test statisticsThere are many test statistics. In this chapter, we use both Z and t as the test statistic.DefinitionA test statistic is a value, determined from sample information, utilise to determine whether to reject the null hypothesis.In hypothesis testing for the mean () when is known or the sample size is large, the test statistic Z is computed byThe Z value is based on the sampling distribution of , which follows the normal distribution when the sample is reasonably large with a mean () equal to , and a standard deviation , which is equal to . We can thus determine whether the difference between and is statistically significant by finding the number of standard deviations is from , using the formula above.4.1.4 Step 4 Formulate the decision ruleDefinitionA decision rule is a statement of the specific conditions low which the null hypothesis is rejected and the conditions under which it is not rejected.The region or scope of rejection defines the location of all those values that are so large or so small that the probability of their occurrence under a true null hypothesis is rather remote.The area where the null hypothesis is not rejected is to the left of 1.65.The area of rejection is to the right of 1.65.A one-tailed test is being applied.The 0.05 level of significance was chosen.The sampling distribution of the statistic Z is ordinarily distributed.The value 1.65 separa tes the regions where the null hypothesis is rejected and where it is not rejected.The value 1.65 is the vital value.The full of life value is the dividing point between the region where the null hypothesis is rejected and the region where it is not rejected.Figure 5 Area of rejection for the null hypothesis4.1.5 Step 5 Make a decisionThe last(a) step in hypothesis testing is computing the test statistic, study it to the critical value, and making a decision to reject or not to reject the null hypothesis.Based on the information, Z is computed to be 2.34, the null hypothesis is rejected at the 0.05 level of significance. The decision to reject H0 was made because 2.34 lies in the region of rejection, i.e. beyond 1.65.We would reject the null hypothesis, reasoning that it is passing improbable that a computed Z value this large is due to sampling variation.Had the computed value been 1.65 or less, say 0.71, the null hypothesis would not be rejected. It would be reasoned that suc h a small computed value could be attributed to chance.Example 6A large car leasing company wants to deal tires that average about 60,000 km of wear under normal usage. The company will, therefore, reject a shipment of tires if tests reveal that the life of the tires is significantly infra 60,000 km on the average.The company would be glad to accept a shipment if the mean life is greater than 60,000 km. However, it is more tinted that it will have sample evidence to conclude that the tires will average less than 60,000 km of useful life. Thus, the test is set up to satisfy the concern of the car leasers that the mean life of the tires is less than 60,000 km.The null and alternate hypotheses in this case are written H0 60,000 and H1 In this problem, the rejection region is pointing to the left, and is therefore in the left tail.SummaryIf H1 states a education, we use a one-tailed test.If no direction is specified in the alternate hypothesis, we use a two-tailed test.Figure 6 On e-tailed test5. TESTING FOR POPULATION MEAN WITH KNOWN POPULATION well-worn DEVIATION5.1 Two-tailed TestABC Watch Ltd manufactures luxury watches at several(prenominal) plants in Europe. The weekly output of the cast A33 watch at the Swiss set is normally distributed, with a mean of 200 and a standard deviation of 16. Repennyly, because of market expansion, mechanisation has been introduced and employees laid off. The CEO would like to investigate whether there has been a change in the weekly business of the Model A33 watch. To put it another way, is the mean output at Swiss Plant different from 200 at the 0.01 significant levels?5.1.1 Step 1 evince null hypothesis and alternate hypothesisThe null hypothesis is The population mean is 200. H0 = 200.The alternate hypothesis is The mean is different from 200. H1 200.5.1.2 Step 2 Select the level of significanceThe 0.01 level of significance is used. This is , the probability of committing a Type I error, and it is the probabili ty of rejecting a true null hypothesis.5.1.3 Step 3 Select the test statisticThe test statistic for the mean of a large sample is Z.Figure 7 Normalise the standard deviation5.1.4 Step 4 Formulate the decision ruleThe decision rule is conjecture by finding the critical values of Z from Appendix D.Since this is a two-tailed test, half of 0.01, or 0.005, is placed in each tail. The area where H0 is not rejected, i.e. area between the two tails, is 0.99.Appendix D is based on half of the area under the curve, or 0.5. Then 0.5 0.005 is 0.495, so 0.495 is the area between 0 and the critical value.The value nearest to 0.495 is 0.4951. Then read the critical value in the row and column corresponding to 0.4951. It is 2.58. conclusiveness ruleReject H0 if the computed Z value is not between -2.58 and +2.58.Do not reject H0 if Z falls between -2.58 and +2.58.Figure 8 Two-tailed test5.1.5 Make a decision and interpret the result exercise Z and apply the decision rule to decide whether to reje ct H0.The mean number of watches produced weekly for last year is 203.5. The standard deviation of the population is 16 watches.Because 1.55 does not fall in the rejection region, H0 is not rejected. We conclude that the population mean is not different from 200.So we would report to the CEO that the sample evidence does not show that the production rate at the Swiss plant has changed from 200 per week. The difference of 3.5 units between the historical weekly production rate and the mean number of watches produced weekly for last year can reasonably be attributed to sampling error.Figure 9 Rejection regions for the two-tailed testSo did we prove that production rate is still 200 per week?No Failing to disprove the hypothesis that the population mean is 200 is not the same thing as proving it to be true.5.2 P-value In Hypothesis TestingDefinitionP-value is the probability of observing a sample value as extreme as, or more extreme than, the value observed, given that the null hypothe sis is true.How confident are we in rejecting the null hypothesis?This approach reports the probability of getting a value of the test statistic at least as extreme as the value actually obtained. This process compares the probability called the P-value, with the significant level.If the P-value If the P-value significant level, H0 is not rejected.A very small P-value, such as 0.0001, indicates that there is little likelihood the H0 is true. If a P-value of 0.2033 means that H0 is not rejected, there is little likelihood that it is false.Figure 10 P-valueP-valueInterpretationless(prenominal) than 0.1Some evidence that H0 is not trueLess than 0.05Strong evidence that H0 is not trueLess than 0.01Very strong evidence that H0 is not trueLess than 0.001Extremely strong evidence that H0 is not trueThe probability of finding a Z value of 1.55 or more is 0.0606, found by 0.5 0.4394.The probability of obtaining an greater than 203.5 if = 200 is 0.0606.To compute the P-value, we need to be concerned with the region less than -1.55 as well as the values greater than 1.55. The two-tailed P-value is 0.1212, found by 2(0.0606). The P-value of 0.1212 is greater than the significance level of 0.01, so H0 is not rejected.Chapter ReviewThe Central Limit Theorem states that the sampling distribution of the sample means is approximately normal.The standard error refers to the standard deviation of the sampling distribution of the sample mean.We use t distribution when the sample is less than 30 observations and the population standard deviation is not known.The necessary sample size depends on 1) level of confidence wanted 2) margin of error the researcher will tolerate 3)variability in the population.By rejecting a true null hypothesis, we committed a Type I error.We would reject the null hypothesis when it is highly improbable that a computed Z value this large is due to sampling variation.What You Need To KnowConfidence interval A range of values constructed from sample da ta so that the population parameter is likely to occur within that range at a specified probability.Hypothesis A statement about a population parameter developed for the purpose of testing.Hypothesis testing A procedure based on sample evidence and probability theory to determine whether the hypothesis is a reasonable statement.Critical value The dividing point between the region where the null hypothesis is rejected and the region where it is not rejected.P-value The probability of observing a sample value as extreme as, or more extreme than, the value observed, given that the null hypothesis is true. cook Them Out1. The average number of eld in outdoors assignments per year for salespeople employed by an electronic wholesaler of necessity to be estimated with a 0.90 degree of confidence. In a small sample, the mean was 150 days and the standard deviation was 14 days. If the population mean is estimated within two days, how many salespeople should be interviewed?A 134B 152C 111D great hundred2. A random sample of 85 staff of managerial grudge revealed that a person exhausted an average of 6.5 years on the prank before being promoted. The standard deviation of the sample was 1.7 years. Using the 0.95 degree of confidence, what is the confidence interval for the population mean?A 6.19 and 6.99B 6.15 and 7.15C 6.14 and 6.86D 6.19 and 7.193. The mean weight of lorries travelling on a particular(a) highway is not known. A state highway authority needs an estimate of the mean. A random sample of 49 lorries was selected and finds the mean is 15.8 tons, with a standard deviation of 3.8 tons. What is the 95 per cent interval for the population mean?A 14.7 and 16.9B 14.2 and 16.6C 14.0 and 18.0D 16.1 and 18.14. A bank wants to estimate the mean balances owed by platinum Visa card holders. The population standard deviation is estimated to be $300. If a 98% confidence interval is used and an interval of $75 is desired, how many platinum cardholders should be taken into sample?A 84B 82C 62D 875. A sample of 20 is selected from the population. To determine the appropriate critical t-value, what number of degrees of freedom should be used?A 20B 19C 23D 276. If the null hypothesis that two means are equal is true, where will 97% of the computed z-values lie between?A 2.58B 2.38C 2.17D 1.687. Suppose we are testing the difference between two proportions at the 0.05 level of significance. If the computed z is -1.57, what is our decision?A Reject the null hypothesisB Do not reject the null hypothesisC Review the sampleD Own judgment8. The net weights of a sample of bottles make full by a machine fabricate by Dame, and the net weights of a sample fill up by a similar machine manufactured by Putne Inc, are (in grams)Dame 5, 8, 7, 6, 9 and 7Putne 8, 10, 7, 11, 9, 12, 14 and 9Testing the claim at the 0.05 level that the mean weight of the bottles filled by the Putne machine is greater than the mean weight of the bottles filled by the Dame machine , what is the critical value?A 2.215B 2.175C 1.782D 1.6829. Which of the following conditions must be met to conduct a test for the difference in two sample means?A Data must be of interval scaleB Normal distribution for the two populationsC Same variances in the two populationsD All the above are correct10. Take two independent samples from two populations in order to determine if a statistical difference on the mean exists. The number for the first sample and the number in the second sample are 15 and 12 respectively. What is the degree of freedom associated with the critical value?A 24B 25C 26D 27SHORT QUESTIONSA consumer group would like to estimate the mean monthly water charge for a single family house in June within $5 using a 99% level of confidence. same research has found that the standard deviation is estimated to be $25.00.What would be the sample size?The manager of the Kingsway Mall wants to estimate the mean amount spent per shopping visit by customers. A sample of 2 0 customers reveals the following amounts spent.$48 $42 $46 $51 $23 $41 $54 $37 $52 $48$50 $46 $61 $61 $49 $61 $51 $52 $58 $43What is the trounce estimate of the population mean?Determine a 99 per cent confidence interval. Interpret the result.Would it be reasonable to conclude that the population mean is $50? What about $60?ESSAY QUESTION1. ABC pic Ltd knows that a certain favourite photographic film ran an average of 84 days, and the corresponding standard deviation was 10 days. The manager of New Westminster district was kindle in comparing the movies popularity in his region with that in all of Canadas other theatres. He randomly selected 70 theatres in his region and found that they showed the movie for an average of 82 days.(a) State appropriate hypotheses for testing whether there was a significant difference in the length of the pictures run between theatres in the New Westminster district and all of Canadas other theatres.(b) Test these hypotheses at a 1% significance l evel.
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.