.

Monday, February 25, 2019

Z Score

MN 215 A & B October 02, 2012 Z pocks, Z shields and t Tests Overview and Review At the beginning of the course we learned that thither be ii kickoffes of statistics, namely, parametric and non-parametric. Further we learned that parametric statistical processes ar broken down into devil contrary categories, namely descriptive statistical processes and illative. We learned also that descriptive statistics ( squiffy, mode, median, salutary-worn loss, and frequencies) be l mavin(prenominal) to be employ to describe the characteristics of the data rather than draw endings of defecate inferences from the quantity data collected.However, the importance of descriptive statistics send packing non be downstairsmined as they body the basis for the rifleings of inferential statistical processes especi eachy the call back. In data compend one of the most important concepts to remember is that regardless of the progeny or issue being enquired all is based on the think of of a data set. Although we puke non draw refinement or make predictions from descriptive statistics their intakefulness in inferential statistics is remarkable.As verbalize inferential statistics is a branch of statistics that is rehearsed in making inferences nearly traits or characteristics of a greater state on the basis of savor measurement data. The primary mark of inferential statistics is to leap beyond the measurement data at communicate and make inferences closely a greater creation. Take for workout a psychologist who is interested in well-educated whether a impudently behavior fitting harvest forget wantly be a seller in a certain market ara.Knowing that the entire consumer commonwealth cannot be queried as to market acceptance, the psychologist would select a lesson attempt for the ara, administer whatever measurement instrument is necessary to garner the data and, on the basis, of the pattern data results, determine whether or not the new overlap give be profitable. The statistic employ to determine whether or not the audition is re layative of the entire market population would be an inferential statistics.When development inferential statistical processes to gene set up information in order to make predictions about a large population the chosen model must incessantly be on the basis of haphazard selection or random assignment. Without random have or random assignment the mathematical entertain received by way of the statistical analysis are in err. Or, an separate way of putting is to say that the results would be Lies, damn lies about the data analyzed. For convenience purposes throughout the remainder of this course the sideline symbols lead be used most extensively.Statisticians, regardless of area, use English garner to denote attempt statistics and Greek letters to symbolize population parameters. Name exemplar Statistic community tilt _ stand forX (mu) VarianceSD? ? 2 (sigma squa red) Standard DeviationSD ? (sigma) Correlationr ? (rho) Proportionp ? (pi) Regression Coefficient b ? ( beta)? ? When trying to arrive at destructions that extend from the measurement data alone, inferential statistics are the data analysis tools of pickaxe.For example, inferential statistics are used to infer from the specimen data to the larger population data or when in that respect is an need to make judgments of the chance that an observed difference amid groups is an accu graze and dependable one and not those that happened by chance alone. In order too accomplish that which inferential statistics were designed two models are available estimation essaying and surmisal block outing. In the estimation model the take measurement data is used to estimate a parameter (population) and a confidence time interval about the estimate is created.The confidence interval is essentially the range of set that has a high likelihood of containing the parameter. The parameter is a numerical nurture that measures close to spark off or the population measurement constructs or take to bes. The sum use of inferential statistical processes is in scheme evidenceing. The most vulgar manner in which a hypothesis is essayed is by developing what is usually called a straw man which is what a useless hypothesis is call when looking at a situation where in the query investigator wants to determine if the data collected and analyzed is strong adequacy to wipe out the null or straw man hypothesis.Always remember that a null hypothesis is stated that no differences, effects or relationships will occur amidst and or amongst the events, occurrences, phenomenon, items, or situations being evaluated and measured as a result of some variant. A truthful example of a worry null hypothesis would be something like the following on that point exists no statistically significant difference amongst widgets made of alloy A and those made of Alloy B in terms of tinsel strength acceptability. entropy Requirements When use Inferential Statistics.Thinking back to the low part of the course we learned that statistical processes must use certain forms of numeric measurement data and this data is expressed as nominal, ordinal, interval and ratio. For descriptive statistics (frequencies and measures of central tendency) it is nominal data that is used. For inferential statistics the measurement data types to be used are either interval or ratio. However, in the affectionate sciences and business concern arenas ordinal data is lots convictions treated like interval. This is particularly true when studies attempt to assess situations by way of a Lickert scale.For convenience and review the scale presented down the stairs will help to light up the differences between the four scales of measurement discussed earlier in the course. Indications Indicated Direction ofIndicates numerate of Absolute Difference Difference Difference Zero Nomin alX OrdinalX X IntervalX X X RatioX X X On the basis of the information contained in the table to a higher place the following two conditions apply when using inferential statistical processes * Participants selected for participation in a study should be selected randomly. If sampling is not random, then biases occur and contaminate the accuracy of the demoteings. The most commonly used inferential statistics that behavioural look uses are those statistical processes that reserve for the determination of relationships (correlations), differences and effects between and amongst that which is being measured or evaluated. The unique(predicate) analyzes used are the Pearson Correlation Coefficient, Chi Square, Student t Test, analysis of class (Analysis of Variance), and regression. All of these techniques not only require the use of a null hypothesis further independent and dependent variables as healthful. Z write upsCalculating the Z news report for Research Purposes. One of the most often used statistical processes in the behavioural sciences is the Z Score. What a Z Score accomplishes is in taking a raw measurement economic value or score and transforms it into a standard form which then provides a more important description of the individual win within the distribution. This transformation is based on knowledge about the populations imply and standard divergency. Take for example an educational psychologist who is interested in determining how individual school-age childs are comparing to the overall group of students with respect to grades.As we have learned before raw scores alone cannot provide insightful information to the psychologist how well an individual student is performing. However, what the psychologist can easily do is calculate a Z Score for each student and determine whether or not an individual student is functioning to a higher place or to a lower place the mean grade of all students together. When determining the place ment of each individual, the Z Score permits the psychologist to calculate how m whatever a(prenominal) standard aberrancys, or the distance, each student is above or below the mean grade of all students together.If on that point is an academic standard the psychologist is using as a comparative base a opposite statistical verbal expression is used compared to the formula needed when comparing individual feat to a local sample of student. The formulas for each are presented below. Comparing someone to Population Standard Comparing Individual to Sample Standard The gimmick of the two formulas is the comparable with the exception that one uses the mean and standard deviation of a population and the other of a sample.What is very important to remember, specially for the psychologist, is that comparing an individual to a local academic setting may have entirely different results when the same student is compared to the pains standard. Although this expertness appear to be a dilemma, it is actually a come-at-able gracility in disguise. Take for example the same psychologist compares all his students rate of academic success in a local facility and determines they are all functioning well above average, or above the mean, in their grades.What happens if the same students are compared to an academic standard and the results show their grade is well below the industry standard or population mean? The conclusion cadaverous is, therefore, that the students, although having grades are not in line with other educational facilities and corrective computer programming to incr palliate the performance rate must ensue. For ease of infrastanding let us look at a business situation. Example. Suppose an employee is producing 3. 5 widgets per encourage and the sample average number of widgets per hour is 2. 3 with a standard deviation of 0. 33. The Z Score would be calculated as follows X = raw score X band = mean s = standard deviation From this we can concl ude that the employees widget exertion rate per hour of 3. 5 lays 1. 73 standard deviations above the mean. We can conclude further that this employee is function above the mean all others together on the ware line in terms of widget toil and that the employee is doing soften than 95% of the other employees and only 5% of the total employees are producing more widgets.NOTE The percentages are easily found on the back of the very last page of your text book. As stated earlier caution must be exercised when drawing conclusions about a single business sample as the statistical information garnered power not be representative of industry standards. Looking at the same employee on an industry standard basis the information might possibly be different. Taking the same employee with an average widget return rate of 3. 5 widgets per hour with a hypothetical population or industry standard mean of 4. 9 and a population standard deviation of 1. 15 the results would be as follows using t he formula stated above X = Employee raw turnout score = Population standard mean ? = Population standard deviation Z = (3. 5 4. 79) / 1. 15 = -1. 12 What can be readily seen by way of the negative value Z Score is that the employee falls below the standard industry mean with respect to the number of widgets produced in one hour. Concluding further we can say that this employee standing is surpassed by 64% of the entire population manpower for he company. Needless to say, the charabanc needs to take a effective look at the quality of workers in his/her plant. Interpreting the Z Score for Research Purposes. When using Standard Z Scores one must always remember that comparisons are made between individual measurement determine and sample or population mean values. At no time can a one use Z Score values to make predictions or drawn inferences about any given(p) situation. To accomplish this, inferential statistical processes must be used.The value of the Z Score lies in the id ea that individual tracking is necessary and trends can be plotted. Also, one must always keep in thought that X values do not have to be simple individual raw scores but can also fall any investigative variable the researcher chooses to investigate. Z Test When to use the Z Test over the t Test in Research. Although some(prenominal) the Z experiment and the t tally are used in research decision hypothesis interrogationing each is used under a different set of circumstances than the other. The primary distinction between the two lies in the sample sizing requirement.Where t tests can be used for small samples the Z Test cannot and is, therefore, reserved for sample situations that are larger. Both, however, perform the same function, namely to determine whether or not there are differences between the samples being evaluated or comparisons between sample and population measurements. In admission both(prenominal) the Z and t tests make use of the mean scores for raw measureme nt data when calculating differences. Presented below are some examples of using both the Z test and t test in business today. Z Test A harvest safety engineer wants to investigate the average number of possible sorry products in worldwide mathematical product. A sample is drawn sample (in profusion of 30) and mean of the sample is compared to the population mean for evaluation. * Z Test A psychologist wants to investigate whether or not a 10 hour call down will record more safety accidents in product intersection compared to the company wide population standard of eight hour shifts. * Z Test A human election coach wants to investigate whether or not a new employee training program will make up drudgery numbers company wide. t Test A psychologist wants to investigate whether or not the sample of 20 line employees of plant A are producing a significantly greater number of products than the sample of 20 employees of plant A. * t Test A consumer product safety passenger vehi cle wants to investigate whether or not his small firm is producing an equal number of safe products compared to the industry standard. * t Test A human resource manager is interested in knowing if customer service skills of employees in discussion section A are the same as in department B.What is most important to remember is that both the t and Z tests are formulated to arrive at the same conclusion but under different sampling conditions. Keep in disposition as well that the Z test is used when the population mean is known. In addition when using a t test with a small sample base it is assumed the distribution of the data is normal however, in larger samples the distribution does not have to be normal and a Z test can be used for comparative purposes. Further, in both situations the samples drawn must be on a random basis.The unlucky limitation of both tests is in the fact that neither permit any conclusions to be drawn if not differences are found between the sample means or sample and population mean. However, one must always keep in mind that Z and t tests are basically the same as they compare two means to determine whether or not both samples come from the same population. Calculating the Z Test. The example presented below not only provides you with a formula for both population mean testing but sample mean testing as well.What must be closely watched is the effect on sample surface with respect to any resulting Z value Remember that the Z test requires a large sample and should a small sample be used the resulting Z value is contaminated. Formula Sample vs. PopulationSample vs. Sample __ __ __ Z = / Z = X1 X2 N 2(1/N + 1/N) Example Sample vs. PopulationSuppose a product manager is interested in knowing if the number of untimely washing molds being produced in his/her plant in August is indicative of the over-all number of washing mechanisms produced in all plants during the calendar month of August. The product manager draws two samples from his/her assembly line a sample of 10 and a sample of 100. The example being created is to show how the sizing of the sample bears directly on the resulting Z Test value. Formula __ _ Z = / N Data.Sample Test Mean = 30 Population Mean = 25 (Industry Requirement) Population = 15 N = 10 __ Z = / N = 30-25 / 15 / 3. 16 = 15 / 4. 75 Z = 1. 58 Sample Test Mean = 30 Population Mean = 25 (Industry Requirement) Population = 15 N = 100 _ Z = / N = 30-25 / 15 / 10 = 5 / 1. 5 Z = 3. 33 Conclusion The conclusion the production manager can draw from the above measurement example (N=10 and N=100) is relative to the size of the sample used to determine whether or not the sample is representative of the overall faulty washing machine production in August.Had the production manager set the level of confidence at 0. 01 (99%) the Z test score needed in order to reject the null hypothesis that no differences exist in washing machine production is +1. 96. A Z test value for the 10 sample situation of +1 . 58 does not congruous or exceed the demand value of +1. 96. Therefore, the production manager concludes there is not statistically significant difference in the August faulty washing machine production rate for his/her plant and the overall faulty washing machine production rate of all plants.However, when the sample size is increased the resulting Z test value is super different. The 100 sample case, using the same values as in the 10 sample case, provides an entirely different scenario. By increasing the sample size tenfold the resulting Z test value is +3. 33. Obviously this numeric value far exceeds the required +1. 96 value and the production manager can safely conclude that statistically significant differences exist between the faulty washing machine productions in the production managers plant compared to the average faulty washing machine production rate of all plants.The reason for the difference in Z test values in knowing that as sample size increases so does the Z test value. Although not shown in this example, but also extremely important, is in knowing that when the variance of the sample differs from the population variance there will exist a lower Z test value. In the 100 sample test, should the resulting Z test value not met the required 1. 96 value the production manager could have reason that the faulty washing machine production rate of his/her plant meets the production rate of all other plants together for the month of August.As scientific research and applied statistics application are not equipped to lend write up as to why no differences are determined the only conclusion to be drawn is that the lack of differences is a direct result of sample size and variance. Example Sample vs. Sample vs. Sample Formula __ __ Z = X1 X2 2(1/N + 1/N) Example Suppose the same product manager is interested in knowing if the number of faulty washing machines being produced in his/her plant in August is indicative of the number of faulty washing m achines produced in a inhabit plant during the month of August.The product manager draws two samples one from his/her assembly line and one from the neighboring plant a sample of 100 is drawn from both plants. _ Data Sample 1 N=100 X=30 _ Sample 2 N=100 X=25 = 15 (known or assumed) _ _ Z = X1 X2 2(1/N + 1/N) = 30 25 / (15)? (1/100 + 1/100) = 5 / v (225) (. 01 + . 01) = 5 / 4. 5 = 5 / 2. 12 = 2. 35 Conclusion On the basis of the Z test value above the production manager would have to conclude that there exists a statistically significant difference in the production rate of the two plants at the . 1 confidence level (99%) as the required critical value of 1. 96 was matched and exceeded. As such it can be stated that the two washing machine samples are not representative of each other and differences occur. Should the product manager replicate the study and use only 10 washing machines per sample the resulting Z test value would be 1. 11 and the conclusion drawn would be that no s tatistically significant differences are present between the two groups and the population.Again this is an example of how sensitive the Z test is to sample size. One must always keep in mind that re-testing a product or service with artificial conveyances (smaller sample size) in order to show that differences are not present is scientifically and professionally unacceptable. Research results must be allowed to fall wherein the statistical analysis places them. Doing differently is using the statistical process for reasons other than that which they were intended Drawing Conclusions from the Z Test.Business situations are not unlike any other professional situation, including the behavioral sciences, wherein the researcher or investigator is seek information as to possible differences between samples or sample and the general population. When business managers or psychologists at any level are interested in making comparisons between products and or serve the best-fit statistica l tool for large sample situations is the Z test. However, the statistical value is only as substantially as the controls dictated on it and at no time will the actual values give a reason as to why something has happened or why something has not.With regard to the utilization of the Z test in business decision-making the following rules are always to be remembered * Z Tests can be used to compare a sample to a population or sample to a sample for general population inference. * Z Tests are extremely susceptible to size of sample and variance and not useful when population variance is unknown. * Z Tests work best with very large samples but not with small samples as the correction factor cannot accommodate for the fracture associated with small samples. Z Tests are natural introductions to t Tests. * Z Tests work with only one (1) dependent variable. * Z Tests cannot work with correlate data. * Z Tests do not permit the making of strong inferences about differences or effects of the testing instrument or situation. * Z Tests have a non-parametric counterpart wherein small samples can be used. t Test 1a. launching to Difference Testing. Difference testing is used primarily to identify if there is a detectable difference between products, services, people, or situations.These tests are often conducted in business situations to * Ensure a change in face or production introduces no significant change in the end product or service. * Substantiate a claim of a new or improved product or service * Confirm that a new ingredient/supplier does not affect the perceived attributes of the product or service. * Track changes during shelf-life of a product or the length of time of a service. Differences Between Two Independent Sample Means ascorbic acid vs. Pepsi. Let us again look at a business example wherein the independent sample t-tests are sed to compare the means of two independently sampled groups. Example do those boozing coulomb differ on a performance vari able (i. e. numbers of cans consumed in one week) compared to those drinking Pepsi. The individuals are randomly assigned to the Coke and Pepsi groups. With a confidence interval or ?. 05 (corresponding probability level of 95 %) the researcher concludes the two groups are significantly different in their means (average consumption rate of Coke and Pepsi over a one week period of time) if the t test value meets or exceeds the required value.If the t value does not meet the critical t value required then the research investigator simply concludes that no differences exist. Further explanation is not required. Presented below is a more useable situation. Example As a manager of production let us suppose you are wanting to determine whether or not work performance is significantly (statistically) different in a noise related production line vs. a non-noise related production line. Individual Noise mathematical product Non-Noise Production difference 1-2 38 32 6 2 10 16 -6 3 84 57 2 7 4 36 28 8 5 50 55 -5 6 35 12 23 7 73 61 12 8 48 29 19 Mean 46. 8 36. 2 10. 5 Standard dev 23 19 12 Variance 529 361 N = 16 Using the raw data and formula above to calculate the t test value the actual t test value, when calculated properly, is 2. 43. Always remember that S = Standard deviation and that the mean is often times shown by the enceinte letter M rather than a bar mark over a capital X.By tone ending to the appropriate t tables in your text book distinguish the critical value for t at the . 05 confidence interval. The value you should respect is 1. 761 Differences Between Two Means of Correlated Samples Red Bull vs. index Drink. Again using a business example correlated t test statistical processes are used to determine whether or not there is a relationship of a particular measurement variable on a pre and mail service test basis. Often times when there exists a statistically significant relationship on a pre and post test basis the business manager can use th e premier measurement values to predict the second in future situations without having to present a post test situation.Example Using the same data presented above let us assume that there are not two independent groups but the same group under two different conditions noise production environment and non-noise production environment. Individual Noise Production Non-Noise Production difference 1-2 1 38 32 6 2 10 16 -6 3 84 57 27 4 36 28 8 5 50 55 -5 6 35 2 23 7 73 61 12 8 48 29 19 N = 8 The first gait is to compute the mean of the differences _ D = ? D N The second step is to square the differences (6)? + (-6)? + (27)? + (8)? + (5)? + (23)? + (12)? + (19)? The third step is to calculate the standard hallucination of the difference SED = _ ?D D? / n -1 n The last step is to compute the t test value _ t = D / SED Using the raw data and formula above to calculate the t test value the actual t test value, when calculated properly, is 3. 087. By going to the appropriate t tabl es in your text book you can find the critical value to be, at the ? .05 confidence interval is 1. 895.The conclusion drawn is that the differences are statistically significantly different. When to Use Independent Mean or Correlated Sample Difference Testing. In research investigation situations the choice of using an independent sample t test of a correlated sample test is dependent upon whether or not the investigator is seeking to determine differences or relationships. In some situations the need to know whether or not a difference exists between two products or services is more important than knowing if there is a relationship between the two. For example take the consulting psychologist wants to know if training program A has better success in training managers than training method B.The psychologist would select a sample of each training situations (generally 30) and test the success of each sample and compare the success of program A with program B. The results would confir m if one training programs was better that the other. If, however, the psychologist was interested in determining how each program compared to the industry standard the programs would be compared, independently, to the population program mean. On the other hand should the consulting psychologist wants to determine whether or not a relationship exists, or predictability can be determine, from one program in two different situations or under two different situations a correlated t test is used.However, knowing the relationship in pre and post test situations are generally reserved for improvement situations. Drawing Conclusions for the t Test. Any conclusion drawn for the t test statistical is only as good as the research question asked and the null hypothesis formulated. t tests are only used for two sample groups, either on a pre post-test basis or between two samples (independent or dependent). The t test is optimized to deal with small sample numbers which is often the case with b ehavioral scientists in any venue. When samples are excessively large the t test becomes difficult to manage due to the mathematical calculations involved.

No comments:

Post a Comment