Liberal Arts: English-Assessment
Liberal Arts: English-Assessment
Summer 2008
ESL 6638 Assessment of the Adult ESL Student Online
(2 sem. cr.) Dates: June 28-August 9
"Learn the basic principles of testing and evaluation, and how assessment is related to curriculum. Learn how to conduct valid formal and informal assessment of ESL learners. Develop authentic assessment tools for academic and workplace settings. Discuss entrance and exit criteria for ESL services and how to assess student progress. Explore the politics of testing and assessment. Online course and login information at www.hamline.edu/gseonline. Optional orientation session at Hamline on June 28, 10:00 a.m.-12:00p.m. Drew Science 305. Target Audience: ESL teachers of adult learners.
"2 semester credits. Enrollment limit: 29. Online dates: June 28-August 9. Cost $460. Required text and course packet. See Policies & Procedures.
Instructor: Todd Wagner is the assessment and evaluation specialist for Adult Basic Education at the Minnesota Department of Education
"
Required Books:
The Essentials of English: A Writer's Handbook (with APA Style) (Spiral-bound)
by Ann Hogue (Author), from amazon.com List Price: $41.33
Understanding and Using English Grammar (Paperback)
by Betty Schrampfer Azar (Author) "Following are some dialogues between Speaker A and Speaker B..." (more)
, from amazon.com List Price: $48.67
Week #2
Terms
Pyschometrics-"...field of study concerned with the theory and technique of educational and psychological measurement, which includes the measurement of knowledge, abilities, attitudes, and personality traits. The field is primarily concerned with the study of measurement instruments such as questionnaires and tests. It involves two major research tasks, namely: (i) the construction of instruments and procedures for measurement; and (ii) the development and refinement of theoretical approaches to measurement..."
eugenics-"...is a social philosophy which advocates the improvement of human hereditary traits through various forms of intervention.[2] Throughout history, eugenics has been regarded by its various advocates as a social responsibility, an altruistic stance of a society, meant to create healthier and more intelligent people, to save resources, and lessen human suffering."
Craniometry-"..is the technique of measuring the bones of the skull. It is distinct from phrenology, the study of personality and character, and physiognomy, the study of facial features. However, these fields have all claimed the ability to predict traits or intelligence."
*see Rubrics
"scoring scale, is typically created which contains the essential criteria for the task and appropriate levels of performance for each criterion...
holistic rubric-"assigns a level of performance by assessing performance across multiple criteria as a whole"
analytic rubric articulates levels of performance for each criterion so the teacher can assess student performance on each criterion.
valid-"degree to which a certain inference from a test is appropriate and meaningful" (AERA, APA, & NCME, 1985).."
reliable-consistent
washback-effect a test has on teaching and learning
What Is "Washback" and Why Do Some Language Researchers Avoid the Term? , from jalt.org
modality-channel (aural/oral) of language use
Related Sites:
Assessment Glossary, from gwu.edu
Glossary of Assessment Terms, from fetaweb.com
Assessing Special Education Students, from pdf format
English Assessment Terminology, from pdf format
indirect vs. direct tests
*see A Comparison of Direct & Indirect Assessments of Writing Skill, by Hunter M. Breland
discrete-point vs. integrative testing
*see Methods of Assessment, btinternet.com
"Discrete point tests: cannot be pragmatic. There is no ordinary discourse situation and no normal language use context where a learner might be asked to listen and distinguish between "ship" & "sheep" or perform active to passive transformations."
What is a discrete point approach to testing?
"Electronic quiz tools usually involve a discrete point approach to testing as opposed to an integrated or authentic approach, such as papers and projects. Discrete point tests are made up of test questions each of which is meant to measure one content point. Discrete point testing is associated with multiple choice and true/false formats, which have been criticized for testing only recognition knowledge and facilitating guessing and cheating. However, if they are used for an appropriate PURPOSE and if the test questions are well constructed, discrete point tests can be used for effective teaching and learning..."
objective-vs. subjective scoring
*see
Objective and subjective evaluation
"Objective tests are often constructed with selected-response item formats, such as multiple-choice, matching, and true-false. An advantage to including selected-response items in objectively scored tests is that the range of possible answers is limited to the options provided by the test writer—the test taker cannot supply alternative, acceptable responses.
Related Sites:
ASSESSMENT AND TESTING - MASTER ASSIGNMENTaraba
*concurrent validity (wikipedia)-"is demonstrated where a test correlates well with a measure that has previously been validated. The two measures may be for the same construct, or for different, but presumably related, constructs.
For example, if a test measuring job satisfaction gives similar results to those gathered using a job satisfaction test which has been validated in past investigations the new measurement has concurrent validity. Alternately, a measure of job satisfaction might be correlated with work performance. Note that with concurrent validity, the two measures are taken at the same time. This is in contrast to predictive validity, where one measure occurs earlier and is meant to predict some later measure."
*inter-rater reliability (wikipedia)-"iss the degree of agreement among raters. It gives a score of how much homogeneity, or consensus, there is in the ratings given by judges. It is useful in refining the tools given to human judges, for example by determining if a particular scale is appropriate for measuring a particular variable. If various raters do not agree, either the scale is defective or the raters need to be re-trained."
CURRICULUM-BASED TESTS-"..measurement that uses "direct observation and recording of a student's performance in the local curriculum as a basis for gathering information to make instructional decisions" (Deno, 1987, p. 41)...provides a structured way to see how well a child performs on the materials the teachers is assigning the class.(from umn.edu/~devenz)
".. refer to instruments that assess one element/component of a skill at a time" (Hamline Univ)
COMPETENCY-BASED TESTS-""
What is CBT & Characteristics of CBT
"..A skill performed to a specific standard under specific conditions...Assessment of competency takes the participant’s knowledge and attitudes into account but requires actual performance of the competency as the primary source of evidence. ..
DIAGNOSTIC TESTS -""
Test your English language ability
Online diagnostic tests
*internal consistency(wikipedia)-"is a measure based on the correlations between different items on the same test (or the same subscale on a larger test). It measures whether several items that propose to measure the same general construct produce similar scores. For example, if a respondent expressed agreement with the statements "I like to ride bicycles" and "I've enjoyed riding bicycles in the past", and disagreement with the statement "I hate bicycles", this would be indicative of good internal consistency of the test."
Tests
Online Practice Tests, from MBA-Center.net
People
-4 key figures
Francis Galton
"..Victorian polymath: geographer, meteorologist, tropical explorer, founder of differential psychology, inventor of fingerprint identification, pioneer of statistical correlation and regression, convinced hereditarian, eugenicist, proto-geneticist, half-cousin of Charles Darwin and best-selling author..."
Wikipedia
"..created the statistical concept of correlation and widely promoted regression toward the mean. He was the first to apply statistical methods to the study of human differences and inheritance of intelligence, and introduced the use of questionnaires and surveys for collecting data on human communities, which he needed for genealogical and biographical works and for his anthropometric studies. He was a pioneer in eugenics, coining the very term itself and the phrase "nature versus nurture". As an investigator of the human mind, he founded psychometrics (the science of measuring mental faculties) and differential psychology. He devised a method for classifying fingerprints that proved useful in forensic science. As the initiator of scientific meteorology, he devised the first weather map, proposed a theory of anticyclones, and was the first to establish a complete record of short-term climatic phenomena on a European scale.[1] He also invented the Galton Whistle for testing differential hearing ability..."
Charles Spearman, from Indiana.edu
"..Yet, his technical and theoretical contributions to the development of intelligence research as a scientific enterprise cannot be overstated. He was the first to offer a tenable psychometric definition of intelligence, and is therefore considered to be the father of classical test theory (Jensen, 1994). In a famous article, “General Intelligence’ Objectively Determined and Measured” (1904), Spearman proposed the idea that intelligent behavior is generated by a single, unitary quality within the human mind or brain. Spearman derived this theoretical entity, called the general factor, or simply g, through a new statistical technique that analyzed the correlations among a set of variables. This technique, called factor analysis, demonstrated that scores on all mental tests are positively correlated; this offered compelling evidence that all intelligent behavior is derived from one metaphorical pool of mental energy. Although proponents of multiple intelligence theory reject this interpretation, factor analysis remains one of the most important tools in 21 st century intelligence research..."
Wikipedia
"..FRS (September 10, 1863 - September 17, 1945) was an English psychologist known for work in statistics, as a pioneer of factor analysis, and for Spearman's rank correlation coefficient. He also did seminal work on models for human intelligence, including his theory that disparate cognitive test scores reflect a single general factor and coining the term g factor..."
Sir Cyril Burt, from indiana.edu
... remains one of the most complex and intriguing figures in the history of intelligence testing. He was a pioneer of educational psychology in England and was one of the most respected and honored psychologists of his time. However, he had controversial ideas regarding the heritability of intelligence, and there is ample evidence that he used fraudulent data to support his views (Scarr, 1994)..."
Wikipedia
".. (March 3, 1883 – October 10, 1971) was an English educational psychologist who contributed to [1] educational psychology and claimed to have developed the method of factor analysis in psychological testing,[2] although his mentor and predecessor as chair of the psychology department at University College London, Charles Spearman actually did so.[3] Burt is known for his studies on the effect of heredity on intelligence. Shortly after he died, his studies of inheritance and intelligence came into disrepute after evidence emerged indicating he had falsified research data,[2] in addition to having falsely claimed to have invented factor analysis. Some scholars have asserted that Burt did not commit intentional fraud.[citation needed].."
Louis Leon Thurstone, from indiana.edu
"... made significant contributions in many areas of psychology, including psychometrics, statistics, and the study of human intelligence. He developed methods for scaling psychological measures, assessing attitudes, and test theory, among many other influential contributions. He is best known for the development of new factor analytic techniques to determine the number and nature of latent constructs within a set of observed variables..."
Wikipedia
"..29 May 1887–30 September 1955) was a U.S. pioneer in the fields of psychometrics and psychophysics. He conceived the approach to measurement known as the law of comparative judgment, and is well known for his contributions to factor analysis..."
Alfred Binet, from indiana.edu
"..is most widely known for his contributions to intelligence. Wolf (1973) postulates that this is the result of his not being affiliation with a major university. Because Binet did not have any formalized graduate study in psychology, he did not hold a professorship with a prestigious institution where students and funds would be sure to perpetuate his work (Siegler, 1992). Additionally, his more progressive theories did not provide the practical utility that his intelligence scale would evoke..."
Wikipedia
"..(July 8, 1857 – October 18, 1911), French psychologist and inventor of the first usable intelligence test, the basis of today's IQ test.
Born in Nice, Binet was a French psychologist who published the first modern intelligence test, the Binet-Simon intelligence scale, in 1905. His principal goal was to identify students who needed special help in coping with the school curriculum. Along with his collaborator Théodore Simon, Binet published revisions of his intelligence scale in 1908 and 1911, the last appearing just before his untimely death. A further refinement of the Binet-Simon scale was published in 1916 by Lewis M. Terman, from Stanford University, who incorporated the German psychologist William Stern's proposal that an individual's intelligence level be measured as an intelligence quotient (I.Q.). Terman's test, which he named the Stanford-Binet Intelligence Scale formed the basis for one of the modern intelligence tests still commonly used today. They are all colloquially known as IQ tests..."
William Stern, from Wikipedia
"..(April 29, 1871 - March 27, 1938), born Wilhelm Louis Stern, was a German psychologist and philosopher noted as a pioneer in the field of the psychology of personality and intelligence. He was the inventor of the concept of the intelligence quotient, or IQ, later used by Lewis Terman and other researchers in the development of the first IQ tests, based on the work of Alfred Binet. He was the father of the German writer and philosopher Günther Anders. In 1897, Stern invented the tone variator, allowing him to research human perception of sound in an unprecedented way..."
Henry H. Goddard, from Wikipedia
"..(August 14, 1866 – June 18, 1957) was a prominent American psychologist and eugenicist in the early 20th century. He is known especially for his 1912 work The Kallikak Family: A Study in the Heredity of Feeble-Mindedness, which he himself came to regard as deeply flawed, and for being the first to translate the Binet intelligence test into English in 1908 and distributing an estimated 22,000 copies of the translated test across the United States; he also introduced the term "moron" into the field. He was the leading advocate for the use of intelligence testing in societal institutions including hospitals, schools, the legal system and the military. He played a major role in the emerging field of clinical psychology, in 1911 helped to write the first U.S. law requiring that blind, deaf and mentally retarded children be provided special education within public school systems, and in 1914 became the first American psychologist to testify in court that subnormal intelligence should limit the criminal responsibility of defendants..."
Lewis Madison Terman, from Wikipedia
"... (born 15 January 1877 in Johnson County, Indiana, died 21 December 1956 in Palo Alto, California) was a U.S psychologist, noted as a pioneer in cognitive psychology in the early 20th century at Stanford University. He is best known as the inventor of the Stanford-Binet IQ test. He was a prominent eugenicist and was a member of the Human Betterment Foundation."
Theories
item response theory (IRT), from wikipedia
"... is a body of theory describing the application of mathematical models to data from questionnaires and tests as a basis for measuring abilities, attitudes, or other variables. It is used for statistical analysis and development of assessments, often for high stakes tests such as the Graduate Record Examination. At its most basic level, it is based on the idea that the probability of getting an item correct is a function of a latent trait or ability. For example, a person with higher intelligence would be more likely to correctly respond to a given item on an intelligence test..."
Gardner's Multiple Intelligences and Sternberg's Triarchic Theory
howard gardner, multiple intelligences and education
"The theory of multiple intelligences: ...Howard Gardner viewed intelligence as 'the capacity to solve problems or to fashion products that are valued in one or more cultural setting' (Gardner & Hatch, 1989). He reviewed the literature using eight criteria or 'signs' of an intelligence:.."
Wikipedia
"..The theory suggests that, rather than relying on a uniform curriculum, schools should offer "individual-centered education", with curriculum tailored to the needs of each child.[1] (This includes working to help students develop the intelligences in which they are weaker.).."
Current Issues
No Child Left Behind
"..Under No Child Left Behind, states are working to close the achievement gap and make sure all students, including those who are disadvantaged, achieve academic proficiency. Annual state and school district report cards inform parents and communities about state and school progress. Schools that do not make progress must provide supplemental services, such as free tutoring or after-school assistance; take corrective actions; and, if still not making adequate yearly progress after five years, make dramatic changes to the way the school is run..."
The Nazi Model For Outcome-Based Education, from crossroad.to
"..In spite of the information explosion, America hasn't heard the message. Perhaps our leading change agents don't know what their deceptive strategies and dumbed-down curricula will do to our children. Maybe they haven't noticed the similarities between their educational strategies and former Nazi tactics for molding young minds and teaching group conformity. Perhaps today's psychological manipulations are simply a modern expression of human "wisdom" without biblical guidelines..."
A Rouge Forum Broadside
The Fascist Origins of the SAT Test
Rich Gibson, San Diego State University, April 2001
"..The SAT measures, above all else, class, sex, and race. (Fairtest, Roney). The SAT, like every similar test, is designed to divide people with razor sharp precision, to enumerate human value and to track people's futures under a veil of objective science. The SAT is a commodity itself, for sale to every student, school, and college in the world. It also commodifies people, attaching worth to individuals, but more pointedly drawing the lines of what can only be called class warfare. The impact of the SAT is to create the logic for a more deeply stratified society, divided primarily by issues of inherited income, sexism, and racism. The fraudulent claims of the SAT to promote a more equitable and meritorious society have been thoroughly revealed elsewhere. (Lemann, Fairtest, Roney). This analysis is a brief history of the SAT, unmasking the politics of the people who designed it, and those who promote the SAT and similar exams today ..."
The Bell Curve, Wikipedia
"..Its central point is that intelligence is a better predictor of many factors including financial income, job performance, unwed pregnancy, and crime than parents' socioeconomic status or education level. Also, the book argued that those with high intelligence, which it called the "cognitive elite", are becoming separated from the general population of those with average and below-average intelligence and that this was a dangerous social trend..."
The Bell Curve and the IQ controversy, from hartford-hwp.com
RACISM:
Failing the Test of Fairness: Institutional Racism and the SAT, from alternet.org
"..The fact is, even if such biased items are removed from the SAT, the unequal educational experience of the students taking the test -- especially in terms of class and race -- all but guarantees a persistent scoring gap between whites on the one hand, and blacks, American Indians or Latinos on the other.
CFP: Race and Racism in Writing Assessment (edited collection)
May 17, 2008 - 10:56 — Asao B. Inoue, from wpacouncil.org
Failing the Test of Fairness, Institutional Racism and the SAT
Posted: Monday, August 12, 2002
By Tim Wise
"..Black students are well aware of the negative stereotypes held about them by members of the larger society. As such, when blacks who are highly motivated and value educational achievement take a standardized test and expect the results to be used to indicate cognitive ability, the fear of living down to the stereotype negatively impacts their performance. These students may rush through the test—so as to seem more confident than they truly are—or alternately take too much time, trying desperately not to make mistakes. The self-doubt engendered by the racist beliefs of the larger culture is added to the general anxiety that all test-takers feel, to produce, for black students, a unique disadvantage. ..."
History Lesson
Flawed From The Start
The History of the SAT
by Matt Pacenza from journalism.nyu.edu
"..Brigham had begun administering the SAT to test groups in 1926, and had concluded by 1933, when he met Chauncey, that his SAT reliably predicted academic success. Conant and Chauncey seized upon this conclusion, began administering the test to students who wished to win scholarships at Harvard, and convinced many of their Ivy League colleagues to follow suit.."
Examples of Jim Crow Laws
"...Education [The County Board of Education] shall provide schools of two kinds; those for white children and those for colored children. Texas "
DISCUSSION BOARD:
-Topic: Doubters and Believers
Date: Tue Jul 15 2008 23:27
Author: M, Sal
Subject: certificate to show! Re: #2 oppressive and humiliating
"yes, students would like to see scores or some way to know they are doing well. I had one student (from Bulgaria) that asked me and my "supervisor" if we could give him a certificate. He wanted to go back to his home country to show his people (future place of employment, family, etc..) that he took an ESL course while working (was a intern as a vet in a local nearby pig farm) in America! It was the first time my "supervisor" and I ever been asked this, which we will try to do more often now!"
Date: Wed Jul 16 2008 15:10
Author: W, Todd
Subject: Re: certificate to show! Re: #2 oppressive and humiliating
"
Several programs around the country, including Minneapolis, issue certificates to students based on reaching certain score/proficiency levels on the CASAS tests. My understanding is that this process has been well-received in Minneapolis by both students and teachers. Implementing this process also promoted a great deal of consideration of the CASAS compentencies and whether they constitute a reasonable/quality/useful/valuable content domain of instruction for an ESL program. My understanding is that that process was also well received and that many teachers who were skeptical of relying heavily on the CASAS competencies have found the compentencies, on closer examination, to be reasonable/useful.-Todd
Date: Fri Jul 11 2008 21:06
Author: C, Matthew
Subject: Re: #2 oppressive and humiliating
"
When I worked at a intensive English language school, I often had students who craved more drill than we offered. Some would purchase Raymond Murphy's Grammar in Use texts on their own. The underground movement got so popular that we built an elective around the text. At the time, my Academic Director argued that at least that way we could insure that they had some communication based activities along with the grammar.
-Topic: History of Educational Testing
Forum: History of Educational Testing
Date: Sun Jul 06 2008 13:57
Author: H, Gregory
Subject: History of Testing
"Hello Classmates,
I just read the Power Point about the History of testing and it scares me. It seems that our whole approach to achievement testing in the U.S.A. has imitated experiments and practices of people who we deemed were "credible experts". What´s up with the craniometry test to determine intelligence? It seems a little troubling to me that the "IQ" test came from a method Alfred Binet developed to find out the causes why French school children where not meeting curriculum standards. Apparently no one knows how that test can be equated to a true representation of intelligence.
After reading over the 8th grade proficiency exam from a Kansas school, I wonder if this should be more like aCommunityy College proficiency exam nowdays. The questions are almost all based on theory or rote knowledge.
This article gets me a bit riled up, which I believe is the point.
Are achievement tests some means of control? Are standardized tests some means of social domination?
I think we should try to find out a way to test achievement based upon real life situations and performance of tasks. It seems like almost all tests are based on theory or are out of context for the people taking the tests.
Date: Thu Jul 10 2008 14:36
Author: R, Joyce
Subject: Re: Zappardino and the History of Educational Testing
"
I'd like to be candid and say that I never thought studying the history of assessment would be so intersting. (Sorry, Todd) The good news is that this is intriguing.
Historically, I was unaware of the newnesss of the study of the mind psychology" and so aligining Binet's efforts and the entree into scientic methods of measuring elements of the mind helped me to understand how the scientific and lay communities could be awed and motivated to accept answers about the unknown elements of the mind. Aren't we all tempted to take alittle test to find out more about ourselves?
The idea that testing is a socially imbedded activity is important for us to remember as instructors who interface cross-culturally. The cloak of science that covered those early researchers political agendas may well seem obvious to some cultures represented by our students i.e. they may discern that we have motives beyond their benefit in our obsession with formal testing.
Zappardino seems to refer to the "faith" we have/ had in the numbers yielded from testing. I couldn't help but think how often we say "numbers don't lie". We sort of exalt numbers/statistics and the interpretations. Maybe I'm carrying this too far, but there is a scripture that says, "God is not a man that he should lie." So for me, this "numbers don't lie" thing puts numeric results on a pretty high level.
I'm in the dark about how one changes the culture of standardized testing, but I don't want to be a gatekeeper that holds folks at bay from educational or other opportunity, helping them "settle" and come to terms with a lesser view of themselves and their potential. I want to remember the idea of testing being culturally bound, even my own created tests.
Culturally speaking, I have one small anecdote: When we moved to MN from CA our youngest daughter was given a kindergarten readiness test. She was asked to name five jor six things you can do with water. She replied, "Drink it, swim in it, wash dishes." The tester smiled condescendingly, "How about skate on it? When it snows, make snowmen?" later she told me that I might need to read more to my child to help her think in broader contexts. WHAT? Hey, we lived on the beach! We drank it, swam in it and washed up! - Joyce
-Topic: Other Module 2 Issues
Forum: Other Module 2 Issues
Date: Thu Jul 10 2008 00:11
Author: M, Sal
Subject: Good tip for assessment from O'Malley
"
I'm still finishing the reading for Module 2, but I wanted to comment on one particular advice O'Malley gave. It was on "Designing Authentic Assessments" (Ch. 2 pp. 17-18). I live in a small college town (5,000 pop) here in Morris, MN and thought the tip on "conducting professional development..." would be very useful here. I plan to contact some professors at the local campus and ask for their advice on planning authentic assessment. Very informative so far in this reading assignment!
Date: Thu Jul 10 2008 22:40
Author: M, Sal
Subject: The importance of meeting the students' needs
'
Yes, reading Chapter 1 "Assessment as Information Gathering" by Kathleen M. Bailey was very helpful in the part of meeting the students' needs (pp.5-6). We are sometimes given guidelines on what to teach. However, we also need to consider the students' needs, which we'll know once we know the information about them as quoted by Diane's (teacher) dialogue with Kathi (author giving advice). I see this as very true when I teach ESL to local migrant "dairy farmers" compared to other students with a different profession/job. What I've learned from teaching ESL as a volunteer in the last 4 years, the students seem more interested on learning terminology relating to their workplace. They "want" to learn English to improve their work situation. As supposed to a learner that does not work in the same occupation/field or even as a homemaker. "
Date: Fri Jul 11 2008 10:15
Author: M, Sal
Subject: "Massive Systemic ignorance and blind reliance on standardized est scores..."...
"
Wow, the quote above I partly took from Bailey's response on Ellie's (teacher w/three titles in a California school) response to the "school system" she was part of just summarizes one of many stories out there! Reading this (Chapter #3-Conflicting Purposes of Assessment) really got my attention and brought some personal experiences (as a student in grade school and teacher in ESL as an adult). In "grade school", I was close to being sent back because of my "learning disabilities". My parents immigrated here from the Philippines and never taught me their native language (Tagalog). They were afraid of me and my sibling being "left out" in our English-speaking dominant "new home country" (USA). However, my parents would speak Tagalog to each other in the home and their "English" (spoken differently in the Philippies, like Britain somewhat) was a little different. Thus, going to school, I would have trouble-another factor was their "passive" cultural life-style. English grammar was never every my strong quality (math was) growing-up! Anyways, going back to being a "current ESL teacher" (volunteer), I can relate to some of my students (both adult and non-adult). When I tell people in the small rural-college town community (Morris, Minnesota) that some of my students have "college degrees", they get suprised! One of my students (grown-up adult) was a "migrant worker" from Mexico, who got a college degree as a veterinarian . He came to our class because he just wanted to learn English! Just reading the story of Ellie makes me wonder how many school systems has "mis judged" or labeled students with a "learning disability" due to their language being "Non-English"! Wow, great story!
Forum: Other Module 2 Issues
Date: Fri Jul 11 2008 17:43
Author: M, Sal
Subject: Contrasting Pairs of Concepts-examples?
"
Hi all,
I really haven't had that much experience with creating my own testing and scoring of ELL or ESL Students. I was wondering if anybody has and willing to share their experiences relating to the "three contrasting pairs of concepts" Bailey shares in Chapter 6?
Forum: Other Module 2 Issues
Date: Fri Jul 11 2008 18:48
Author: M, Jeannine
Subject: Re: Contrasting Pairs of Concepts-examples?
" One of our English department faculty, Susan Bosher, has done a lot of work with multiple choice testing with ELL health careers students. She uses language simplification techniques. I have one of her journal articles on my other computer. I'll find it and send it as an attachment. In the past I've taken my exams to Susan and had her review the questions. It is extremely helpful. I've also found that the modifications I make based on her suggestions make the test much more understandable for all students- not just English learners. Writing exams, as you probably know, is really hard. JMH "
Author: F, Patricia
Subject: Re: Statement #4 How well do I know my students?
"I also agree with the statement that we can't be everywhere at all times. Even a poorly designed test can give us some information. Say we hand the student a test during registration on the first day that asks them to write their grocery list from last week rather than their name, address, etc. That is neither reliable nor valid. If the student is unable to write one item down on the paper, however, we can take and educated guess that they may not be able to read the instructions or may not be able to write the answers.
Greg, here is a prime example of your statement on being mislead by a student's proficiency in one area. I had a 50 something man from Mexico that came to class one evening. He talked with all the instructors, trading stories and information for more than 15 minutes at a level I would have said was close to a native speaker of English. When we handed him the registration forms he informed us he could not read or write in English or Spanish. That was during my first semester of teaching and it was a lesson I will never forget.
Author: L, Anne
Subject: 1-4 thoughts
"
...2- I tend to go back and forth on this one. If the assessments are relevant and address the needs of the students, then I think assessments are not oppressive and humiliating. However, “throwing” someone into an assessment who has never tested before is oppressive and humiliating.
Week #3
I. The Four Skills (Listening, Speaking, Reading, & Writing) and Hierarchical Components of Language
A. Syntax and Lexicon
1. syntax (wikipedia)-"study of the principles and rules for constructing sentences in natural languages"
2. Lexicon (wikipedia)-"its vocabulary, including its words and expressions. More formally, it is a language's inventory of lexemes."
B. Morphology (wikipedia)-"field of linguistics that studies the internal structure of words. (Words as units in the lexicon are the subject matter of lexicology.) While words are generally accepted as being (with clitics) the smallest units of syntax, it is clear that in most (if not all) languages, words can be related to other words by rules. For example, English speakers recognize that the words dog, dogs, and dog-catcher are closely related. English speakers recognize these relations from their tacit knowledge of the rules of word-formation in English. They intuit that dog is to dogs as cat is to cats; similarly, dog is to dog-catcher as dish is to dishwasher. The rules understood by the speaker reflect specific patterns (or regularities) in the way words are formed from smaller units and how those smaller units interact in speech. In this way, morphology is the branch of linguistics that studies patterns of word-formation within and across languages, and attempts to formulate rules that model the knowledge of the speakers of those languages..."
C. Phonetics and Phonology
1. Phonetics (wikipedia)-"the study of the physical sounds of human speech. It is concerned with the physical properties of speech sounds (phones), and the processes of their physiological production, auditory reception, and neurophysiological perception."
2. Phonology- (wikipedia)-"is a subfield of linguistics which studies the sound system of a specific language or set of languages. Whereas phonetics is about the physical production and perception of the sounds of speech, phonology describes the way sounds function within a given language or across languages."
Related Sites:
Linguistics-Syntax, from harmony.org.uk
Other Terminology
Formative v.s. Summative Evaluation, jan.ucc.nau.edu
Formative:
"Formative Evaluation is a bit more complex than summative evaluation. It is done with a small group of people to "test run" various aspects of instructional materials. For example, you might ask a friend to look over your web pages to see if they are graphically pleasing, if there are errors you've missed, if it has navigational problems. It's like having someone look over your shoulder during the development phase to help you catch things that you miss, but a fresh set of eye might not. At times, you might need to have this help from a target audience. For example, if you're designing learning materials for third graders, you should have a third grader as part of your Formative Evaluation."
Summative:
"Summative evaluation provides information on the product's efficacy ( it's ability to do what it was designed to do). For example, did the learners learn what they were supposed to learn after using the instructional module. In a sense, it lets the learner know "how they did," but more importantly, by looking at how the learner's did, it helps you know whether the product teaches what it is supposed to teach."
Summative evaluation is typically quantitative, using numeric scores or letter grades to assess learner achievement.
3.1
2. Understanding Adult ESL Content Standards by Sara Young , pdf format
(standard-based) project indicators/benchmarks
performance assessment task
Adult Ed Content Standards
Standards-Literacy Tent Wiki
-The CASAS system – assessing English language learners, from casas.org
TESOL
-Equipped For the Future (EFF)
\
"..Other research studies found that teachers’ perceptions
of content standards and the professional development
provided for them play a strong role in the successful
implementation of standards. Specific professional development
about standards-based curriculum and assessment
contributed to better student math achievement in
California, according to a self-report of 1,000 K-12 teachers
(Cohen & Hill, 2000). Case studies and descriptions of professional
development to support standards-based education
in K-12 document a wide range of activities used by
school districts to train teachers, including workshops and
conferences, institutes, observations of expert teachers,
mentoring, study groups, grade-level meetings with other
teachers, site-based inquiry seminars, training and support
to use new curricula and standards, and certification training
(Bye, 2004; Dutro, Fisk,..-pp 5 of 6
World-class Instructional Design
and Assessment (WIDA)-developed English language proficiency
assessments
"...Research on K-12 standardsbased
education shows that student progress depends to
a great extent on teacher professional development and
commitment to the standards..."-pp 5 of 6
Resources:
The Splendid ESOL Web, Pima College Adult Education
National Institute for Literacy’s Adult Education Content
Standards Discussion List
Mid-continent Research for Education and Learning
Standards, reporting, outcomes, accountability and
teacher credentials - What’s online
Assignment:
Compare-
-CASAS
-EFF (standards)
"By identifying four categories of generative skills, EFF broadened the range of skills adult literacy and basic skills basic programs are typically expected to cover. These skills include strong reading, writing, and math skills; they include the skills we need to communicate and work well with others; to solve problems and to keep up with change. These categories of skills (see side bar) include those we traditionally think of as interpersonal skills, and those decision-making and learning skills we traditionally talk about as "higher order" or critical thinking. Our goal in proposing this range of standards and in framing them as we did was to shift the focus of adult literacy and basic skills instruction and assessment away from a decontextualized skills-based curriculum toward a contextualized, practice-based curriculum that was better matched to and firmly grounded in learners' own purposes for returning to schools."
Four Categories of EFF Skills: Communication Skills, Decision-Making Skills, Interpersonal Skills
, and Lifelong Learning Skills
-Massachussetts, Curriculum Framework for ESOL
...Teachers and counselors need to meet with learners early on in the program in order to identify students' most urgent needs for using English...smaller immigrant groups may have a pressing need to learn how to communicate with doctors or shop owners in English;-pp 9
...For the language classroom this implies that students will benefit from practice in listening, speaking, reading, and writing that takes into account the kinds of communication that happen in different contexts outside the classroom.."-pp 10
"..Not all students will master a skill before the class moves on, but they may master it later, as the teacher focuses on a different context or topic. This is an especially important concept as teachers try to address individual learner needs in programs that are multilevel and/or open entry.
Teachers should be aware that students try to make sense of a new language and construct rules of how English works, often based on how their home language works; this concept is referred to as “interlanguage”...-pp11
"As much as possible, classroom materials should reflect the diversity of our students and those of the range of groups who live in the US without oversimplifying and stereotyping.
As students learn about American culture, their own culture needs to be validated. They should understand that they don't have to lose the personal identity and values they came here with and change themselves into a new “American” person..-PP 13
"..Teachers should also encourage learners to take advantage of learning opportunities outside the classroom. Where possible, instructors should plan class activities or assign homework that give students practice in learning on their own. These could include “eavesdropping” on the conversations of native speakers, by talking to people at work or their children's school, by watching TV or listening to the radio, or by reading magazines and newspapers..."-pp 14
# 6 standards look like in more detail at the six proficiency levels. (pp 17-
1 & 2. "The Listening Strand and the Speaking Strand:
These two strands include the skills that focus on developing fluency and making and gaining meaning in oral communication...
3. "The Reading Strand:
This strand includes the skills necessary to interpret printed material, including charts, graphs, schedules, and environmental print. Those skills include symbol mastery, phonological awareness, decoding, word recognition, word analysis, and comprehension.
4. "The Writing Strand:
This strand includes the skills necessary for both physical and intellectual mastery of written communication. Arenas of competence include language structure and mechanics, organization, and fluency. Development of every writer’s unique and personal “voice” is also valued."
5. "The Intercultural Knowledge and Skills Strand:
..The focus in this strand is on developing an awareness of cultural differences and attitudes and in developing the skills and knowledge needed to function in a culturally diverse society..."
6. "The Navigating Systems Strand:
..This includes knowing what opportunities exist and acting in accordance with both their rights and responsibilities within a particular system. It includes knowing about both mainstream systems (e.g. housing) and resources that are available to them in advocating within those systems (e.g. tenants' rights groups). Some systems impose problems or barriers on students, some systems (e.g. libraries) are opportunities, and many systems are a combination of barrier and opportunity.
*needs base
Other Related Sites:
ACCUPLACER Test Practice Questions - Help your ACCUPLACER Exam ..., from testprepreview.com
*referred by Joyce R. (7/14/08)
"..The system uses the Accuplacer placement tests and set cut scores for placement in transfer-credit-bearing reading, writing and math classes. As you might guess, the placement criterion for reading is an Accuplacer reading sub-test score and for math classes is a Accuplacer math sub-test score. For writing, however, the review committee found the Accuplacer reading subtest score to be a better predictor of success in writing classes than the Accuplacer language subtest score! The language subtest is a multiple-guess type format that focuses on "knowledge" of writing components. The Accuplacer test suite also includes a "written essay" sub-test that, somewhat amazingly is scored by the computer. My understanding is that while some faculty found the written essay to be a better predictor of success in writing classes, there was resistance to mandated use of it because it takes longer to administer and because some faculty are still skeptical of the idea that a "computer" could score a writing sample validly and reliably. (Note that the reliability and validity of the computer scoring is well-established. I haven't had a chance to learn about how the scoring algorythm works, e.g. sentence length, word use, etc., but am defintely curious to find out.)
Anyway, point is that a reading test being a better predictor of writing class performance than a "writing" test suggests that the tests are measuring "writing" pretty indirectly...."-Todd W. (7/17/08 on Blackboard)
Best Literacy, from cal.org
*referred by Patricia "Patti" F. & Christina (7/14/08)
DISCUSSION BOARD:
-Topic: 3.1 Adult ESL Instruction Content Domain
Date: Mon Jul 14 2008 17:11
Author: M, Sal
Subject: Re: CASAS
"
Yes, we use the CASAS here (Morris Literacy Project-Morris, Minnesota) through the state. I personally feel it's been an "ok" measuring tool. We give it (entry) in the beginning and ending (exit) to see their progression. However, their conversation skills better tells us how they progress.
Just a little note: From my years giving it to our students, the "migrant workers" from Mexico always get the month/date/year (e.g. 7/13/08) question wrong. In Mexico (day/month/year-13/7/08), the order is different. This shows one example how CASAS can be "non-culturally friendly".
Date: Mon Jul 14 2008 23:35
Author: M, Sal
Subject: Re: CASAS-Massachussets (don't like the name-lol)would be my choice because...
"
yes...in addition I feel the "Massachusetts, Curriculum Framework" would be the most useful of the 3 (CASAS, EFF, ..). Besides the title "Curriculum" ("curriculum based tests") and it being named after a state (sport dominate and take MN players-"KG", "Big Pappy", "Moss", etc..) I'm not fond of-j/k!-this would be my choice! It seems to have more of a cultural focus (see #5 Strand-The Intercultural Knowledge and Skills Strand), which would help many of my students (mostly migrant workers) in Morris, MN. Also, it serves ("needs base analysis of students") the "needs of the students" (see #6 The Navigating Systems Strand), which many come with a motivation to learn English for a particular reason (e.g. medical issues as was well put with stories-quotes in this particular "set"). This makes the CASAS look bad, which the Minnesota Literacy Council should look into implementing!
Author: Wagner, Todd
Subject: Re: functioning within culture
"There are three "models" for ABE/ESL programming in the use: K-12 system based, community college based and workforce system based. Each has strengths and weaknesses. For example, three big strengths in Minnesota's model are: substantial state funding ($6 state funds for everyone $1 federal funds, which is among the highest ratios in the country) which results in part from every school district in the state receiving funds (easier to get legislative agreement as funds go to all districts); integration of the volunteer/community-based organization programming into the system, i.e. volunteered based programming generates funding (in most states the literacy-council based system typically operates independently of the state and federally funded system); and third the many "points of entry" to the system that the first two strengths yield.
One weakness of the Minnesota system, which we are addressing with a three-year transition initiative (which is yielding a great deal of success) is that we have not been as well coordinated with the higher education system as would be ideal. This has been particularly problematic in ESL programming as some colleges have established several "levels" of ESL programming. In those programs, students may use up much of their financial aid progressing through the ESL levels and then the developmental education classes. Students are often attracted to those programs as they like the idea that they are going to college. We are, in part, addresses this issue through our transition initiative by promoted location of ABE programming on as many campuses as possible.
Anyway, after that long digression, my sense is that in Minnesota the vast majority of ESL students taking paying for ESL classes at colleges are there to "go to college/tech college" and not just for ESL classes. Note that, in states where the community college system hosts ABE/ESL programming students may be attending for ESL, GED prep, adult diploma only and not planning to continue with college course work.
-Todd
Subject: Re: functioning within culture
"
I hear that so much from my adults that they are losing thier culture and that the kids don't care, all they want to be is American. I just keep telling them that they can still keep and practice their family traditions at home and that maybe someday their kids will realize the importance of there heritage, so try to be patient:0) My ansestors came from Germany about 5 generations ago, but we still celebrate Dec. 6th and put our shoes out for St. Nick. This is something that I continue with my children. I think that explaining tradtions in culture is important to know the why behind it when you teach or talk about it. -Anne
"Subject: Re: Hard Choices
"
Overwhelmed and mind boogling....I asked our ESL coordinator how students are assessed for admission. She listed a battery of tests: Michigan English language battery (MELAB), MELICET, listening testing, and essay testing. Our other campus uses COMPASS, CELSA and CELT. I'll have to look up each of these because I've never heard of them before! She is apparently the only reader of the essay which she grades holistically. Students take 2 English classes for non-Native speakers. Interestingly, the students who struggle most are often those who were never assessed because they were in the US school system for 8+ years. These students have 'fossilized speaking/ writing errors' that become apparent in college level work. I love that term- fossilized."-Jeanine M.
Subject: Re: Hard Choices
"
My take is that, really, no one has this figured out. As a field we swing between doing no testing and throwing every test we can think of at students as Jeannine describes. I think this occurs, in part, because tests and testing are a bit of a "black box" to most educators. We don't really understand them and so sometimes fear their use has a negative impact on programming and at other times believe they have an almost magical ability to place students, determine success, etc.
One example of this has occured in relation to our post-secondary transitions (ABE/ESL to college) initiative here in Minnesota. Staff from programs that have complained for years that the testing required, e.g. CASAS for NRS purposes tells us nothing about how students or their programs are doing are now busy teaching to the Accuplacer. And I now find myself pointing out that doing well on the Accuplacer does not necessarily mean that a student will do well in credit-level college classes. The Accuplacer is a predictor tests like the SAT or ACT and, in all three cases, they are only crude predictors. What the programs should ideally be focusing on is articulating instruction with the college courses that students will be taking to help ensure they are prepared to function successfully in class.
I think we all need to be as clear and intentional as possible when developing, selecting, using and interpreting results from tests. The steps required for the class project attempt to provide a model for doing so.-
Todd
Subject: Re: Hard Choices
"
Joyce,
Each ABE consortium in Minnesota received funding for use in implementing the transition initiative this year. There are 53 consortia in Minnesota and some are very large, e.g. the Arrowhead Economic Opportunity Agency consortium covers the area east and north of a line drawn from Duluth to Grand Rapids to International Falls. Most consortia include multiple, in some cases dozens, of program sites/centers and not all sites will be implementing the transition initiative or providing tranistion-related services.
Regarding articulation, however, we have many programs around the state that have already developed or are working on developing various kinds of "pre" programming, e.g. pre-Certified Nurse Assistant, pre-Welding, etc. We also have several programs that have implemented a college prep program called Mindquest Academy with two of those programs co-located at MCTC in Minneapolis and South Central in Mankato. One of the goals of our professional development program, which is called ATLAS and is hosted at Hamline University, for next year is to work on providing professional development opportunities related to articulation. My guess is that most of the public two-year colleges in Minnesota are either collaborating with an ABE program already or have at least been approached by one or more ABE consortia in the past year regarding collaborating.
Mindquest is at http://www.mindquestacademy.org/
ATLAS is at http://www.hamline.edu/gse/academics/centers/sltl/atlas/index.html
ATLAS' transition info is at http://www.hamline.edu/gse/academics/centers/sltl/atlas/transitions.html
.
For some examples of "pre" programs http://mnabe.themlc.org/Promising_Practices.html.
-
Todd
Subject: Re: functioning within culture
"
This conversation on acculturation is very interesting. I work on an NIH grant working with Somali women and girls. The concerns about US culture, particularly how it affects their sons is a concern repeated often in our focus groups. Some women are willing to send their sons back to Somalia rather than to lose them to US culture. The other issue, of course, is language. Many children out pace their parents in English language skills, resulting in a power shift in the family structure and function. Have you ever read "The Middle of Everywhere" by Mary Pipher? She give a wonderful language-related example of a young man who tells his parents (who don't speak English well) that a black leather coat is part of his school uniform. The parents use all their money to buy that for him.
By the way, the belief that past generations of immigrants 'assimulated' into US society easier and more willingly than current immigrants is largely a myth, particularly in Minnesota. Many immigrant groups lived in urban enclaves or farming communities and self-segregated themselves by language and/ or religion. Look at the Catholic churches in St. Paul- St. Francis de Sales was German, St. James: Irish, St. Stan's: Czech and each resides within one mile of the other near W7th in St. Paul..."-Jeannine M.
3.4
the Purposes Brainstorm page, pbwiki
Three Challenges in English Language Assessment, html format & pdf format by Carol A. Chapelle & Joan Jamieson
"..Terms such as “placement,”
“proficiency,” and
“achievement” might be used
in response to a question about
what a test is supposed to be
used for. These terms may work
for the public, but teachers
need a more systematic,
thorough, and accurate way of
considering the purpose of an
English Language assessment
because the purpose of the
assessment is critical for
choosing or developing a good
test. We think of test purpose
as consisting of three
interrelated concepts:
• One way to look at test
purpose is to consider the
inference that test users
want to make on the basis
of test scores. For example,
we designed Longman
English Assessment to allow
test users to make an
inference about examinees’
English Language reading
comprehension, listening
comprehension, and
knowledge of written
structures. Making an
inference about these three
aspects of English ability
was one way of looking at
the test purpose.
• A second way is to
consider the use of the test
results. Typically, a test is
given because a decision
has to be made about
examinees, for example,
their readiness to study at a
university, their eligibility
for a job, or the grade that
hey should receive in a
course. Longman English
Assessment is not intended
for any of these relatively
high-stakes test uses;
instead, it is intended to
give learners an idea of
their level of English
ability and to offer level-
appropriate advice for
improving their English.
• The third aspect of test
purpose is any other effect
that the test is intended to
have, such as serving as a
motivating, interesting, and
informative experience for
learners."
3.5
Focusing the Question PowerPoint, pdf format
Final Project
Design of an Assessment Instrument: "Real-Life Skills"
July 17th of 2008
CONTEXT
This assessment is designed for classroom settings to help student learn "daily-living skills (e.g. intro conversations, grocery shopping, and other day-to-day social activities in the community). The "Real-Life English workbooks" (see Steck-Vaughn Company or amazon.com) has different levels, which uses various community "real-life" settings. Students want to be better "active-participants" in their workplace, home, and community while they are residents of this small rural college community (10,000+ population in the Stevens County area we serve). Thus, teacher too want to be "better" in working with each individual students to achieve their goals to succeed-not only in America, but also when they go back to their "home country". This assessment will encourage all three stakeholders (students, teachers, and program) through these results in various ways of performance.
SETTING
-Students Profile:
Many of our students (Mexico, Brazil, and Eastern Europen bloc nations) in the Morris Literacy Project are "migrant famers" that work in the surrounding area farms doing various work (e.g. dairy, hog, and typical field work). All of our students have been adults (late teens to the elderly); however, we currently have a student is only a teen. They like to come in groups (Spanish, Portuguese, Russian, etc..) , so the language barrier hasn't been a challenge. However, we they don't come in groups, they somehow work cohesively together.
-Audience:
The results would be very helpful for teachers (current and future volunteers) on how to improve our approach in teaching this particular topic ("real-life" skills). This may also help our continued new volunteers (mostly new in-coming college students) more interested in our on-going program (Morris Literacy Project). Knowing "what works" and "what doesn't work" (our approach in teaching) may help retain future volunteer teachers and more importantly-students that come to check out how we do class.
PURPOSE:
The purpose of this assessment is to see how students are improving in the "four-different
area skills" (Listening, Speaking, Reading, & Writing). Thus, to help them feel more confident in applying them in their "real-life" settings (e.g. workplace, home, community, etc..). We hope this would be given as both a summative and fromative placement test to inform both student and teacher.
QUESTIONS:
Several questions will come up from this assessment-1: Are the student's skills helping them in their various workplace, home, and/or commmunity settings? Are we consistently retaining our number of volunteer teachers and students in a certain period of time? Do they demonstrate a degree of progress to move to the next level of books of "Real-Life"?
Week #4
4.1
Terms
Itemized Response Theory:
Wikipedia
Chapter #1-Item Calibration & Ability Assessment, from creative-wisdom (multi-media)
-Guttman Scale, Wikipedia
"..if they can be ranked in some order so that, for a rational respondent, the response pattern can be captured by a single index on that ordered scale. In other words, on a Guttman scale, items are arranged in an order so that an individual who agrees with a particular item also agrees with items of lower rank-order. For example, a series of items could be (1) "I am willing to be near ice cream"; (2) "I am willing to smell ice cream"; (3) "I am willing to eat ice cream"; and (4) "I love to eat ice cream". Agreement with any one item implies agreement with the lower-order items..."
=>Multiple Choice Tests: Test scoring and analysis
Christina Ballantyne (tlc.murdoch.edu.au)
tentative student proficiency-portion of correct answers for each student
tentative item difficulty-the pass rate for each item
convergeance-"there will be some differences int he model.."
Chapter #2-Item Characteristic Curve-"the relationships between the probabilities and the skill levels"
Related Sites:
Assessment Issues:
"..The ICC is a plot of Probability that the Item will be answered correctly against Ability. The shape of the ICC reflects the influence of the three factors:
* Increasing the difficulty of an item causes the curve to shift right - as candidates need to be more able to have the same chance of passing.
* Increasing the discrimination of an item causes the gradient of the curve to increase. Candidates below a given ability are less likely to answer correctly, whilst candidates above a given ability are more likely to answer correctly.
* Increasing the chance raises the baseline of the curve.
This simple simulation allows the user to investigate the factors governing the shape of the Item Characteristic Curve. All three well known IRT models are represented (referred to as IRT1, IRT2 and IRT3) and Item Characteristic Curves can be super-imposed on one another to see how they relate.''
IRT Model (ony 1 parameter=item selection: A&B-2 Parameter; A, B, & G-3 Parameter=diagnosis parameter=2/3 parameters)
"B parameter"/threshold parameter-item difficulty
"A parameter"-item discrimination
"G parameter"-item guessing
Lord's Paradox-a phenomenon of the probability of giving to the correct answer to the blue is .55 and red item is more difficult
intercept
Chapter #3-Item Information Function
item information function-"mathematical way to know how much info ICC provides, the more precision you have in estimating a parameter->the more you know of the value of that parameter
Item information function of the 2PL model, from metheval.uni-jena.de
Chapter #4-Test Information Function
test information function-identifying information for entire exam
alternate forms-having different items with different levels
form balancing-swapping items (e.g. easy, difficult, and hard) between the item pool and the forms is needed
Chapter #5-Item Person Map
odd ratio-ratio of non-desired events (Q)/divided by desired events (P) or Q/P
logit-log (odd ratio)
easy if below zero (e.g. -.7)
item person map-item (test question itself-easy, average, difficult) and person (item proficiency value) can be viewed simultaneously
Q: odd of 1/6 passing?
A: .20
Chapter #6-Misfit
misfit-about the response pattern (e.g. European history question accidentally added in a "American History Exam"); when there is a huge discrepancy (model and data don't fit with each other)
residuals-discrepancies
Chi-square fit-a statistical approach for the most common test of goodnesss (correct & incorrect; categorized by the skill level
O-E 2 (squared)/ E [O-observed, E-expected]
#2 Approaches to detect misfit:
1-graphical method-model and data are superimposed
2-Chi-square test of goodness of fit-chi square divided by the degrees of freedom
Q: In regression we obtain a systematic pattern of residuals
A: F
Residual Plots and Regression Assumptions, from stat.tamu.edu
Degree of Freedom
""Degree of freedom" (df) is an "intimate stranger" to statistics students. Every quantitative-based research paper requires reporting of degrees of freedom associated with the test results such as "F(df1, df2)," yet very few people understand why it is essential to do so. Although the concept "degree of freedom" is taught in introductory statistics classes, many students learn the literal definition of this term rather than its deeper meaning. Failure to understand "degrees of freedom" has a side effect: Students and inexperienced researchers mis-interpret a "perfect-fitted" model or an "over-fitted" model as a good model. To rectify this situation, two approaches of illustrating df in terms of sample size and dimensionality are recommended. To enhance the pedagogy of this concept, a multimedia program written in Macromedia Flash (Macromedia, 2001) was developed. In the program, different aspects of the above concepts are visualized and interactive features (questions and answers) are added to encourage the learners to think about the deeper meanings of df."
df=n (# of obeservations)-r (# of necessary relationships)
Computer Adaptive Testing:
Related Sites:
What is Item Response Theory, IRT? A Tentative Taxonomy, from rasch.org
Wikipedia
"..is a method for administering tests that adapts to the examinee's ability level. For this reason, it has also been called tailored testing."
Computer Adaptive Testing Tutorial, from edres.org
"Here you will have the opportunity to learn the logic of CAT and see the calculations that go on behind the scenes. You can play with an actual CAT. We provide the items and the correct answers. You can try different scenarios and see what happens. You can pretend you are a high ability, average or low ability examinee. You can intentionally miss easy items. You can get items right that should be very hard for you..."
Samples:
Computer Adaptive Test Prep," ..your resource for the computer adaptive exam software otherwise only available through full-service GRE® Test, GMAT® Exam, and Patent Bar Test Prep service providers."
Computer-Adaptive Test (CAT) for GRE, from syvum.com
"decides according to your response what question to present you next. It covers the same content and uses the same types of questions as the paper-based test..."
4.2
Adult Education and Literacy
"..promotes programs that help American adults get the basic skills they need to be productive workers, family members, and citizens."
Pro Literacy
"..is a nonprofit international literacy organization based in Syracuse, NY, that was formed by the 2002 merger of Laubach Literacy International and Literacy Volunteers of America, Inc. ProLiteracy Worldwide is the oldest and largest nongovernmental literacy organization in the world. It achieves its mission, championing the life-changing benefits of literacy for adults and their families, by sponsoring educational programs that help adults and their families acquire the literacy practices and skills they need to function more effectively in their daily lives."
History of the NRS
"..began in the 1990s, during the trend towards greater accountability for educational and employment programs. In 1993, through the Government Performance and Review Act (GPRA), all Federal agencies were required to develop strategic plans to demonstrate that they are reaching their goals..."
Getting Started
"..Below you will find the answers to many questions you may have about this website.."
-"Q. What online training does NRS offer?
A. We have designed online training courses for adult education program administrators that explain NRS requirements and provide guidance on how to improve the quality of NRS data collection. Courses are available on NRS-related topics including:"
-"Q. What is an NRS Webinar?
A. NRS Webinars are an opportunity for state directors of adult education and their staff to attend regular online training courses on topics pertinent to NRS including the revised NRS Implementation Guidelines and planning for a successful data system. These training courses are announced on the home page of the Web site and listed under the Calendar. Additionally, state directors are invited via e-mail several weeks prior to each event. All NRS Webinar materials are available on NRSWeb. "
History
"A collaborative effort was undertaken to develop the NRS during 1997-1998. After the passage of Workforce Investment Act (WIA), NRS became mandatory. Implementation of the NRS began in 1999 and continued to be refined over the next couple of years. In 2000, states began submitting their student and program outcomes to DAEL. Current NRS activities focus on perfecting the system to demonstrate program effectiveness and improve student outcomes."
Q&A
-" Q. Where can I learn more about how the implementation of NRS works, such as how to improve the quality of my program data?
A. The NRSWeb site offers a variety of materials and resources to help program administrators and state staff learn about the NRS requirements and how to use data, improve data quality, monitor programs, and build a better data system. You can find these resources in two ways: 1) by topic through the main menu bar to find guides, training materials, webinars, and online training courses for specific subject matter, or 2) by visiting the Training & Activities page for specific types of resources. "
Office of Adult and Vocational Education (OVAE)
NRSweb
ONLINE DISCUSSION:
"My (Morris, MN) Experience with NRS Testing"
Experiecnce: "To be honest, I never really looked very closely on the results NRS gives to us. As a volunteer teacher, I never had an interest as long as we have been getting good numbers. I let my "boss/supervisor" deal with the results and he gives me his input. After taking this class, I'll probably start looking at the scores more closely now.
Program: "We use the CASAS, which we have trouble-challenges with the "pre-testing within the first 12 hours". Awhile back, when we met students for the first time, we would give them the CASAS test and they seem to be overwhelmed. Now, we decided to just wait after the first class visit or another better time (within the "first" 12 hours). That first day of class makes a huge impression on what this class will be like. Thus, this will impact their decision to return or not! Unfortunately, we sometimes never see (due to many factors-migrant work is "unpredictable") these students again and would never get the chance to give them the CASAS. At other situations, we do give them the CASAS and they would never come back (after 3-5+ more class days) and we will never have the chance to give the "post-test" to see how or if they ever improved to another level.
Teachers: "We (teachers) usually get our results in our regular regional meeting in Alexandria, Minnesota. I have been able to witness different reactions of various teachers from different sites (e.g. Melrose, Alexandria, etc..). The teachers share their perspective, which we all learn from each other. We get "lectured" at times to do "better" and "encouraged" to keep doing what we are doing too!
Students: "We really don't share the results from the NRS with our students, which I don't think they have an interest. However, they are more interested on how they do in the CASAS (pre and post testing)! They (us-teachers too!) get very excited when the move up in the "6 ESL levels"!"
Accountability: "Yes, the accountability or test results from NRS testing has motivated us teachers to do better for many personal reasons (e.g. keep the program going in this area for this county-wide community , job security for "paid-teachers", and want to see our local program and the region overall we represent to do well). It keeps us on our toes!
Comment on others' posts: "No, I don't think students are persisting long enough to take the post test for many reasons as I stated some above. For our area, we have many migrant workers that keep occupied with their work. Most of our students in the past have been dairy farm workers and they work long hours. They are sometimes too tired to drive to our class site after a long day (especially the long Minnesota winters). There are other "work labor" issues that I don't want to get into in this discussion! A positive example, we had a student (05'-06') from Bulgaria that worked 6 days a week. I was surprised he was able to attend our classes regularly because of his personal motivation (and other factors that I can go too long with) to learn English before going back to his own country (became part of the European Union (EU) the past year.
We can help our students achieve their goals by "improving", "improving, "improving", etc.. I don't think we'll ever reach perfection, but know that we can keep "changing for the better" (unless-"don't fixed it if it's not broken"-your particular program is doing well). Each student has their own personal goals, so I can't speak for all. In general in this area (rural farming community), many of our students are "migrant workers". Thus, they will be returning back home to use or not use what they learn. They come to learn ESL to just get enough to get through the time they are working here. Being in the "middle of no where", they come to "socialize" and "learn about the community" they'll be living in the time period they'll be working. However, we just got a new student (from Brazil) that is 14 years old and wants (parents encouraging him too) to learn ESL because he is going to become an American citizen. He doesn't want to be an "outcast" when he enters high school this fall.
Our GED students are the ones that we usually focus on helping get employment and other various goals."
Author: L, Danielle
Subject: Importance of NRS
"
Even with what we view as negative consequences to having to test, I definitely believe we need national standards like NRS. Otherwise, each of us would just teach what we think is important without much data to back us up. As I mentioned elsewhere, I did teach without guidance for a semester, and, although I knew we were making progress, sometimes I felt like I was going around in circles. "
Author: H, Gregory
Subject: New kid on the block
"
Hello Group 1 and all classmates,
I have to admit that I feel a little bit out of the loop with the current conversation. This is mostly new material for me. It is very interesting to hear of your experiences and opinions about NRS and these standardized testing programs. I have been living in El Salvador for the past 5 years and do not have any experience with what you are going through in the States.
In El Salvador, curriculum design is in vogue and the various English teaching institutions are left to themselves to set their own standards. We deal with some Cambridge tests in the bi-lingual high school where I teach. We also prepare more advanced ELL's to take the TOEFL test. We are starting to offer test preparation for the SAT test as well.
I would just like to make a few comments about my observations of the NRS functioning levels. There are 6 levels with the beginning level practically 0 proficiency in the target language. The next two levels are so close to each other, they are nearly indistinguishable. Low and High Intermediate in my humble opinion does not seem realistic to me. Why am I reacting in this way, I ask myself? To quote a descriptor phrase from the High Intermediate level functioning level under Listening and Speaking; "Can communicate basic survival needs with some help." At the High Intermediate level of a second language I thought this would be taken for granted. What I am getting at here is the arbitrariness of these functioning levels. Since I am not familiar with the assessment tests given to arrive at these levels I will shut my mouth. It just seems difficult to me to put human beings into these neat pre-packaged boxes with different bench mark code names.
4.3
Adults with Basic and Below Basic Literacy Levels:
Findings from NAAL and Implications for Practice
"he National Institute for Literacy hosted this webcast on Tuesday, August 15, 2006 to discuss the results of the National Assessment of Adult Literacy (NAAL) 2003, specifically relating to Americans who tested in the Below Basic and Basic literacy levels....
The panelists included John Strucker, of the National Center for Adult Literacy and Learning, discussed basic skills; and Brian Bosworth, of the consulting firm FutureWorks, discussed implications for workforce programs.
WEBCAST NOTES:
-Sheila
prose
-continuous texts (paragraph)
documents-non-continuous text (graphs, tables, stats, etc..)
quantitative literacy-use numbers embedded in prose and documents
#3 major features:
1. language-bilingual?
2. supplemental assessment
3.oral reading assessment
3rd Reason:
67% of jobs created over the next decade required college degree
#4 literacy levels
"..Next, although 13% of American adults spoke only Spanish or another language before starting school, these individuals accounted for 37% of the below basic population. .."
-Dr. John Stucker
".. I would be working with the other teachers and asking questions about how is our outreach? Are we really reaching these folks in our community? How is our teaching approaching going? How we are doing with the young people who are sometimes getting pushed out of high school?..
as Sheida pointed out, you see the highest percentage of people who report learning disabilities as well as other disabilities. And most learning disabilities involve reading even if they aren't labeled as reading disabilities. So these are people who have identified reading problems. Other studies we have done show that there is a higher percentage of their native US born people who have been in special education, have gotten extra help, whose teachers realize they needed extra help, and whose parents realized they needed it.."
And then the other thing about these folks is, again, it's remarkable how fast these ELLs have picked up English conversation. But don't let that fool you. If you actually get a chance to assess their reading skills in English, it turns out they are extremely idiomatic when they talk to you. If one person walked in and joined us with this panel, we think gosh! That person speaks great English but when they went to do embedded literacy tests, such as the NAAL, those tests were hard for them. Let's talk a little bit about the basic people. I think this is an extremely important population we don't want to neglect, because they are close to intermediate. And we know from other studies that intermediate is the point where you see big changes to the positive in income, civic participation, personal reading habits, all the things that embed literacy in our life and make it so meaningful and important...
...If we can identify their specific needs, their needs for fluency and that sort of literate, academic vocabulary, it may be that we can get them into the intermediate level faster and that opens up a lot of opportunities for them in terms of involvement in post-secondary and all the other things that we talked about..
-Brian Bosworth, Future-Works
".. Fourth point I want to make, most employers say they are as concerned or even more concerned about other workplace basic skills as they are about traditional measures of literacy and what do they mean by those other workplace skills? Technology skills or computer literacy is increasingly important to all employers. They identify career management and lifelong learning skills, and by this I mean the navigational skills associated with career management and lifelong learning. The employers identify, thinking critically, acting logically and solving problems as a critical basic skill. They identify finding and using information, knowing when to ask a question, knowing when you don't have the right information to answer that question and knowing where to go to get that information. They identify teamwork as a basic skill. The ability to work in groups with others cooperatively. They identify basic employability skills as well, sometimes called social skills, attendance, timeliness, work ethic, showing up sometimes is the most critical basic skill. Employers do identify math, reading, writing as important deficits in their workplace. But we find, through our survey work that employers are more likely to suggest some of these other basic workplace skills are as important or more important to them than the conventional measures of literacy.."
They would be contextualized in the language and the situations of the workplace. They would use work as a way to develop the workplace basic skills. They would be focused on a credential, a credential, which satisfies entrants to the next level of education and training....
Q&A:
-Brian
'..NCSALL colleague, Steve Reeder out at Portland state working with other researchers developed a way that you can sort of do a synthetic NAALs on your county or community, based on local demographic data. And some of the other characteristics that occur both on the NAAL assessment, and the background questionnaire and that you can also gather about your local population. .."
-Sheila
"...Sheida White: I think vocabulary is an area where ESL adults have particular difficulty in...because unlike native speakers of English who have the listening vocabulary, and so their problem is essentially decoding the word and then they can map their overall vocabulary to the text that is before them. Oftentimes, ESL adults do not have the overall vocabulary, so they have to learn that in addition to learning the...how to read the words and so it's a little bit more difficult for them to do that. So I would say vocabulary is a particular area where ESL adults can benefit from. .."
National Assessment of Adult Literacy
"By comparing results from 1992 and 2003, NAAL provides the first indicator in a decade of the nation's progress in adult literacy. NAAL also provides information on adults' literacy performance and related background characteristics to researchers, practitioners, policymakers, and the general public."
-Brian
Q: "We marketed these classes through various venues, but attendance has been too low to continue the classes. Any suggestions on what they can do?
A: Brian Bosworth: "Well, I think my quick suggestion and I hate to sound like a broken record is that offering the instruction in the workplace in partnership with the employer where most of these folks are for 40 hours a week, isn't a bad strategy in terms of securing participation and securing attendance."
National Institute of Literacy, discussion list
-Literacy in War and Peace
Tom Sticht's Work in Adult Literacy Education
By Thomas Sticht (from nald.ca)
"...In the winter of 1967 I got a call from a friend working in Monterey, California at a field unit of the Human Resources Research Office (HumRRO) of the George Washington University. He told me the military was joining President Johnson's War on Poverty by inducting 100, 000 new recruits a year who were low in reading ability. HumRRO thought it might be necessary to do more teaching by "show and tell" so that the New Standards Personnel (NSP, as they were called) could learn by listening rather than by reading. With my background working with blind students, HumRRO thought I would be a good person to do research on listening and reading abilities of the NSP..."
..In 1992, I was invited to London by what is now the Basic Skills Agency (BSA), to speak at its national conference. Princess Anne, the Patron of the BSA attended and sat through my presentation which emphasized the importance of getting "double duty dollars" through the intergenerational transfer of literacy from parents to their children. Newspapers reported that the Princess Royal found these ideas quite sensible. I worked with the BSA for several years on their national family literacy program.
In 1994 I was honored by being elected to the Reading Hall of Fame and in 1997 Timothy Shanahan and Susan Neuman, writing in the Reading Research Quarterly , selected the work colleagues and I did in the late 1960s and early 1970s at HumRRO as one of the thirteen most influential lines of literacy research since 1961.
Reach Higher, America: Overcoming Crisis in the U.S. Workforce...
Related Sites:
pdf format
Foreward
"..Lee Kwan Yew, the first Prime Minister of Singapore and architect of its economic miracle over the past half century,
once reasoned that a healthy economy depended on four fundamental requirements: dependable electrical power,
clean potable water, world-class transportation, and an educated and trained workforce. In our current global
economy, we can no longer take these precepts for granted."
....., from national commission adult literacy
".. calls for a dramatically revamped service system with the
capacity to effectively serve 20 million adults annually by the year 2020..
Section A. FACING THE PRESENT
2. Our Treacherous Path:
...Americans should have been stunned when the National Assessment of Adult Literacy (NAAL),
released in 2005, revealed that a staggering 30 million American adults scored at “below
basic”—meaning they could perform no more than the most rudimentary literacy tasks.
Another 63 million adults could perform only simple, basic everyday literacy activities.
The NAAL findings are ominous because most good jobs require at least some education beyond
high school. The NAAL found that of the approximately 222 million adults aged 16 or older living
in households or prisons in the United States, some 93 million lack literacy at a level needed to
enroll in the postsecondary education or job training that current and future jobs require...
[International Scale of Top H.S. Graduates-US #11; Korea #1]
3. The New Landscape
[National Demographic of States with College Diplomas-Texas #1]
This is a tall order, but it is what the U.S. job market demands. The Bureau of Labor Statistics
forecasts that between 2004 and 2014, 24 of the 30 fastest growing occupations will require
workers with postsecondary education or training...
Lack of Teacher Certification and Benefits. (pp 13)
High-quality instruction is essential to
foster student retention and produce successful outcomes. But salaries for adult educators
are low, and many positions lack benefits. This limits the pool of potential instructors and leads
to high turnover.
While many adult education instructors are credentialed for K–12 teaching, few states require
that adult education teachers show mastery of the specialized knowledge and skills needed to
teach adults. Moreover, states have not established certification systems to ensure that all
instructors have this ability.Worse yet, the majority of adult educators nationwide are working
part-time, often as a second job. This is another clear sign that adult education is not considered
a serious professional enterprise.
-David J. Rosen
"World Education, Boston, MA
International consultant in non-formal education to the government of the Philippines. Sponsored by the Asian Development Bank and the Royal Norwegian Government. Helped to develop national adult education system including elementary and secondary equivalency certificate alternatives. 2/95-3/95; 11/96-12/96; 6/97-7/97; 1/98-2/98; 7/98; and 2/2000.Project awarded UNESCO Literacy Project of the Year recognition in 2000. Quality Assurance Consultant on performance-based assessment, curriculum development, and teacher training, for school-based and out of school youth Integrated Production and Pest Management farmer field school project. July, 2004 - April 2005.
Consultant to the World Education Literacy Division on developing an intermediary role in Boston-based workforce development, 2004; conduct of scan of national adult literacy organizations for the National Institute for Literacy, April, 2005 - present; development of state Professional Development Distance Learning plan for the System for Adult Basic Education Support (SABES), November, 2005 - April, 2006.
ONLINE DISCUSSION:
Sal
"After watching (webcast form) and then reading the transcript (document form) of this report, I'm remembered as a child of easily understanding picture stories (e.g. quantitative literacy-71 million/32 % performed) instead of just words (e.g. prose-60 million) & (e.g. documents-48 million). Despite their data results concluding the adults did more poorly in the quantitative (using numbers) section, I feel math or working with numbers is complicated for certain groups.
The high percentage of only one language (Spanish) population that did poorly wasn't a surprise from my experience working with many migrant workers from Mexico. Also, the same goes with the population that didn't finish high school because of my experience with GED students in our ABE program.
The "Adults Just Don't Know How Stupid They Are" discussion was somewhat interesting, which I thought giving an international scope brought some enlightenment. The data on the importance of "apprenticeships" shows the importance of early programs (e.g. "OJT"-On the Job Training; Internships, etc..) in high school age level at least to college age level. I feel there is that different level of "pride" on every individual that we already know everything that we need to know on various subjects or skills (e.g. driving).
I feel it's the responsibility of "all of us" (researchers and adults themselves) on determining who is right about whose literacy levels pose a "problem". Instead of "criticizing" each other, we need to "work together" as a global society in literacy. Ever heard of the quote, "What we got here is failure to communicate" (used in a rock song from one of my favorite 80's band-Guns N' Roses)? I feel as ESL teachers or potential ESL teachers, we have a BIG task-role in teaching literacy to the "world". Thus, bridging different nations of different cultures of many ethnic backgrounds. Literacy is one way to communicate to each other, which can solve many of our social problems (e.g. wars that are somewhat caused by "misunderstanding" of one another)
An example of "working together" that's in the process in the area I'm teaching ESL as a volunteer is with our local dairy farm. They employ over 250+ "Spanish-speaking" migrant workers (many from Mexico) and they took the initiative (after hearing from the community) to start a "Spanish language" program for their "English-speaking" American workers. The guy that teaches "Basic Spanish" to them just moved up here from the Twin Cities the beginning of this year. Ironically, his wife was an ELL/ESL student in our program. That is how we found about this! They plan to hire a teacher to teach "English" to their "Spanish-speaking" migrant workers down the road.
Lastly, I would like to comment on the "Ohio Portfolio System", which I feel is a very good idea. I've learned somewhat about doing portfolios for our ELL/ESL students in the past. However, it is sometimes hard to do one when we are challenged with "retention" rates of our students. This is another "on-going" subject to discuss about later!
""arbitrary" literacy skills"-"hand on learning experience ( Newgate school here in Minneapolis)-Todd W.
' Algebra skills may not be part of my everyday life but I think the ability to 'think' mathmatically enhances brain function and math is a language..."-Jeannie M.
"...http://wdr.doleta.gov/SCANS/.... I believe SCANS was mentioned in one of our readings. At the workforce center, I had to make sure my students were achieving these skills. I developed a curriculum I called SCANS QUEST to help the supervisors and our student workers engage in tasks that supported SCANS.-Joyce
"In 1990, the Secretary of Labor appointed a commission to determine the skills our young people need to succeed in the world of work. The commission's fundamental purpose was to encourage a high-performance economy characterized by high-skill, high-wage employment. Although the commission completed its work in 1992, its findings and recommendations continue to be a valuable source of information for individuals and organizations involved in education and workforce development."
4.4 CASAS
CASAS Competencies
" They cover nine broad content areas:three-circles
* Basic Communication
* Community Resources
* Consumer Economics
* Health
* Employment
* Government and Law
* Math
* Learning and Thinking Skills
* Independent Living
Competencies, pdf format (print!)
"The CASAS Competencies identify more than 360 essential life skills that youth and adults need to be
functionally competent members of their community, their family, and the workforce..."
Content Standards
"Since its inception, CASAS has focused on teaching and assessing basic skills in contexts that are relevant and important to adult learners. CASAS has developed and continues to refine a highly formalized hierarchy of competencies, the application of basic skills that adults need to be fully functional and productive members of society."
Quick Search=> (click here for login)
"QuickSearch Online has many exciting features. A bold, colorful interface helps users to select from a wide variety of library searches. This year's edition allows "quick searches" for instructional materials by:
* Skill area: reading, math, listening, writing, speaking, critical thinking, and government and history
* A wide range of programs (including Civics, Family Literacy, Workplace)
* Specific CASAS assessment series, i.e. Life and Work, WLS, ECS
* Year of publication of instructional materials
QuickSearch Online can also search for materials that include informal classroom assessments.
The QuickSearch Online Tutorial is now available from the main menu, making it easier to learn how to use QuickSearch Online and to train new staff. Also, several reference tools appear in the Reports and Other Tools menu; for example, test competencies for every CASAS pre- and post-test. "
Tests Overview
Life Skills
"to help identify the basic skills in reading, math, and listening needed by individuals to function successfully in today's society. Using CASAS assessment instruments, agencies assess learners as they enter a program, place them into appropriate educational programs, assess learning gains, and certify attainment of learner and program goals. CASAS integrates valid and reliable assessment with a curriculum and instructional management system."
Life & Work
"s new and is different from other CASAS series in that it was developed based on the California ESL Model Standards, which are organized according to basic skills objectives, or "skills sets." All items in the Life and Work reading tests are correlated to reading objectives, as well as to life and work competencies. This correlation allows for analysis of the results with respect to reading skills sets, as well as in relation to language use contexts (competencies). Assessment items in the new series focus on short narrative passages at all levels, and follow immigrant families as they go about their life and work. Listening and Grammar tests for the Life and Work Series are currently in development."
Workplace Speaking
"s a new standardized speaking assessment, which assesses the speaking skills of adult ESL learners in a workplace context. It helps adult education programs determine to what extent an individual learner has the speaking skills to succeed on the job. Competencies assessed are clustered in three areas:
1. job skills and job information
2. social language
3. workplace safety and customer service"
Functional Writing
"provides teachers of ESL (English as a second language), ABE (adult basic education), and ASE (adult secondary education) at the adult or high school level with a means of assessing their students' writing skills in a functional, workplace, employability, and life skills context. It is adapted from the work of the CASAS Writing and High School Committees."
ECS, WLS, Life and Work and Life Skills forms
CASAS FAQ
ONLINE DISUCSSION:
..http://www.npr.org/templates/story/story.php?storyId=92611633
http://www.npr.org/templates/story/story.php?storyId=92663976.."-from "Elizabeth" W.
How Merit Pay Played Out In A Colo. School District by Larry Abramson
"Teachers have long resisted this idea, but they are gradually warming up to it. Still, many questions surround the issue of merit pay, including whether teachers can be punished for the challenges facing students — especially if those students are low-income, learning disabled, or struggling to speak English..."
D.C. School District Proposes Merit Pay For Teachers
"Forum: Other Module 4 Issues
Date: Wed Jul 23 2008 19:31
Author: Wagner, Todd
Subject: Re: Criterion referenced standardized tests?
Ken, I think the CASAS folks consider their assessments to be criterion referenced--they are designed to measure the CASAS competencies and the results are almost always evaluated against a criterion, e.g. did the student score above the cut score we've established to move up to the next level class? CASAS results are rarely used to compare students to each other and students individual scores are not typically interpreted in relationship to other student's scores.
The classic indicator of norm-referencing is reporting scores in percentile ranks. This approach is still very common, for example even in state testing systems that have established performance levels as criteria--in part because the public demand percentile ranks. (It's sort of like the case with NAAL performance levels. Parents don't really trust states to determine what constitutes a "proficient" or "advanced" level and have trouble identifying with results reported in that way. If they know the percentile rank, however, they at least know where their kids stands relative to other kids--critics would say even they still have no idea how those kids do overall--e.g. 90th percentile in Mississippi vs. Maine. (This reminds me that the National Assessment of Educational Progress NAEP and many/most state K-12 test, e.g. the MCAs in Minnesota have struggled with doing the same thing the NAAL has arguably done. They set high standards for literacy/achievement, the need for which they believe they can justify, and attempt to "push" eduational systems to empower students to meet those higher standards. They then run into a face validity problem with the public when large numbers of students, as with the NAAL, are identified as "below proficient." I was part of a standard, i.e. cut score setting, process for the first round of MCAs in Minnesota. The standard-setting process used was a modified Angoff procedure in which judges, in this case teaches, are shown items and asked to judge the probability that a "competent student" would get each right. Cut scores are then derived from those judgments across all judges and all items. When state agency staff and other policy advisors learned that the Angoff procedure had yielded a cut score the classified far more students as proficient than the most recent NAEP testing in Minnesota had, they threw out the Angoff results and simply chose a cut score that yielded the same proportion of proficient students on the state test as were identified by the NAEP test.
"
-Todd W. (professor/teacher)
Subject: Re: NRS - CASAS
" I still feel assesssment exams are a 'necessary evil'. I wonder if the students described who wouldn't return if they knew they were being tested, don't actually have other issues, like test anxiety. In that respect, 'setting the stage' for testing (ie practice exams, talking about the exam, and giving feedback on the practice) might help diminish fears a bit. One of our faculty has started a support group and counseling for students with test anxiety. Students who enroll in her program show improved test taking skills and test scores. I suppose such a group isn't really feasible within most ESL programs- although involvement in an informal 'support group' might ultimately improve language skills as well."-Jeanne M.
HOW BCSP SETS EXAMINATION PASSING SCORE
"...The most commonly used procedure for setting passing scores on professional certification examinations is the Angoff procedure. Dr. Angoff was a psychometrician (a specialist in testing and measurement of knowledge, skills and abilities) who developed a method that is now widely accepted for certification examinations. Such examinations must have a clearly defined cut score or passing score.
Just like an instrument reading determines whether someone can enter a confined space without protection, candidates who meet or exceed the passing score on a professional examination pass the examination.
To establish the cut score, each member of a panel of experts must review independently every question on the examination and rate them. The rating for each question is "what portion of those candidates who just barely qualify for the examination will know the answer?" The ratings do not include those who are more than qualified through years of experience or advanced studies. In essence, the mean of all ratings of all panelists across all questions becomes the recommended passing score."
4.5 Bests Tests
Mainstream English Language Training (MELT) project, of the Office of Refugee Resettlement (eric.ed.gov)
".. The primary project goal was to link overseas refugee camp training to refugee programs in the United States to facilitate refugees' English language learning and transition to employment and independence in the United States. REEP was selected to field test and refine three products and components of the project, including: (1) field testing of three versions of the Basic English Skills Tests (BEST) designed to measure general language ability; (2) testing and refinement of standardized Student Performance Levels, general descriptions by level of students' language ability; and (3) elaboration and specification of a set of curriculum guidelines entitled the "CORE Curriculum Guide," to be used as a reference guide for the development of U. S. based curricula. REEP benefitted from these tasks in the improvement of its methods and in the testing and achievement of its own goals of promoting refugee self-confidence, employment, and independence in the United States. (MSE)..
-1. CAL :Topics: Testing/Assessment
'...CAL uses current best practices in test development to create its assessments, which are designed to help programs evaluate students and make instructional decisions. CAL’s assessments use authentic language and communicative tasks that allow examinees to demonstrate what they know and can do in English..."
Student Performance Levels (SPL's)-"descriptions of English language proficiency levels for adult non-native speakers of English, were developed so that teachers in refugee camps in southeast Asia and those in ESL programs in the U.S. could communicate effectively about adult students' Enlgish skills.
Center for Applied Linguistics
-2. BEST Plus, developed by the Center for Applied Linguistics (CAL)
"... is an individually administered face-to-face adaptive oral interview designed to assess the English language proficiency of adult English language learners.."
-3. FAQ
=>Online Activation
-4. Best Literacy, from cal
'...tests reading and writing skills in authentic situations specifically geared for adult English language learners in the United States. BEST Literacy can be administered individually or to groups of examinees in one hour or less..."
-5. FAQ
"..Is training required to administer BEST Literacy?
The Center for Applied Linguistics does not require training to administer BEST Literacy. However state agencies, program administrators, test administrators, and test scorers should thoroughly review the BEST Literacy Test Manual - 2008 Edition in order to become familiar with proper testing and scoring procedures.
4.6 Accuplacer
Accuplacer-Introduction for Students
" The purpose of ACCUPLACER tests is to provide you with useful information about your academic skills in math, English, and reading. The results of the assessment, in conjunction with your academic background, goals, and interests, are used by academic advisors and counselors to determine your course selection.
You can not "pass" or "fail" the placement tests, but it is very important that you do your very best on these tests so that you will have an accurate measure of your academic skills..."
COMPASS® computer-adaptive college placement test helps educators (former ACT):
" * quickly evaluate incoming students' skill levels in Reading, Writing Skills, Writing Essay, Math, and ESL
* place students in appropriate courses
* connect them to the resources they need to achieve academic success
Accuplacer ESL-MSCU
Association of Classroom Teacher Testers-Combined English Language Skills Assessment)
"combines three formats of cloze into one: rational, fixed ratio, and multiple choice. Either of the two forms of the test will place students into seven levels of proficiency from low begining to advanced plus. Each 45 minute test has 75 items with 4 choices. Reliabilities for C.E.L.S.A. 1 and C.E.L.S.A. 2 are .93,.94 and the forms correlate .90...
Each form has one beginning, one intermediate and one advanced passage which were developed as follows: First, for each passage rational choices were selected based on years of research developing the English-second-language Placement tests which conformed to curriculum and texts used in seven levels of English-second-language proficiency. Second, after these high discriminators were chosen for blanks, the text was written around the choices in a fixed ratio format. The resulting passages were given to experts to determine if the language was natural, authentic and representative of materials used at each level. Third, students took the fixed ratio blank passages. From their errors, distractors were chosen for a multiple choice format. Fourth, other students took the multiple choice tests; after analysis of the items, the poor items were removed. Originally, six forms of E.L.S.A. were published..."
Yuba Community College District
DISCUSSION BOARD
Subject: Re: Criterion referenced standardized tests?
"
Joyce, to date I think two things have been driving the cut-score setting processes in programs as diverse as NAEP, the NAAL and state testing programs. The first is the belief/perception that Americans are under-skilled or not enough Americans are highly skilled and that that represents a threat to the countries competetiveness. The second is the idea that setting high standards will drive systems to change. So far, however, that doesn't seem to be happening, e.g. funding for ABE has not increased, k-12 achievement has not increased, etc. So one criticism as with the NAAL is that the bar has been set so high it makes the results appear invalid and similar critiques have been made regarding NAEP and state testing programs. The belief is strong though. A state senator once said in response to our state director of ABE (Barry Shaffer) when he had testified that raising the cut scores for graduation tests would result in more students needing ABE services: "Mr. Shaffer you obviously don't understand the point of high standards. When we have high standards, everyone will graduate."-Todd
Integrative Paper #1 – Due July 25th
FINAL DRAFT
salsESL6638AssessmentIntegrativePaper1.doc
Submit a 1 – 2 page paper in which you reflect upon your personal experience with assessing adult second language skills in light of the concepts from the class. What assessment methods and techniques do you use in your program and classroom? How can some of the concepts discussed in class and in the readings help you understand what you are already doing or what you could do in the area of adult second language assessment? Integrate the following into your reflective statement:
The assessment methods and techniques we use in our program and classroom are in various forms. For standardized testing, we use the CASAS (for ELL, ESL, ESOL students) and TABE (for GED students).
1. Why and when are your students assessed?
As I stated in the "Discussion Board" (Module 4.2), we usually assess them within the first 12 hours of their class visit...
"We use the CASAS, which we have trouble-challenges with the "pre-testing within the first 12 hours". Awhile back, when we met students for the first time, we would give them the CASAS test and they seem to be overwhelmed. Now, we decided to just wait after the first class visit or another better time (within the "first" 12 hours). That first day of class makes a huge impression on what this class will be like. Thus, this will impact their decision to return or not! Unfortunately, we sometimes never see (due to many factors-migrant work is "unpredictable") these students again and would never get the chance to give them the CASAS. At other situations, we do give them the CASAS and they would never come back (after 3-5+ more class days) and we will never have the chance to give the "post-test" to see how or if they ever improved to another level."
The purpose of these test as a "local" literacy program is for our "overseers" within the state...
"..Accountability: "Yes, the accountability or test results from NRS testing has motivated us teachers to do better for many personal reasons (e.g. keep the program going in this area for this county-wide community , job security for "paid-teachers", and want to see our local program and the region overall we represent to do well). It keeps us on our toes! .."
For our GED students, we assess them in a different way or standard...
"Our GED students are the ones that we usually focus on helping get employment and other various goals."
Overall our ELL/ELL/ESOL students are assessed differently right from the start compared to our GED students. We have them fill a "goal" sheet, which our "local program" gets credit if they do reach their goal. For example, the common goal listed for our ESOL students is "Learn English". For our GED students, it's getting their "GED Diploma" and/or "getting employment". Then throughout the course of the term or year, we write their individual goals and accomplishments each class session they participate on a separate goal sheet. These are some of the assessment tools and techniques we have been trained by the "state" for our "local" ABE program. The purpose of this is to show their "progress" and to show the individual learner/student what he/she has done so far. Therefore, to hopefully motivate them to continue to strive to reach that goal we both came to agreement in the beginning of the term/year. This informal form of assessment helps us "test-givers" or "teachers" to keep record on what we've done and what to do to help this invidual learner "reach their goal(s)". Also, to serve as data for the state in case we get "evaluated" for funding purposes. Along with this information, we have all types of other records (e.g. homework) for the individual students in a "portfolio" type file folder.
2. What assessment tools and techniques have you used? What was your purpose in using them?
(see above)
3. How do the tools and techniques you have used relate to the desired outcomes of your course or program? How would you describe the actual outcome?
The tools and techniques we have used has met both desired and undesired outcomes of our course or program. We "try" to keep improving it as we "make mistakes" and "learn". To first share some positive outcomes, we've been able to help students (GED) get their GED Certificate and employment. We've seen some progess of some of our ESOL students. For example, when we first "restarted" this program back in 2004, we had a good-size group of "migrant workers" from Mexico. One particular student name "Tulio" was one of the most ambitious, which he would come regularly. I remember taking him out to a community setting and introduced him to some people to informally practice his English. He stopped coming to our class one day and we didn't know why. Well, we found out he became manager of a particular dairy farm site! I personally feel what helped him reach this particular position was "researching" on the internet on resources related to his work. He was a dairy farmer and I found this "dairy terminology" site that translated English to Spanish. I would end up sharing this site with future "migrant workers" that worked on this particular field. He still continues to have this position to this day and he ended up inviting his family-relatives to come work in this area.
Some of the "undesired" outcomes are students that never reach their "goal" that they stated in the beginning of the year/term. We have one student name "Manuel" that kept coming regularly and stopped for about a 1/2-1 year. With our local classroom environment, we sometimes get "very busy" and forget or don't have time to "follow-up" with some of these students. Well, Manuel ended up coming back and would come regularly for awhile. However, we didn't see any "progress" based on his conversations with us. We can pin point to many factors that causes this, but we certainly feel that we could've done much better with this individual student. Some of other "undesired" outcomes is uncertain because we get so many students in the beginning, but they never finish or "come back". We will sometimes never know the outcome!
4. How would you characterize the tools and techniques you used in terms of the testing concepts presented in class?
I feel we have been using the tools and techniques already, but we just didn't know "how to use them" the right way all the time. For example, in Module 4.3 when we were assigned to read the results of the National Assessment of Adult Literacy (NAAL) 2003. I was able to related to what Dr. John Stucker stated...
"And then the other thing about these folks is, again, it's remarkable how fast these ELLs have picked up English conversation. But don't let that fool you. If you actually get a chance to assess their reading skills in English, it turns out they are extremely idiomatic when they talk to you. If one person walked in and joined us with this panel, we think gosh! That person speaks great English but when they went to do embedded literacy tests, such as the NAAL, those tests were hard for them.."
I had one student from Bulgaria that quickly lear