Assessment of Struggling Elementary Immersion Learners: The St. Paul Public Schools Model
The ACIE Newsletter, February 2006, Vol. 9, No. 2
By Aline S. Petzold, School Psychologist, St. Paul Public Schools, St. Paul, Minnesota
| Students who struggle in the immersion classroom are typically assessed for possible special education help following the traditional “severe discrepancy” formula, used nationally since the 1970s. That is, the student’s intellectual potential and actual academic functioning are compared on standardized comprehensive tests given in English. The student is found eligible for service if there is a ”severe discrepancy” or a significant gap between his or her current level of achievement and where he or she should be functioning. |  | 
                    In the St. Paul Public Schools (SPPS) district, however, educators 
                    believe that, if a student does not have the same experiences 
                    as those in the norming sample of a standardized test, the 
                    test should not be used to make decisions regarding that student’s 
                    schooling. In such cases, it is unfair and professionally 
                    unsound to use English-medium tests on students in the immersion 
                    classroom who spend their day being instructed in a language 
                    other than English. Instead, SPPS assessment tools are created 
                    from the students’ curriculum (Curriculum Based, or 
                    General Outcome Measures), comparing a student’s progress 
                    to that of peers. 
To create these more program-appropriate norms, entire grade levels of immersion students are tested for reading fluency and math computation once every three years. Results are analyzed to determine a median score and then used to establish immersion specific norms. Students qualify for special education services only if they are at least two times discrepant (i.e., they make twice as many errors as their peers) in both English and the immersion language.This methodology is adapted from the district’s model of assessing English language learners, in use since 1990. The model requires quite a bit of planning and time initially, but we find that it leads to more accurate placement decisions in the end.
                    Research Support for Model
                    Research support for comprehensive and alternative approaches 
                    to assessment such as curriculum-based measurements when dealing 
                    with linguistically and culturally diverse learners is strong 
                    (Baker, Plascencia-Peinado, & Lezcano-Lytle, 1998; Shinn, 
                    1989). The references and resources listed below deal with 
                    Curriculum–Based Measurement (CBM) theory and its use 
                    with native English speakers or English language learners. 
                    The administration and scoring instructions are for English 
                    language materials. At this time, there is no literature that 
                    focuses specifically on the use of CBM materials in the immersion 
                    setting.
                    Implementing the SPPS Assessment Model: A Step-by-Step 
                    Guide
                    The following outlines the steps for implementing the SPPS 
                    Assessment Model for language immersion students. 
                    What do I need?
                    People - to read instructions for the group tests; people 
                    to monitor the groups during test administration; people to 
                    listen to individual readers; people to score the tests. The 
                    more people on hand, the faster the data can be collected 
                    and analyzed.
                    Time - to administer and to score tests.
Materials - including test packets with a math computation task, a written language story starter, a comprehension task, grade level-appropriate oral reading passages and a common reading passage; pencils, stop watches or watches with second hands, administration and scoring directions for helpers.
                    WHAT DO I DO for testing?
                  
                    Before the Test Day
                    Decide the scale of your project. Will you be focusing on 
                    one struggling student in one classroom or several struggling 
                    students across one grade level? Do you want information about 
                    typical rate of progress from grade to grade at one school? 
                    Do you have more than one program district-wide to compare?                  
Schedule blocks of time to gather two sets of norms (one in English and one in the immersion language). Group testing of math and written language takes about 20 minutes; oral reading takes five minutes per child. (Note: Gathering norms is like making a quilt – allot double the amount of time you think you will need in case of last minute glitches!)
Create probe packets for grade level(s) to be tested. Each packet should contain: a front page with space for identifiers (student’s name, grade, school, date) and a scoring table showing each test with one column for the student’s scores and one with the median score of peers for comparison; two math computation pages (one at grade level, one with mixed calculations to sample knowledge of functions over several grades); one written language story starter; four oral reading passages (three at grade level, from basal texts or the equivalent; one common passage at about 4th grade level). These passages should be 150-250 words, long enough so that the average reader cannot finish before the one-minute time limit. Numbering these passages will make scoring easier. (A discussion in support of oral reading fluency measures as a determinant of second language reading ability can be found in Shinn, Good, Knutson, Tilly & Collins, 1992). Samples of the oral reading probe, story starter, cloze passage and math problems can be found in Figure 1, available online.
Upper grade passages should be longer (250-350 words) and packets should also contain a cloze passage, where certain deleted words must be replaced, to test reading comprehension. The cloze procedure or maze task has been used to test reading comprehension in second language learners since the 1970s (see, for example, Oller, J.W., 1973; Shin et al., 2000).1
Create a master packet of all the oral reading passages, both grade-level and common, to be used for one-on-one reading. These do not need to be numbered.
Create instruction and scoring packets for helpers to use as reference guides. In SPPS we have adapted our own version of the scoring system originally designed by Deno and Shinn at the University of Minnesota (see, for example, Shinn, 1989; see also AIMSweb Curriculum Based Measurement online site).
Recruit people to help you on the test day. The more helpers you have the better!
On the Test Day
                  Arrive early! Bring extra timers. Gather all helpers. Make 
                    sure all helpers have been trained beforehand about the process 
                    and have a copy of the instruction/scoring packet. Begin norming 
                    your group(s). If you are working with more than one class, 
                    consider starting with one-on-one reading and ending with 
                    the whole group testing of written language and math, as long 
                  as all students are present for the group test. 
For group testing: Have one person read the instructions and answer questions before the test, have other helpers (up to three) circulate and monitor the group during testing. Collect the packets and keep them in a safe place until scoring time.
For one-on-one reading: Take students, one at a time, to a quiet place to read out loud for you. This is the most time consuming part of collecting the data. The more helpers you have listening to readers, the shorter this phase will be. Have the student read from the appropriate unnumbered passages in your master packet. Clock their reading for one minute. Mark their errors and the end point on the corresponding numbered passage in the student’s packet. Collect the student packets for scoring later. It is a good idea to have a prearranged activity to occupy the students who are waiting to read to you; for example, reading at their desks, drawing, doing an assignment.
                    How Do I Develop Local Norms?
                    Remember, you need to do the norming twice, once in English 
                    and once in the Immersion language.
When Scoring
                  Have copies of the scoring instructions available for all 
                    helpers. Scoring instructions need to include some examples 
                  of errors.
Start scoring the packets soon after they are administered, if you have time. For example, you could test in the morning, break for lunch and re-group to tackle scoring in the afternoon. Input from the helpers is invaluable when scoring, especially when trying to decipher words and phrases in the written language story segments.
Calculate the scores for each test and complete the scoring table in the front of each student packet. A packet typically takes about ten minutes to score. See Figure 2, available online.
                    When Tabulating Data
                    Create a list of scores for each student in each area tested. 
                    Determine the median score and range of scores for each area. 
                    To do this, you will first need to rank order the scores from 
                    lowest to highest. If the total in your sample is an odd number, 
                    then the median is the middle number. If the total in your 
                    sample is an even number, take the average of the two middle 
                    numbers.
Chart these for use as a reference guide (Available online, Figure 3 shows a sample norm grid by grade; Figure 4 shows norms in a multi-grade format). The median score is used to compare your struggling student against immersion peers. In SPPS, immersion students qualify for special education instruction if their error score is at least twice the median in both English and the language of instruction. For struggling students who do not meet the strict 2X discrepant criterion, the range of scores provides a means of deciding whether they are nevertheless at-risk in comparison, or are still developing skills. The range of scores also gives a sense of the level of difficulty of your testing material. That is, if too many scores are high (or low), you may want to adjust your probes accordingly.
                    When and how often DO I Assess?
                    Carry out this norming in the fall and again in the spring 
                    to gather information about learning growth over time. The 
                    data should be updated every three to five years to maintain 
                    usefulness, but until then, you can relax and enjoy the fruits 
                    of these labors.
It takes effort to develop local norms, but the outcome is a more program appropriate evaluation tool since students are compared to their immediate peers and test content is taken directly from their curriculum. We have used this alternative assessment model in SPPS immersion programs since 1993 as one part of a comprehensive learner-centered approach. As a result, we are experiencing greater success in minimizing the likelihood that struggling students are falsely identified as candidates for special education.






