Go back one page Copyright Saskatchewan Education Go to Evergreen Menu Go to Curriculum Menu Go to Discussion Area Go ahead one page

Assessment and Evaluation

Why Consider Assessment and Evaluation?

Much research in education around the world is currently focusing on assessment and evaluation. It has become clear, as more and more research findings accumulate, that a broader range of attributes need to be assessed and evaluated than has been considered in the past. A wide variety of ways of doing this are suggested. Assessment and evaluation are best addressed from the viewpoint of selecting what appears most valid in allowing students to show what they have learned.

In Student Evaluation: A Teacher Handbook (Saskatchewan Education, 1991) the difference between the various forms of evaluation is explained. Student evaluation focuses on the collection and interpretation of data which would indicate student progress. This, in combination with teacher self- evaluation and program evaluation, provides a full evaluation.

Information in Saskatchewan Education, Training, and Employment documents should be used to help develop an overall evaluation plan.

Phases of the Evaluation Process

Evaluation can be viewed as a cyclical process including four phases: preparation, assessment, evaluation, and reflection. The evaluation process involves the teacher as a decision maker throughout all four phases.

  • In the preparation phase, decisions are made which identify what is to be evaluated, the type of evaluation (formative, summative, or diagnostic) to be used, the criteria against which student learning outcomes will be judged, and the most appropriate assessment strategies with which to gather information on student progress. Decisions made during this phase form the basis for planning during the remaining phases.

  • During the assessment phase, identify information-gathering strategies, construct or select instruments, administer them to the student, and collect the information on student learning progress. The identification and elimination of bias (such as gender and culture bias) from the assessment strategies and instruments, and the determination of where, when, and how assessments will be conducted are important considerations .

  • During the evaluation phase, the information gathered during the assessment phase is used to make judgements about student progress. Based on the judgements (evaluations), decisions about student learning programs are made and reported to students, parents, and appropriate school personnel.

  • The reflection phase allows you to ponder the successes and shortfalls of the previous phases. Specifically, evaluate the utility and appropriateness of the assessment strategies used, and make decisions concerning improvements or modifications to subsequent teaching and assessment. Science Program Overview and Connections K- 12 contains questions that encourage reflection on student assessment, your own planning, and on the structure of the curriculum.

    Formative, diagnostic, and summative evaluation processes each involve all four phases. The relationship of the phases is illustrated in Figure 6.

    Assessing Student Progress

    Specific assessment techniques are selected in order to collect information about how well students are achieving objectives. Which assessment techniques are chosen depends on what the teacher wants the students to demonstrate, the capabilities of the students, and on what the students have been doing in class. The environment and culture of the students is also an important consideration.

    Various assessment techniques are listed here for reference. Each teacher must exercise professional judgement in determining which techniques suit the particular purposes of the assessment. No two situations are identical so no two evaluation strategies should be identical. For further information on assessment strategies and on instruments to collect and record information about student learning, refer to the Student Evaluation: A Teacher Handbook (Saskatchewan Education, 1991).

    Following is a list of specific student assessment techniques from the Handbook, page 13 .

    Methods of organization

    Methods of data recording Ongoing student activities Quizzes and tests

    Student Assessment in Science

    At the start of any class, you, the teacher, have a group of new students. The students are new, even if they know each other or know you, because they will be dealing with different material, from a different point of view, within an evolving system of interactions. The factors of scientific literacy and the learning objectives for the curriculum become the criteria by which to assess the students' learning and progress. These may be easily attainable by the majority of students, but some will need extra support to reach their potential. Adaptations to materials or approaches will be required.

    Graded teaching resources and standardized tests are built on what is accepted as normal or average for a student of that age group and often for a specific segment of society. A standardized test assesses how a child matches culturally-biased standards over a narrow range of skills. The results must be considered in that context. This measure may be unattainable by some students. Alternatively, some students may not reach full potential because they are not challenged but are allowed to remain at the "acceptable average". The Adaptive Dimension recognizes that the needs of all students must be considered for effective teaching and learning to occur.

    In assessing the factors of scientific literacy, methods can be established for addressing knowledge (Dimensions A, B, D), values (Dimensions G and F), and abilities (Dimensions C and E) in ways that suit the nature of the factor. See Figure 7.

    The factors of scientific literacy in Dimensions A through E can be assessed through manipulation of factual knowledge. However, it is quite possible to assess only factual knowledge and this is a fault of much current student assessment. When examined, this assessment is often little more than simple recall or limited application of facts. When assessment does go further and appears to include abilities, often too much emphasis is still devoted to straight recall. Students deserve to be assessed on the range of abilities they have been using and developing during instruction. The overall assessment plan should reflect the students' different learning styles, their different ways of displaying their learning, and the nature of the abilities being assessed. Self-referenced assessment may be one aspect to investigate.

    Assessment can be oral, written, practical, or some combination of these. Practical exercises are the best way to assess scientific and technical knowledge and skills (Dimension E). For example, reading a thermometer diagram is not the same as knowing how best to use and place the thermometer in order to measure temperature.

    The best way to assess whether students can perform an activity is to observe them while they are actually performing the activity.

    Ask them probing questions. The use of anecdotal records, observation checklists, and rating scales can assist in data collection as these observations are taking place.

    The types of tasks and questions which students are expected to address influence their responses. When the tasks and questions are limited, so are the responses. Tasks and questions which elicit only one word or simple sentence answers usually test basic recall of factual knowledge. It is very important to consider that once students have, for example, formulated a model in a particular context during a science activity, if that exact same context is given in the assessment, the response is recall, and not a test of any conceptual or process ability. Valid assessment of the ability requires slightly different conditions so the ability is tested through a new set of events.

    Good questioning is extremely important for effective teaching and testing. Avoid using only questions that have a single acceptable response. Structure questions so that some type of reasoning is required. How..., why..., and explain... are stems you can use to create divergent questions. Present problem solving activities. Develop critical and creative thinking. These ways promote and challenge higher level thinking. Ask yourself whether your questions to the students, and the students questions to each other and to you, require reporting knowledge, using knowledge, or creating knowledge. Try to ensure a good mix of question types.

    Ask your students to interpret a graph or photograph, or to answer a question orally. Assessment does not have to consist totally of written work. Varied formats adapt to students' differing learning styles and allow students to show what they have learned.

    Summative assessment items following the completion of a unit can cover more scope and depth than formative assessment items. Apart from the scope and depth of the activities selected, the format of summative assignments can be just as varied, including practical tasks (to reflect practical knowledge and abilities), interpretation of graphs and photographs, and investigative problems and assignments.

    Multiple choice, true or false, or fill-in-the-blank tests usually assess only basic factual recall. Such tests should be used as little as possible and fewer "marks" should be awarded them in comparison with those items that require more complex abilities. Multiple choice questions which test higher levels abilities than recall can be constructed, but the process of construction is long and difficult.

    Essay questions are useful tests of understanding and ability to synthesize and evaluate. They can indicate an understanding of all Dimensions of Scientific Literacy, and can be used in both formative and summative assessment. Students who have difficulty writing may be given the option of alternative forms of communication to express their understanding of the concepts. Illustrations or art projects, an oral report, a concept map, a project, journal writing, or some other challenging activity may serve as innovative alternatives to the written essay. Care must be taken to provide writing opportunities and guidance to improve their writingto students having difficulty with writing. Always keep in mind the importance of assessing understanding of the objectives, whichever format is used.

    Projects are useful items for summative assessment. Students can explore a topic in depth, and use of a range of process abilities. If the project is a group effort, difficulties might arise in assessing the individual participation of each student if the project is a group effort. The contributions and participation of individuals within a group can often be determined by structuring the tasks, assigning roles, and observing the ways in which the group members interact with one another. Using student self-assessment is another avenue to determining individual contributions and participation. The number and type of assignments completed in a learning centre can be recorded as a summative assessment. Assessment stations are particularly useful for allowing students to demonstrate competence.

    Assessing values is an emerging area of assessment and evaluation. At one time, values were not considered a part of the school's written curriculum. Parents and society certainly required that students develop acceptable behaviours and attitudes, but these were promoted through the "hidden curriculum" - the teachers' and school's influences.

    Now, specific attitudes and values are to be openly promoted in students, so the teacher's influence must be directed to these objectives. Accordingly, they must be assessed. For further information on values review Chapter VI in Understanding the Common Essential Learnings: A Handbook for Teachers (Saskatchewan Education, 1988). Both in questioning and in the matter of values, there is a need for knowledge about the influence of culture on communications. Teachers must recognize, be sensitive to, and respect cultural differences. Values are a direct result of culture and as such, the connection between the two may need to be made explicit.

    There are valid reasons to assess students' value and attitude outcomes at school and to attempt to promote these with effective teaching methods and individual student reflection. Since the values listed in Dimensions F and G of the factors of scientific literacy may be developed over time, teachers should be emphasizing many of the same values through the grades, but developing them to higher levels. This cumulative development helps to take students to a point where the level achieved may become a feature of their characters, and may continue to develop further in adult life.

    Figure 7: Including Dimensions of Scientific Literacy in Planning for Assessment

    Key to abbreviations of evaluation techniques:

    ar........anecdotal record
    co........contract
    lr........laboratory report
    oc........observation checklist
    or........oral response
    pa........peer assessment
    pf........portfolio
    pr........project or written report
    pt........performance test
    rs........rating scale
    sa........self assessment
    wt........written test

    An 'x' in a cell indicates a strategy that might be appropriate for assessing that Dimension of Scientific Literacy. The placement of an x in a cell is not definitive. You may not be able to use that technique to assess factors from the Dimension indicated. You may find that a blank cell represents a combination which is appropriate for use in your classroom. The terms for evaluation techniques are taken from Student Evaluation: A Teacher Handbook. Assistance in designing an evaluation plan that uses these techniques can be found in that document.

    Summary of % weight by domains and DSLs:

    Performance-Based Evaluation in Science

    In a curriculum based on activity, it is essential that performance testing be used for a significant portion of student assessment. Performance testing can also be used to gauge the effectiveness and value of particular activities. Skills and abilities that are stressed in the activities should be the ones that are included in the performance assessment. Performance testing should be used when it can test factors or criteria which can not be assessed as well using solely a paper and pencil item. They can be done as individual tests or as group tests. Many activities described in activity books can be used as perfomance tests. If you pick one from an external source, make sure it relates to the objectives you have been stressing during instruction.

    A significant portion of the grade for Middle Level science classes should be derived from performance-based evaluation.

    The following is an example of a performance task. The criteria for assessing the task are included in the student instructions for the task so that the students are aware of the basis for their evaluation. They should be given or develop these criteria at the beginning of a section of study, and told that these criteria would be used to help assess their performance at the end of the unit.

    Instructions for teacher

    Materials (per individual or group)

    Use two distinct samples of soil for this activity. Try to obtain one with a fairly high percentage of clay, and one of sandy soil. Alternatively, a sample of local soil and a sample of bagged potting soil may be used. The critical thing is to get two samples with significant differences. When you travel to another part of the province, you may want to take along some ice cream pails and collect soil samples along the way.

    Instructions to students

    You will be evaluated according to the following criteria:

    1. Dump the contents of each vial onto the corresponding sheet of paper.
    2. Compare the samples. Record your observations in your lab journal. (To compare is to describe all the similarities and differences.)
    3. Place the funnel into the jar or cylinder for support. At the outside edge of one piece of filter paper, use a ballpoint pen to label it Y.
    4. Fold the filter paper marked Y and place it in one funnel. Fold the other piece of filter paper and put it in the other funnel.
    5. Carefully transfer the soil from the sheet of paper marked Y to the filter paper marked Y. Transfer the other soil to the other filter paper.
    6. Fill each vial with water. Add one vial of water to each funnel, bit by bit if necessary to prevent overflow.
    7. When the water has flowed through each system, compare the filtrate in each bottle or cylinder.
    8. Carefully remove the filter paper from each funnel. Place the filter paper labelled Y back on the sheet of paper labelled Y. Place the other filter paper on the other sheet.
    9. Compare the wet soil samples.
    10. Dispose of the soil sample marked Y in the yellow pail. Put the other sample in the other pail.
    11. Answer the following questions. Give your reasons for each answer.
      • Would one of the samples be better for growing crops?
      • Would one of the samples hold rain water better than the other?
      • Which is most like the soil in your region?

    Record-Keeping

    To aid data collection so that the factors of scientific literacy are addressed in student assessment, checklists have been included in the Science Program Overview and Connections K-12 and in this guide. Teachers should adapt these to suit their needs.

    Teachers often differ in the way they like to collect data. Some prefer to have a single checklist, naming all the students in the class (or in one work group) across the top and listing the criteria to be assessed down the side. The students' columns are then marked if a criterion is met. In this case some information would have to be transferred later to a student's individual record.

    Other teachers prefer to have one assessment sheet per student, which is forms part of the student's record. That sheet would list the factors for assessment down the side, but along the top might be a series of dates indicating when assessment took place. Such an individual file would illustrate development over the year. In this case, information might have to be transferred from the record to the official class mark book, as required.

    Examples of these types of assessment sheets are also given in Science Program Overview and Connections K-12, and may be available from the Stewart Resource Centre at the Saskatchewan Teachers Federation offices in Saskatoon.

    Program Evaluation

    Program evaluation is a systematic process of gathering and analyzing information about some aspect of a school program in order to make a decision, or to communicate to others involved in the decision-making process. Program evaluation can be conducted at two levels: relatively informally at the classroom level, or more formally at the classroom, school, or school division levels. At the classroom level, program evaluation is used to determine whether the program being presented to the students is meeting both their needs and the objectives prescribed by the province. Program evaluation is not necessarily conducted at the end of the program, but is an ongoing process. For example, if particular lessons appear to be poorly received by students, or if they do not seem to demonstrate the intended learnings from a unit of study, the problem should be investigated and changes made. By evaluating their programs at the classroom level, teachers become reflective practitioners. The information gathered through program evaluation can assist teachers in program planning and in making decisions for improvement. Most program evaluations at the classroom level are relatively informal, but they should be done systematically. Such evaluations should include identification of the areas of concern, collection and analysis of information, and judgement or decision making.

    Formal program evaluation projects use a step-by-step problem-solving approach to identify the purpose of the evaluation, draft a proposal, collect and analyze information, and report the evaluation results. The initiative to conduct a formal program evaluation may originate from an individual teacher, a group of teachers, the principal, a staff committee, an entire staff, or central office. Evaluations are usually done by a team, so that a variety of background knowledge, experience, and skills are available and the work can be shared. Formal program evaluations should be undertaken regularly to ensure programs are current.

    To support formal school-based program evaluation activities, the Saskatchewan School-Based Program Evaluation Resource Book (1989) has been developed to be used in conjunction with an inservice package. Further information on these support services is available from Evaluation and Student Records Division, Saskatchewan Education, Training, and Employment.

    Curriculum Evaluation

    During the decade of the 1990's, new curricula will be developed and implemented in Saskatchewan. Consequently, there will be a need to know whether these new curricula are being effectively implemented and whether they are meeting the needs of students. Curriculum evaluation, at the provincial level, involves making judgements about the effectiveness of provincially authorized curricula.

    Curriculum evaluation involves the gathering of information (the assessment phase) and the making of judgements or decisions based on the information collected (the evaluation phase), to determine how well the curriculum is performing. The principal reason for curriculum evaluation is to plan improvements to the curriculum. Such improvements might involve changes to the curriculum document and/or the provision of resources or inservice to teachers. It is intended that curriculum evaluation be a shared, collaborative effort involving all of the major education partners in the province. Although Saskatchewan Education, Training, and Employment is responsible for conducting curriculum evaluations, various agencies and educational groups will be involved. For instance, contractors may be hired to design assessment instruments; teachers will be involved in instrument development, validation, field testing, scoring, and data interpretation; and the cooperation of school divisions and school boards will be necessary for the successful operation of the program.

    In the assessment phase, information will be gathered from students, teachers, and administrators. The information obtained from educators will indicate the degree to which the curriculum is being implemented, the strengths and weaknesses of the curriculum, and the problems encountered in teaching it. The information from students will indicate how well they are achieving the intended objectives and will provide indications about their attitudes toward the curriculum. Student information will be gathered through the use of a variety of strategies including paper-and-pencil tests (objective and open-response), performance (hands- on) tests, interviews, surveys, and observation.

    As part of the evaluation phase, assessment information will be interpreted by representatives of all major education partners including the Curriculum and Evaluation Division of Saskatchewan Education, Training, and Employment and classroom teachers. The information collected during the assessment phase will be examined, and recommendations, generated by an interpretation panel, will address areas in which improvements can be made. These recommendations will be forwarded to the appropriate groups such as the Curriculum and Instruction Division, school divisions and schools, universities, and educational organizations in the province.

    All provincial curricula will be included within the scope of curriculum evaluation. Evaluations will be conducted during the implementation phase for new curricula, and regularly on a rotating basis thereafter. Curriculum evaluation is described in greater detail in the document Curriculum Evaluation in Saskatchewan (Saskatchewan Education, 1991).

    Go back one page Copyright Saskatchewan Education Go to Evergreen Menu Go to Curriculum Menu Go to Discussion Area Go ahead one page