|
|
|
Printer-friendly version of this
essay
Bringing the Science Assessment Standards into the Classroom
Audrey B. Champagne
with
Susan A. Sherwood
Özlem Çezikturk
University at Albany
State University of New York
Here's a pop quiz. No fair looking at your neighbor's answer
- What is the purpose of assessment in science education? Choose one:
- To monitor student progress
- To plan teaching activities
- To formulate education policy
- Depends upon whom you ask
- All of the above (and more)
The correct (and sometimes maddening) answer is "e. All of the above (and
more)." Because assessment serves many purposes in our efforts to improve
science achievement, assessment is a central focus of the standards-based reform
movement. This new emphasis on assessment has been frustrating and at times overwhelming
to classroom teachers whose primary goal is to develop science literacy, not to
test it. The purpose of this essay is to suggest how the Assessment Standards
contained in the National Science Education Standards (National Research Council,
1996) can help science teachers meet this goal.
To begin, we should all have a common definition of the term assessment.
Time for another pop quiz (you know the rules):
- Which of the following is most consistent with the definition of assessment
contained in the National Science Education Standards? Choose one:
- Multiple choice quizzes focused on factual information
- Essay exams used to measure students' conceptual understanding
- Performance tasks used to measure students' abilities to do inquiry
- Tests used for grading
- Data collection with a purpose
And the answer is "e. Data collection with a purpose."
Next question: What kinds of data are collected and for what purposes? Government
agencies at the federal and state levels spend millions of dollars collecting
student achievement data, teacher quality data, and per-pupil expenditure data.
These data are used primarily for the purpose of making policy. Some states
use individual student achievement data to determine which students will receive
high school diplomas. Local districts collect student achievement data and use
it to identify teachers and schools that are doing the best job of meeting standards.
Teachers regularly collect student achievement data for the purpose of grading.
Teachers also collect data to plan and guide instruction and provide feedback
to students on their progress. Monitoring students' responses (verbal and non-verbal)
to teaching/learning activities and responding to them on a minute-to-minute
basis is almost automatic. Teachers, therefore, are often unaware that they
are collecting data and using it to decide how to proceed with a lesson. Intuitively,
teachers understand that assessment is a powerful tool for improving student
learning. The National Science Education Standards contain important information
about how to harness the power of the assessment tool.
Next quiz:
- How many National Science Education Assessment Standards are there?
- Twenty-eight
- Seems like a thousand
- Forty
- Whose standards are we talking about?
- Five
The total number of National Science Education Standards (NSES) set by the
National Research Council (NRC) is twenty-eight. (The NRC NSES are not to be
confused with the AAAS Benchmarks for Science Literacy.) Of the 28, five are
Assessment Standards. The NRC Standards have been created for all facets
of science education, including teaching, professional development, content,
assessment, programs, and agencies and organizations. With that framework in
mind, let's focus on how the five Assessment Standards can be used to
improve student achievement.
Standard A: Assessments must be consistent with the decisions they are
designed to inform (NRC, 1996, p. 78).
This standard suggests that as teachers prepare for assessment, they need to
answer three basic questions:
- What general purpose will the data I collect serve? Is the purpose
to plan my teaching, report to parents, provide feedback to students
?
- What specific decision will I make with the data I collect? Will
I use the data to decide how to improve the way I teach inquiry? Will I use
the data to decide whether or not to fail a student? Will I use the data to
decide if I should re-teach weight-weight problems tomorrow?
- What data do I need to make the decisions? If my decision is related
to the teaching of inquiry, my data should measure how students' ability to
inquire improved as a result of the teaching method I used. If my decision
is about failing a student, my data should include, among other data, a broad
range of information about the student's achievement, the effort the student
has put forth, and mitigating circumstances in the student's personal life.
If my decision is about whether or not to re-teach weight-weight problems
tomorrow, my data might include the questions students asked at the end of
class and their performance on the practice problems assigned for homework.
Standard B: Achievement and opportunity to learn science must be assessed
(NRC, 1996, p. 79).
There are two considerations here, opportunity to learn (OTL) and achievement.
When addressing OTL, the NRC Standards tell us that "Student achievement
can be interpreted only in light of the quality of the programs they have experienced"
(p.78). Furthermore, "
because student achievement is in part dependent
upon opportunity to learn, opportunity to learn and achievement must be assessed
equally" (p. 82). Fair's fair. Without adequate resources, time, and teaching,
students can not be held accountable for achievement. When examining student
achievement, you need to ask yourself, did I give my students adequate opportunity
to learn? As a classroom teacher, not only do you need to assess student achievement
but also the OTL that students in your classroom have had to develop the understanding
and abilities that you expect them to achieve.
The second consideration is that assessment should focus on highly valued
content (inquiry; understanding facts, concepts, theories, and principles;
scientific reasoning and decision making; and scientific communication) not
just content that is easily assessed. Basic knowledge is more easily assessed
than understanding or the abilities of inquiry. Even so, all must be addressed.
If, for instance, only basic knowledge is assessed, both teachers and students
will likely dismiss the importance of the understanding or the abilities of
inquiry. Assessing inquiry abilities can be a time-consuming ordeal, but do
we really want students to get the message that inquiry is of little significance?
(Champagne et al., 2000). Moreover, if most of our assessment consists of multiple
choice exams of scientific factoids, we are telling students that if they can
pass a test on that information, they "know" science (NRC, 1996, p.
82). The assessments you give communicate to your students the content you believe
has most value. Consequently, you need to ensure that the content you assess
is the valued content defined by your state and the National Science Education
Standards. Additionally, the content must be assessed using appropriate measures.
Standard C: The technical quality of the data collected is well matched
to the decisions and actions taken on the basis of their interpretation (NRC,
1996, p. 83).
The strategy for collecting data should match what you are trying to measure.
For instance, it doesn't make sense to assess students' ability to conduct scientific
inquiry with a conventional paper-and-pencil test. However, a multiple-choice
test could be very useful to a teacher measuring student ability to get information
about the chemical elements from the periodic table.
Assessments should be realistic (some use the adjective authentic),
that is, similar to the activities engaged in by scientists or scientifically
literate adults. Even when conducting assessments, you should be preparing your
students to use their knowledge and abilities in situations outside of school.
For example, you might design a test of students' ability to perform purity
tests on water in the context of providing information to a town board faced
with an environmental decision. Done properly, assessment is a learning tool
for students as well as a measuring tool for teachers.
A prime directive of this standard is that serious decisions require
precise and accurate data. An informal "seat-of-the-pants"
classroom assessment is fine if you are monitoring student feedback contemplating
how to proceed with a lesson. However, if the purpose of data collection has
far-reaching consequences such as student retention, you must be confident that
the data you have collected are reliable and valid.
Reliable assessments produce similar results. If inquiry skills are the content
being measured, the same results should be obtained whether the inquiry task
is in the context of the physical or life sciences. An assessment is valid when
it measures that which it claims to measure. For instance, students' ability
to generate a testable hypothesis can not be measured using short answer
items that require the student to identify those hypotheses that are
testable.
When making a high-stakes decision such as who passes or fails a course, you
have the responsibility to collect high quality data. The assessment must therefore
be rigorously constructed.
Standard D: Assessment practices must be fair (NRC, 1996, p. 85).
To say it another way, assessments should be equally unfair to all students.
Tasks need to be set in a variety of contexts that do not favor the experience
of any one group; that is, set in contexts which are accessible to males and
females; city and rural dwellers; and any other groups you can identify whose
experiences are unique. For instance, if physics teachers take the effort to
include problems about rotational motion in their test banks that are set in
the contexts of the kitchen, the farm and the garage, all students will have
equal chances to encounter items set in contexts with which they are familiar.
Conversely, all students will have an equal chance to encounter items set in
contexts with which they are not familiar. The purpose of this standard is more
than fairness; it's to ensure assessment validity. Results should be related
to student science content understanding only, not to gender, ethnicity, or
other exogenous factors (NRC, 1996, p. 85).
Standard E: The inferences made from assessments about student achievement
and opportunity to learn must be sound (NRC, 1996, p. 86).
Personal beliefs and experiences often come into play when we reason from data
to conclusions, even when we strive for objectivity. To encourage others to
consider the soundness of our conclusions, we must identify assumptions and
be explicit about each step in the reasoning process from data to conclusion.
For example, if your students do very poorly on a test you have given, you
might draw several different conclusions: the students are poorly motivated;
the test was too hard; the content was too difficult; you did not give them
ample opportunity to learn. Which, if any, is reasonable, and why? Any conclusion
you draw without support of the underlying assumptions and clear evidential
reasoning is meaningless. Whether discussing individual or class achievement,
teachers should be vigilant about providing support for their inferences and
conclusions to parents, administrators, and the community. In this way teachers
present themselves as well-prepared professionals.
Last quiz, we promise. Since this question has several complex answers, we'll
use a constructed response format.
- What is an appropriate next step for you to take after reading this essay?
Write your response.
Possible answers include some or all of the following (the more the better).
Using the Assessment Standards as guidelines, teachers can explore the details
of their programs to see if adequate time and materials are present for the
level of student achievement that is expected. They can evaluate their assessment
procedures to determine if all levels of student learning (basic content, inquiry
skills, understanding) are included. Assessments to which students are exposed
(including those that are required by the district and state) should be examined
for reliability and validity. To be most effective, all of these steps require
time and collaboration, so share this information with your colleagues and begin
analyzing opportunity to learn and about assessment in your classroom, school,
district, and state. You need to go beyond the classroom as you think about
assessment; even after you close the classroom door, district requirements and
state mandates vis-a-vis assessment will influence what you do. Realizing the
potential assessment has for improving your students' opportunity to learn will
take a while. (You might want to order a pizza; you'll need sustenance, but
hold the anchovies.)
References
American Association for the Advancement of Science (1993). Benchmarks for
Science Literacy. New York: Oxford University Press.
Champagne, A. B., Kouba, V. L., and Hurley, M. (2000). Assessing inquiry. In
J. Minstrell and E.van der Zee (Eds.), Inquiring into Inquiry Learning and
Teaching in Science. Washington, D. C.: American Association for the Advancement
of Science.
National Research Council (1996). National Science Education Standards.
Washington, DC: National Academy Press.
|