Your Source for Learning
Technology, Strategy, and News
    [Forgot Password?]
ARTICLES      
RSS feed RSS feed

The Roles and Design of Tests in Online Instruction

“Regardless of the learner’s path, and regardless whether the test is spread throughout a course or given all at the end, we will provide the most valid and respectful learning environment if our tests are fair and thorough and if they continue adding to the learner’s growth even as they measure their knowledge.”

Recently this thought hit me: “Who cares how employees learn it, so long as they can demonstrate that they do know it – on a test?” This notion may be my learner-control subconscious rebelling after a series of mandated minimum-student-contact-time online courses my team had to develop. In order to ensure that each learner met the minimum prescribed contact time, we felt compelled to force them to see most of the screens as well as listen to the entire audio on each of those screens. (We did try to make it interesting by making the courses scenario-based.)

Editor’s Note: Parts of this article may not format well on smartphones and smaller mobile devices. We recommend viewing on larger screens.

So, on courses that don’t have such mandates, can we go to the other extreme and give learners total control over what content they view, including the option of whether they even view the content we provide at all? Can they just prepare on their own and test out?

Perhaps … it seems that it depends on a few things:

  • Are the tests valid? Do they measure the right knowledge and skills?

  • Are the tests sufficient? Do they cover enough of the material so there’s a high correlation between test success and job performance?

  • Do employees really want to follow self-directed learning or constructivism approaches, or do they really want someone to guide them?

  • What about mastery learning, a practice shown to be among the most successful learning strategies of the past several decades?

There is an abundance of research on the above theories in academic settings, but not so much in workplace settings.

In my own experience, employees like it somewhere in the middle. In preparing this article, I researched a variety of terms while looking for one that described the concept I was after. I believe the concept of “mastery learning” comes closest. With mastery learning, the learners receive the resource materials but otherwise expect to study on their own, progressing to each successive level after passing the prescribed test(s).

Many schools use this practice. I learned of two examples of mastery learning in the workplace. In one, a new employee had to take a comprehensive test after one year. If she passed it, she received a sizable pay raise. If she failed, she lost that job. In another example, the setup was the same but after the employee passed the test the company had no money for the pay raise, so the employee left on his own. I plan to do more research on the use of this practice in the workplace.

I suspect mastery learning holds the most promise for enlightened workplace learning, but no matter where we are on this spectrum, there is a need not just for valid online end-of-course tests, but for tests with slightly different purposes, as well. That’s what we’ll cover in this article.

Tests: macro view

Where do test questions come from? If you immediately said, “the learning objectives,” then you get an A+. Before we look at construction of questions themselves, let’s look at tests from a macro viewpoint.

The role of tests

What role do tests play in online workplace learning?

“…our first concern with assessment is knowing what students know.” (sic) [Stoner and Smith, 2007 – see References at the end of this article]. But I have come to view tests as having much broader application than merely assessing knowledge, especially in Web-based training (WBT) where there’s no instructor to moderate the learner’s learning path, conduct role plays, offer encouragement, or debrief a test. We can use well-designed test questions diagnostically to moderate the learner’s path, reinforce learning, and help the learner gain new insights; and, together with appropriate feedback, they can even encourage and motivate the learner. Online assessments can play an economic role as well, by saving employers time, money, and effort. [Greenstein 2010]

Before we go on, let me set the scope of this article. The goal of workplace learning is job and organizational performance. Usually such performance involves some levels of knowledge and skill that we can’t teach or measure on a computer. We often use online learning (or Web-based training) for lower-level cognitive skills, but we’ll see how you can use it at higher levels of Bloom’s taxonomy, too. Other types of learning such as authentic assessment must and do take place in the workplace, and we may measure these in other ways. But although some of the principles mentioned here apply equally to paper and pencil tests, the thrust of this article is on using online tests, or automated testing as some would call it. [Brooks 1997]. We will also examine how to enhance instruction by what we do after the learner takes a test, and we’ll look at some authoring and learning management system capabilities and limitations.

Tests can measure knowledge at the outset of a course or program (pre-tests), within it (embedded), and at the end (post-tests). Tests can be summative (scored) or formative (used to adjust the learning path or approach). Properly set up and combined with the right tone of feedback or remediation, a test can encourage the learner and build self-confidence. We frequently embed self-check questions in our online courses that look and feel like games, and our employees love them! So much so that we even use them in some instructor-led group settings to add some variety and fun. I believe there is a key difference between tests used in a school setting and those used in the workplace. In school, the student’s goal is a high test-score, yet the instructor often wants to force a bell curve. Some would likely accuse an instructor whose students all get A’s of teaching the test. Conversely, in the workplace the goal is mastery – by everyone. Why? Because we want employees to know how to do their job. They may not all achieve 100%, but we hope to skew the curve as far towards 100% as feasible – learner ability, not intentionally difficult test design, being the variable.

Test types

Now let’s look at the pros and cons of using the various test types. Throughout this discussion I’ll use the word “course” to signify any designated content no matter how structured or unstructured.

Pre-tests

We give pre-tests before the learner embarks on a course. Here are some reasons commonly cited for using pre-tests [Berry, 2008, and Berman, 2001]:

  • To find out what the learner(s) already know about the subject. This may be useful to:

    • Establish a baseline to judge the effectiveness of the training or, at least, the amount of new knowledge the learners gained

    • Divide a group into learning groups (pertains more to instructor-led, but there is a parallel for WBT as we’ll see later)

    • Help the learner herself know which material to zero in on. [Stoner and Smith, 2007]

  • Provide a preview of the terminology and content as well as the depth of coverage.

  • Motivate the learner by virtue of the preview.

  • Set the learner’s expectations, especially if you plan to use similar or identical questions on the post-test.

The above reasons notwithstanding, in my opinion there is a single compelling time to use pre-tests, and that is when you expect a significant amount of prior knowledge among the learners and you will adjust the learning path to accommodate that knowledge.

One big downside of pre-tests is that they are, after all, a test. A significant percentage of learners already suffer test anxiety, so why exacerbate that fear. That can hardly be motivating.

I think there are more personal, congenial ways to motivate or give a preview than through a test. Borrow from fiction and establish suspense at the outset. Here is an example of an opening line you could use in a sexual harassment prevention course:

What if you overheard your neighbor at work making a lewd comment to another co-worker. Should you keep it to yourself, or tell someone? If you tell someone, are you putting yourself in jeopardy of retaliation?

Post-tests

The main reason for using a post-test is simply to measure the learning outcomes. Can the learner demonstrate mastery of the subject matter as defined by the course objectives? The post-test must evolve from solid and thorough task analysis in the early stages of instructional design. In fact, many instructional designers develop and finalize the learning objectives and then go straight to designing the post-test. Only then do they develop the actual instructional material.

I think a key to a successful post-test is what you do with the results. For one thing, the test review itself, even online, should serve as the capstone to that chunk of learning. And you should use item analyses to assess the effectiveness of the instruction and the test.

Embedded tests

If pre-tests are used for tailoring the learning path and post-tests are used solely to measure knowledge, then what is the role of embedded tests? They can do the same things the other two test types can do, and more. Embedded tests can:

  • Perform formative assessment, or diagnostics, throughout the course, tailoring the delivery to the learner’s existing knowledge on a topic-by-topic basis instead all at once up front.

  • Perform summative assessment, but at intervals throughout the course instead of all at the end. An embedded test can provide an alternative to a final exam. While discussing compliance training at a conference workshop a few years ago, most of the participants agreed it was sometimes more advisable to force learners to get each embedded question correct before they could move on than it was to use a scored final exam. The rationale? In effect, every learner has to get 100% in order to get credit for the course.

  • Provide practice for and/or a preview of the final test. Such a preview can reduce anxiety for the final test and can thus build the learner’s confidence, especially with the use of conversational, reassuring feedback.

  • Encourage and motivate. You can use short embedded tests along the way to reassure the learner in building-block fashion that he or she is acquiring and applying the knowledge at an acceptable level.

  • Provide a scenario-based learning experience.

    • In a course on sexual harassment prevention, we used multiple-choice questions in each scenario to simulate a conversation. [Dickinson, 2009] It wasn’t as open ended as a live roleplay, but it engaged the learner more than a straight presentation would have. It also enabled us to address the “gray area.” For example, an employee has options on whom to approach with a harassment complaint – the instigator, their manager, or HR – and there are pros and cons to each option. So we allowed multiple correct answers (thus letting the learner proceed in the course) while the feedback acknowledged the points to consider with each choice and why one option may be the preferred choice in a given situation.


(56)
I appreciate this article

Comments

Login or subscribe to comment

Mike,

Great article! You've looked at this topic in detail, and given some good advice.

A question I'm not sure how best to phrase. Could item analysis encourage test designers not to test the most important things in a course?

Suppose there's a topic that's not amenable to automated assessment, like Web design. Someone has to look at the learner's code, and judge it on multiple dimensions. There are not always clear right or wrong answers.

A test designer who must provide item analysis reports might avoid activities like this. But it's formative feedback on open-ended tasks that might have the most learning value.

What do you think?

Kieran
http://coredogs.com
Kieran,

Interesting question! I would have to research this to give you a consensus of what others do, but my immediate reaction is two-pronged.

1. I could see where an instructional designer might avoid subjective questions if they were indeed motivated and rewarded on the basis of favorable item analysis reports, or if they were expected to produce item analyses on questions for which they are not appropriate, i.e., subjective vs. objective. I hope there aren't people in this situation, because as you say there are often important, subjective skills that need to be assessed, though probably not online which is the focus of this article).

2. This is where I'd have to check the literature to be sure I'm correct, but I believe item analysis reports are generally used only for objective questions, especially ones like multiple choice or variations such as drag and drop.

As you imply, a subjective question, especially one that relies as much on art as science such as elegance of web page design, is not conducive to online grading except perhaps if the scorer has a reliable rubric to use as a scoring guide while viewing the programmer's 'product,' the web page. I would be interested in what others have to say. Here again, I would hope folks don't decide to avoid certain questions for the sake of the item analysis if such questions are essential to a valid assessment and logistically plausible.

Thank you for your question.
I think striking the balance on this is the key. When we approach assessment structure for certified training we use a mixed arsenal pre and embedded tests, both simple choice or quiz types and long answer open types.

Obviously there is a cost to assessing open-ended questions but they are invaluable for 'knowing what students know'.

My experience has been that test design must be tailored not only to the content to be assessed but to the position of the learning in the course.

With tests popping up throughout our learning pathways students need a balance of challenge, encouragement and (importantly) limits.

Interested to know how you have approached structuring your test types in courses.

Thanks for the article,
Dan
http://leapelearning.com
Mike,
Thanks for pulling all this together in one thoughtful article. A few comments/thoughts:
-- According to some brain research gurus, tests are really good vehicles for rehearsal and practice, but not really assessing what has made it into long-term memory. Given this, traditional tests really only measure what's in working memory, most of which disappears within 18-24 hours. Most of our workplace training needs to focus on transfer and application of skills so online tests don't really measure "learning" that can be reliably transferred. How do we reconcile this with instructors/staff wanting to point to "knowledge or skills gained" when they compare the pre-tests and post-tests? Sure, there might have been an increase in knowledge for a brief period of time, but how much of it will be recalled a day later? Sustained knowledge and skill transfer is what we're after.
-- Also, multiple choice questions with multiple answers require more complex recall and discrimination, just like in the real world. They are certainly more challenging to the student and the instructor (to write meaningful feedback) but they can play an important role.

Your thoughts?Thanks!

Related Articles

We have tended to think of competence as a fixed point: either you are competent or you are not. If that’s the whole story, Albert Einstein was just a competent physicist, Florence Nightingale was a competent nurse, and the Beatles were only a competent band. There has to be more, beyond mere competence. There’s a continuum of performance from novice to master. Learn about it here!
Welcome to Nic Laycock, our newest columnist! Nic will be reporting each month on eLearning in EMEA – Europe, the Middle East, and Africa. In his first column, his focus is on the comments of Keri Facer, keynote speaker at PELeCON12 in the UK, about the nature of learning in an uncertain future.
Articulate announces the release of Articulate Storyline, the first eLearning authoring tool that’s simple enough for beginners and powerful enough for experts. The innovative features in Articulate Storyline revolutionize how eLearning authors create interactivity – there’s no programming required. Getting eLearning content to learners everywhere is easier than ever with Articulate Storyline.