Your Source for Learning
Technology, Strategy, and News
    [Forgot Password?]
ARTICLES      
RSS feed RSS feed

Delivering Computerized Assessments Safely and Securely

"Organizations have a wide choice of assessment environments, authoring practices and technologies that make it possible for them to deliver assessments of various types safely, securely and economically. It’s essential to consider the nature and purpose of assessments and use appropriate delivery environments in order to balance security requirements with the need for cost-effectiveness."

Security is a pressing concern for organizations using computer-based assessments, but so is cost-effectiveness. It’s essential to maintain the integrity of item pools, keep test scores secure and minimize the possibility for cheating — but without an exorbitant price tag. The thoughtful use of technology, combined with careful consideration of an assessment’s purpose and consequences, makes it possible to deliver assessments safely, securely and economically.

Over-engineering low-stakes assessments can result in unnecessary costs and wasted time. Under-engineering high-stakes assessments can undermine people’s confidence, organizational processes and the face validity of an assessment. This article provides a methodology for assessing the security requirements of your computer-based testing programs. It will help you implement appropriate security measures based on the purpose and nature of assessments. It will also explore the various means and technology available to enhance the safety and security of a wide range of computerized assessments.

Differentiating between assessments

The first step in determining security levels for assessments is to distinguish between different types of assessment. One important distinction is whether you are using computerized assessments to:

    a. Measure knowledge, skills, attitudes, and personality traits; or,

    b. Promote learning and reduce forgetting.

Another way of classifying assessments is according to their consequences (or stakes). The potential outcomes of an assessment could affect a participant’s propensity to cheat and consequently how much security must be applied. After high-stakes assessments people might be hired, fired, promoted, demoted, graduated, not graduated, and/or authorized or certified to perform a particular job or role. Other assessments, with lower stakes, have far less significant outcomes.

Assessment purposes

The method of delivering a computerized assessment depends largely upon the use and consequences of the results. The participant’s motivation to complete the assessment is important too. This often relates to the nature of the assessment, which I’ll define with the following terms:

Formative: designed to help students turn information into knowledge by asking questions that tend to increase remembering and reduce forgetting.

Summative: designed to measure knowledge, skills and attitudes by asking questions and measuring the responses.

Diagnostic: designed to provide a diagnosis and prescription to help people reach their objectives.

Assessment levels

Assessments are often referred to as exams, tests, quizzes, or surveys depending on whether the assessment is intended to measure or promote learning. For the purposes of this article, I’ll define an assessment as any systematic method of obtaining evidence by posing questions to draw inferences about the knowledge, skills, attitudes and other characteristics of people for a specific purpose.

There are four levels of assessments, mentioned above. Here are the definitions as they are used in this article:

Exam: A high-stakes summative assessment used to measure and document a student’s current knowledge or skills. Its purpose is to measure knowledge. Consequences for passing or failing an exam might extend to being hired, fired, promoted, demoted, released from custody, authorized or certified, graduating or not graduating. People who take exams are highly motivated to complete them successfully.

Test: A medium-stakes, diagnostic assessment to measure a student’s knowledge or skills. A test is designed to inform the student or their tutor of the student’s current level of knowledge or skill. Its purpose is to measure knowledge in order to promote learning. The consequences for passing or failing an individual test are not high — however, consistent failures might extend to being hired, fired, promoted, demoted, graduating or not graduating. People who take tests have a medium-level incentive to complete them successfully.

Quiz: A low-stakes formative assessment used to measure a student’s knowledge or skills and provide feedback. It is designed to provide feedback to inform the student of their current level of knowledge or skill. Its purpose is to promote learning. There are few if any consequences for passing or failing a quiz. Quizzes are normally formative assessments used to promote learning and reduce forgetting rather than forming a summative judgment. The motivation to complete a quiz is medium.

Survey: A low-stakes assessment to measure the opinions, attitudes and sometimes knowledge and skills of a group. Its purpose is to guide thinking about how to improve learning activities and job aids for individuals, the group, or future groups. There should never be direct consequences for a survey respondent as you are looking to understand true opinions and attitudes. However, as results are tabulated for the group, training courses, job aids, and other interventions may be planned. Sometimes the opinion might have consequences for others: an instructor in the case of a course evaluation or managers in the case of an employee attitude survey. Despite the low stakes of surveys, the sensitive nature of the information they collect can call for considerable security. (See Case Study #1.) It goes almost without saying that the motivation to complete a survey is generally low. Indeed, the challenge is to get people to respond!

 

SIDEBAR 1: The Case Studies

Case Study #1 — Student Services Survey

Scenario. A community college in the southwest United States wanted to improve student retention by providing relevant and useful student services on campus. To get information on the most needed services, the college conducted a survey of currently enrolled students to determine their needs, issues, and concerns. The college wanted to make the survey easily available to currently registered students, but because many of the survey items asked about sensitive information (e.g., needs for financial aid or medical facilities), the results of the survey needed to be confidential.

Delivery requirements. The requirements for the delivery of the student survey are listed below:

  1. The survey must be easily accessible to all currently registered students.
  2. The survey must allow only registered students to access and enter survey responses.
  3. The survey results must be anonymous. In other words, it must not be possible to trace survey responses back to individual students.

Delivery solutions. The first requirement was met by using a Web-based authoring and delivery tool to allow the survey questions to be created and published on the college Website. College administrators advertised the survey through email and signs posted on campus. From any computer workstation with an Internet connection and Web browser, either on- or off-campus, students could access the student services section of the college Website and select the link to the survey.

The second requirement was met by loading student registration records into the survey database prior to administering the survey. Students were told to use their student identification number as the username for logging into the survey. By entering a student ID, the student is validated in the system and allowed to respond to survey questions. This validation process prevents any non college students from accessing the survey.

The third requirement was met by choosing an option within the survey delivery tool to make the user responses anonymous. This feature stores question responses but does not store the responses with any student identifying information such as the student identification number. This allows all student responses to survey questions to remain anonymous but useful in an aggregated format.

Summary. In short, new technologies in survey authoring and delivery tools allow colleges to easily develop Web-based surveys and configure those surveys to validate users responding to the survey but allow user responses to remain anonymous. This adds an important level of flexibility for colleges needing to collect large samples of anonymous data.

Case Study #2 — Surgical Technician Quizzes and Tests

Scenario. In today’s job market, surgical technicians are in large demand, and community colleges are struggling to offer surgical technician programs to a large base of students. Kirkwood Community College addressed the demand by creating a Surgical Technician Consortium that included four other colleges in Iowa to offer surgical technician courses via a distance delivery program.

Hawkeye Community College, Indian Hills Community College, Northeastern Community College, and Southeastern Community College all had students expressing an interest in pursuing surgical technician careers. These colleges, however, had no surgical technician faculty or budget for hiring faculty, so they contacted Kirkwood to determine if Kirkwood’s surgical technical program could be offered remotely to their campuses. In response, Kirkwood faculty developed courses that could be conducted over a statewide fiber-optic network where students and teachers interacted through real-time video conferencing. Quizzes and tests were Web-based assessments authored by faculty at Kirkwood and delivered to students at each participating campus.

While Kirkwood had a great deal of experience offering distance- delivered courses, the college had less experience with offering assessments over the Web to remote locations. The first time a course was offered, a number of security issues arose related to the standardized administration of assessments across the multiple remote campuses. The Kirkwood surgical technician faculty worked through the issues and implemented both software and procedural solutions to improve the delivery of the course quizzes and exams.

Security Requirements. Quizzes and tests offered through the distance-delivered surgical technician courses needed to meet a number of security requirements, as described below:

1) All quizzes and tests needed to be authored and controlled by Kirkwood faculty to ensure identical quiz and test content across all remote locations. 2) Proctor guidelines needed to be in place to ensure the standardized delivery of quizzes and tests, preventing differences in test delivery from affecting student performance. 3) Quiz and test questions should be locked-down or protected so that students could not easily share the content of questions with their peers or search for answers to test questions using unapproved resources.

Security Solutions. Faculty easily met the first requirement by building the assessments using standardized test authoring and delivery tools. Only surgical technician faculty at the main campus with access to assessment authoring tools could create, update, and publish the quizzes and tests. All students received the same quizzes and tests throughout the surgical technician courses.

The second requirement posed a number of issues. Test center facilities and proctor procedures were not the same across the remote campuses. For example, in some locations, students launched their own quizzes or tests from a link on computer desktops without proctor supervision, whereas in other locations proctors launched quizzes or tests for the students. In another situation, some proctors regularly checked student picture IDs, where other campuses did not. Kirkwood faculty, after learning about some of the differences in administration modified the process for launching quizzes and tests. First, the faculty enabled a proctor password feature in the test delivery software that requires a test proctor to enter a username and password before a quiz or test can be launched for a student. This prevented students from launching their own tests without proctor supervision. Second, the faculty developed written procedures for test proctors at remote locations so the protocol for proctoring tests was very clear.

The third requirement, preventing students from sharing question content, was met in three different ways. First, Kirkwood surgical technician faculty required the quizzes and tests to be administered under a secure browser offered with the test delivery software. The secure browser prevented students from accessing a standard browser tool bar or leaving the browser window while taking a test. While working in the secure browser, students couldn’t surf the Internet, launch other applications like word processors or email programs, or print the question content. Second, the surgical technician faculty authored the quizzes and tests to administer questions randomly. If two students sat down at adjoining workstations to take the same quiz, the two students would not receive their questions in the same order, greatly limiting their ability to share answers. Finally, by requiring that the tests be administered in a proctored setting, students were monitored while they tested, which limited their opportunities to share content.

Summary. The control of quiz and test content, the standardized administration of quizzes and tests, and limiting students’ ability to share question content were the security concerns for the surgical technician courses. These concerns were met with a combination of software settings and proctor guidelines.

Case Study #3 — Are You Sure You Need a Proctor?

A U.S. software company believes it is more important to ensure that certification is available to everybody who wants to get it than to insist that all certification exams be proctored. Setting proctored assessments in specific locations at specific times inevitably limits the opportunities for sitting the assessments.

Instead, after successfully completing the pre-requisite software training courses, candidates can register using a credit card for an online exam that they need to pass in order become a certified professional in the use of this company’s products. The candidates then take the appropriate assessments at home or wherever they wish.

The company argues that, should anybody dispute the validity of a certified professional, then they can arrange to serve as proctor and request that the person retake the test. The company has found it much easier to run its certification program without proctors for every test. The safeguard of allowing people to challenge the validity of a certification and proctor the re-test ensures that the system is not abused.

 

This article does not attempt to describe all of the consequences, legal liabilities, validity, reliability and planning issues that are involved with writing an assessment that is suitable for its purpose. Many other publications detail the processes and procedures used to produce good and fair assessments. However, as you can see from Figure 1 all of these issues are related to the stakes of an assessment.

 

2 comparative vertical bar charts describing the stakes of assessment

FIGURE 1 Low, medium and high stakes

 

Delivery issues

Good decisions about assessment environments begin with considering the distinct set of delivery issues that pertain to each type of assessment. The higher the value of the assessment, the more you will want to control the delivery environment and the more supervision you will require. Figure 2 details some of the issues that you must consider when deploying computerized assessments:

 

screenshot of spreadsheet noting learning delivery issues

FIGURE 2 Delivery issues

 

Delivery options

Now that you’ve considered the nature, purpose and consequences of your assessments, you need to consider your options regarding assessment environments. High-stakes assessments must offer a consistent environment that is free from distractions together with some form of monitoring by an invigilator or proctor. Low-stakes assessments do not require such rigor. The greater the rigor required, the more expensive an assessment will be to deliver. The sponsoring organization that owns an assessment stipulates the physical environment, technology, monitoring standards and authentication methods for delivering it. (See Figure 3.)

 

chart of comparing different elearning delivery methods

FIGURE 3 Delivery method

 

Where should assessments be delivered?

You need to consider a wide range of delivery methods when planning assessments. An assessment center offer controlled environments where groups of users can take assessments under conditions laid out by the organization that is running it. This means that candidates can be sure of fair conditions and equal treatment. Assessment centers can be used for a wide variety of assessments, depending on the way they are set up and administered.

Physical Environment. Assessment centers should be enclosed rooms without through-traffic. They should be quiet and free from distractions, well lit and with a comfortable temperature throughout the year. Computers should be separated from one another by acoustic partitions. The equipment should offer similar operation, accessibility options and performance. Candidates should not be able to print or capture the assessment content; they should be prevented from accessing resources, such as Web pages, cell phones, digital phones and digital cameras that might unfairly help them during the assessment process.

Technology environment. Assessments should be delivered on consistent and comparable technology. Screen sizes, processor speeds, and network connections do not have to be the latest and greatest, but they do need to be comparable.

Monitoring. You can monitor (or proctor/ invigilate) assessment centers in a number of ways. Candidates should be in an environment that permits constant visual monitoring. This can be achieved directly by a person in the room or via video surveillance equipment.

Authentication. It is important to confirm that the person taking the test or exam is truly the person authorized to do so. This can be done by confirming a candidate’s identity via a photographic I.D. provided by the government, academic institution, or, in some cases, by their employer or simply by a signature.

I use the following grading system to indicate the suitability of different delivery methods for particular assessment types:

  1. the method is very suitable;
  2. the method is suitable;
  3. suitability will vary according to circumstance
  4. the method is unsuitable;
  5. the method is definitely not suitable.

Which type of center suits which type of assessment?

Computerized assessment centers come in various guises. Some of the ways in which they can be defined are as follows:

Professionally controlled centers. Professionally controlled centers provide a very consistent and highly controlled environment where a candidate could expect to receive exactly the same treatment and experience from one assessment center to another. This consistency would start with their reception and continue through the assessment process to the time they leave the premises. These centers are used for very high-stakes tests such as nursing and medical exams. They are expensive to use and maintain. Professionally controlled centers provide the consistency and rigor required for very high-stakes, government regulated certification exams. Proctors at these centers will ensure that candidates do not enter the testing room with banned devices such as cell phones, photo phones and digital cameras.

These centers are rarely used to deliver non-regulatory exams due to the high costs associated with these centers. Here’s how they score for the various types of assessments we have discussed:

Very high-stakes exams A+
High-stakes exams A
Quiz F
Survey F
Test D

Franchised centers. Franchised centers are similar to professionally controlled centers, but they cannot enforce the rigor required for very high-stakes assessments. However, these centers are less expensive and provide a great place to offer high-stakes tests in a reasonably consistent environment. These centers are far more prolific than professionally controlled centers. They deliver government regulated exams as well as non-government regulated exams. There are approximately 5,000 franchised centers around the world. How do they rate for different assessment types?

Very high-stakes exams B
High-stakes exams A
AQuiz F
Survey F
Test C

Higher Education centers. Many higher education institutions offer their testing facilities to local companies and are sometimes part of a franchised assessment network. These centers are similar to professionally controlled centers, and they enforce the rigor required by their institution. These centers are less expensive and provide a great place to offer high-stakes tests in a reasonably consistent environment. These centers are ideal, not only for the institution’s use but also for local companies that need to administer high-stakes assessments in conjunction with a local institution. Here’s a report card for them:

Very high-stakes exams B
High-stakes exams A
AQuiz D
Survey F
Test C

Corporate centers. Increasingly, large organizations are establishing their own assessment centers to provide a consistent, confidential, and cost effective method to deliver high-stakes assessments. These centers might be testing potential employees, partners or existing employees on safety or government regulation related issues. These centers are equipped in a similar fashion to professionally controlled or franchised centers. Their grades for different types of assessments look like this:

Very high-stakes exams A
High-stakes exams A
AQuiz C
Survey C
Test B

Non-dedicated assessment centers

Most assessments, computerized or otherwise, are taken within non-dedicated assessment centers. These facilities are used for other purposes but can be used effectively to administer quizzes, tests, exams and surveys. The rapidly improving features, quality, ingenuity and authentication capabilities of assessment software make it possible to run many higher-stakes assessments in these centers. Non-dedicated assessment centers can be one of the following:

Training rooms doubling as assessment centers. Training rooms may not offer as tightly controlled an environment as a dedicated assessment center. For example, a training room is unlikely to offer screening between computers. But you can overcome such limitations with a little thought and ingenuity. Professional assessment software now offers technology that can turn training rooms into assessment centers. Professional software packages can prevent printing, capturing assessment content, exiting inappropriately and task switching. They can also enable you to randomize the distracters in multiple choice and response questions or even create an assessment from randomly-drawn questions, thereby making it difficult to cheat. Note the particular suitability of training rooms for surveys and tests:

Very high-stakes exams B
High-stakes exams A
AQuiz B
Survey A
Test A

Closely supervised in the work place. Of all the issues that affect the delivery of computerized assessments, monitoring (invigilating or proctoring) remains critical for delivering high-stakes certification exams. Local supervisors or managers can monitor widely dispersed candidates. This makes supervised assessments in the work place a practical, valid and cost effective way to provide high-stakes assessments. Beyond that, the advances in assessment software mean that employees can take even very valuable assessments at their own desks with limited supervision. Assessments can open at a specified time, making sure that the candidate cannot get a “sneak preview.” Although you wouldn’t want to offer very high-stakes exams in a supervised work place, you can certainly consider this for other purposes:

Very high-stakes exams C+
High-stakes exams B+
AQuiz C
Survey C
Test A

Remotely supervised in the work place. It is now possible to monitor candidates remotely using video cameras. Although not suitable for very high stakes assessments, this can provide a valid way to administer exams remotely.

Very high-stakes exams C
High-stakes exams C+
AQuiz C
Survey C
Test B

Unsupervised at the work place. Low and medium-stakes unsupervised assessments at a participant’s desk or other work venue are becoming increasingly important. Organizations are striving to discover how effective their training is, to identify the people who need training and to provide new ways to promote learning. This form of assessment doesn’t require the same monitoring, but it does require assessment software that prevents people from printing or capturing confidential information.

Very high-stakes exams F
High-stakes exams D
A Quiz A
Survey A
Test A

Supervised at home. Some distance learning institutions and small-scale certification authorities are now administering higher-stakes assessments at home. However, the candidate must provide the organization with a professional in the field of study (e.g. a doctor, lawyer or accountant) to supervise the assessment. Once again, the success of this kind of delivery lies in the assessment software and how monitors are selected.

Very high-stakes exams C
High-stakes exams A
A Quiz C
Survey C
Test A

Unsupervised at home. Most commonly used for delivering low-stakes self assessments aimed at promoting learning and reducing forgetting.

Very high-stakes exams F
High-stakes exams F
A Quiz A
Survey A
Test B

Unsupervised in a public place (library).

Most commonly used for delivering low stakes self-assessments aimed at promoting learning and reducing forgetting.

Very high-stakes exams F
High-stakes exams F
A Quiz A
Survey A
Test B

Whatever choice you make about assessment delivery, using technology judiciously can improve the safety and security of assessment content and answers, at low cost.

Creating the right assessment environment

Delivering assessments securely is becoming easier as the quality of networks and assessment software improve. Secure assessment players and various monitoring

Solutions are making it possible to deliver many types of assessments safely and securely, either remotely at the desk or in a training room. When close monitoring and a consistent environment are essential for geographically dispersed candidates, there are many excellent assessment centers to choose from. But many organizations are now being empowered by assessment software to create and run high-stakes assessments within their own environments. As the quality of software, monitoring and network solutions grow, this trend will certainly continue.

A combination of software settings and proctor guidelines helped a consortium of communities address the security concerns presented by a distance learning program for surgical technicians. (See Case Study #2 in the sidebar.) And even with certifications, creative thinking and planning can do away with the need for proctors. (See Case Study #3.)

Creating the physical environment

The ideal assessment environment would include the following characteristics:

  • Comfortable, clean and smoke-free
  • Adequate lighting, ventilation, comfortable seating and work surfaces for the candidate during the assessment session
  • Free from external distractions
  • No opportunity to use equipment such as printers, fax machines, copiers or telephones while taking an assessment
  • Provide a viewing window and/or video surveillance system
  • If a monitor is present they should have an unobstructed view of each candidate within the assessment room.
  • IT equipment should be consistent and robust and comply with the local accessibility rules and regulations.

Deploying software to deliver secure assessments

Although many issues can be addressed within a physical environment, assessment software can help ensure that assessments are delivered in a secure way. These include:

Randomization and shuffling

There are many safeguards that can be taken during the assessment authoring process to help protect the security of the assessment. One of the most popular is to shuffle the order of the choices. You can also deliver the questions themselves in a random order. Both of these features help prevent cribbing when users are sitting in non-screened assessment centers.

Encrypted communications

Most computerized assessments are now delivered via the Internet or an intranet. It is important to encrypt communications between the browser and server. This means that if someone ‘sniffs’ the network they would not be able to see the information sent from server to browser and vice versa. This can be done with Secure Sockets Layer (SSL), a protocol that allows the browser and Web server to encrypt their communication. This means that anyone intercepting the communication will not be able to read it.

Scheduling assessments

Some software enables users to set start times of assessments, the length of assessments, and the number of times that assessments may be taken. You can also enable user names and password protect your assessments.

Monitoring assessments

A “monitor” option ensures that a participant cannot start an assessment until a proctor or invigilator has logged on after the candidate. The monitor can also be limited to a range of IP addresses to ensure that a certain physical location is used to administer the assessment.

Secure browsers

You can now ‘lock down’ computers that are being used in assessments to keep users from accessing other applications and Websites while engaged within a medium- or high-stakes assessment. This lockdown prevents candidates from printing, capturing screens, accidentally exiting the assessment viewing source, task switching, etc. The server must effectively authenticate the secure browser to prevent spoofing by technically competent cheats.

Conclusion

Sidebar 2 summarizes software features and their benefits as applied to each level of assessment.

 

SIDEBAR 2: Match software features to assessment levels

The following summarizes software features and their benefits as applied to each level of assessment:

Exams

  1. Secure browser
    • Keeps assessment content secure
    • Prevents printing, capturing content, task switching, etc.
  2. Secure Sockets
    • Prevent network pirates from intercepting content
  3. Monitors
    • Authenticate the candidate
    • Monitor the candidate
  4. Monitor from a specific range of IP addresses
    • Ensures the correct room/computer is used
  5. Assessment only available during a limited time window
    • Reduces opportunities for people to relay information about the test
  6. Shuffle choices
    • Prevents candidates from seeing which choice another candidate selected
  7. Randomized questions
    • Candidates get different questions, which reduces the possibility that they will see the answers from another candidate
  8. Jumps can stop tests for poor candidates
    • Prevents exposing content to candidates who obviously won’t pass
Tests
  1. Secure browser
    • Keeps assessment content secure
    • Prevents printing, capturing content, task switching, etc.
  2. Secure Sockets
    • Prevent network pirates from intercepting content
  3. Monitors
    • Authenticate candidate
    • Monitor the candidate
  4. Monitor from an specific range of IP addresses
    • Ensures the correct room/computer is used
  5. Assessment only available during a limited time window
    • Reduces opportunities for people to relay information about the test
  6. Shuffle choices
    • Prevents candidates from seeing which choice another candidate selected
  7. Randomized questions
    • Candidates get different questions, which reduces the possibility that they will see the answers form another candidate
  8. Jumps can direct students to helpful content
    • Prevents exposure of content that they aren’t ready for
Quizzes
  • Secure browser
    • Keeps assessment content secure
    • Prevents printing, capturing content, task switching, etc.
  • Secure Sockets
    • Prevent network pirates from intercepting content
  • Assessment only available during a limited time window
    • Reduces opportunities for people to relay information about the test and short-circuit the cognitive process
  • Randomized questions
    • Candidates get different questions to practice with
    • Provide repeated search and retrieval practice
Surveys
  • Respondents’ details (name, IP address) can be erased to:
    • Maintain anonymity
    • Eliminate ballot rigging

 

Organizations have a wide choice of assessment environments, authoring practices and technologies that make it possible for them to deliver assessments of various types safely, securely and economically. It’s essential to consider the nature and purpose of assessments and use appropriate delivery environments in order to balance security requirements with the need for cost-effectiveness.


(1)
I appreciate this article

Comments

Login or subscribe to comment

Be the first to comment.

Related Articles

Adobe’s Portable Document Format (PDF) is widely used because of its superior display features, cross-platform compatibility, and the protection it offers to intellectual property. Yet documents in this container may be completely inaccessible to a substantial portion of the potential e-Learning audience, simply because the documents were not properly prepared. Here's what to do to avoid this.
Do you enjoy a challenge? The author had to determine if specific harvest skills had been transferred from a workshop to lead farmers and then on to individual farmers in remote villages in Rwanda. The key to his success was the “clicker”: an audience response system. Not working in Rwanda? His tips will help you use clickers to lead focus groups in the corporate world, too.
As we find new and better ways to create, store, manage, and, especially, share knowledge, what else changes? Perhaps it is that our key role moves from production to decision-making and advice on content and learning quality. Marc reflects on keeping our eyes on what really matters.