英语教学中的测试与评价方法课件.ppt

上传人:小飞机 文档编号:3259585 上传时间:2023-03-12 格式:PPT 页数:83 大小:116KB
返回 下载 相关 举报
英语教学中的测试与评价方法课件.ppt_第1页
第1页 / 共83页
英语教学中的测试与评价方法课件.ppt_第2页
第2页 / 共83页
英语教学中的测试与评价方法课件.ppt_第3页
第3页 / 共83页
英语教学中的测试与评价方法课件.ppt_第4页
第4页 / 共83页
英语教学中的测试与评价方法课件.ppt_第5页
第5页 / 共83页
点击查看更多>>
资源描述

《英语教学中的测试与评价方法课件.ppt》由会员分享,可在线阅读,更多相关《英语教学中的测试与评价方法课件.ppt(83页珍藏版)》请在三一办公上搜索。

1、Testing&Assessment in ELT英语教学中的测试与评价方法,Outline,1.A sketchy history2.Types of language tests 3.Testing techniques 4.Criteria for good language tests 5.Constructing multiple-choice questions 6.Critical discussion,1.History of Langauge Testing and Assessment,A Sketchy History,1.1 Pre-scientific Stage前科

2、学阶段1.2 Psychometric-Structuralist Testing Stage心理测量学-结构主义测试阶段1.3 Integrative-Sociolinguistic Testing Stage 综合-社会语言学测试阶段1.4 Pragmatic and Communicative Testing Stage语用交际测试阶段,1.1 Pre-scientific Stage,Before the 1940sGrammar-translation method 语法翻译法Traditional testing approach,Traditional Testing Appro

3、ach,What:grammatical rules,word formation,word usageHow:written test,no oral testQuestion type:subjective,like translation,composition,written questions,1.2 Psychometric-Structuralist Testing心理测量-结构主义测试,Since the 1950sTheoretical guidance FeaturesAudiolingual method 听说法 Discrete-point testing approa

4、ch(分项测验/分立式测验),理论基础:语言学中的结构主义语言观:语言=语音+词汇+句式+语法,语言能力可以分解为具体的部分来考查。心理学中的行为主义教育论:刺激反应式学习方式,反复练习。心理测量学:依据一定的心理学理论,运用一定的操作程序,把人的知识、能力、性格、态度等心理特性和行为进行量化。,Emphasizing reliability(信度)and validity(效度)Emphasizing objective and accurate assessment,Objective questions dominate,particularly multiple choices.Ana

5、lyzing testing results statistically Enjoying a high reliability 信度高,Discrete-point Testing,A question only assesses one language pointTesting:conducted at different levels of language structure 语言层面Language proficiency:being assessed from the aspects of listening,speaking,reading and writing 技能层面,1

6、.3 Integrative-Sociolinguistic Testing 综合-社会语言学测试,Since the mid-1970s(动态语言观)Integrative skill tests 综合技能测试Assess a learners ability to use many bits at the same time.Question types:cloze/composition/oral interview,etc.,1.4 Pragmatic&Communicative Testing Stage语用交际测试阶段,From the 1980s(功能语言观)Communicat

7、ive approachFrom language usage to language usePragmatic approach,Integrity of language/whole languageAssessing with tasksAccuracy,fluency and appropriatenessCommunicative competence,2.Types of Language Tests,Types of Language Tests,Formative 标准参照性和常模参照性Tests Classified According to Testing Purposes

8、Discrete-point&Integrative Tests 分立式和综合测试High stakes&Low-stakes Tests 高风险和低风险,2.1 Formative&Summative,Formative Assessment:Being carried out throughtout the course;Diagnostic purpose;Assessment for learning;Teacher or learner-initiated,Summative Assessment:Being carried out at the end of a course;Gr

9、ading purpose,assign a course grade;Assessment of learning;Teacher-initiated,2.2 Criterion-referenced&Norm-referenced,Criterion-referenced Assessment:A way of measuring candidates against defined(and objective)criteria;Relatively consistentBeing used to establish a persons competence;Examples:Drivin

10、g tests,IELTS,TEM,etc.,Norm-referenced Assessment:A way of comparing candidates to identify whether the test taker performed better or worse than other test takers;Varying from year to year;Being used for selection;Examples:CEE(gaokao),TOEFL,2.3 Objective&Subjective,Objective Assessment:A single cor

11、rect answer;Objective scoring,no judgment on the part of the scorer.Examples:true/false,multiple choice,matching questions.,Subjective Assessment:More than one way of expressing the correct answer;Subective scoring,calling for judgment on the part of the scorer.Examples:extended-response questions a

12、nd essays.,2.4 Tests Classified According to Testing Purposes,Proficiency Test 水平测试、能力测试Achievement Test 成绩测试、学业测试Placement Test 分级测试、分班测试Aptitude Test 能力倾向测试、学能测试Diagnostic Test 诊断测试,Proficiency Test,Measuring language proficiencyThe content 考试内容:Not based on the content of a language course which

13、people taking the test may have followed.It is based on a specification of what candidates have to be able to do in the language in order to be considered proficient.Examples:SAT,ACT,CEE,IELTS,PSC,PETS,BEC,Achievement Test,Examining how successful a student,a teacher,or a syllabus,or a method is.Bei

14、ng closely linked to the course material used in class.Final achievement tests are those administered at the end of a course of study.Progress achievement tests are intended to measure the progress that students are making.,Placement Test,To identify the appropriate stage of language course accordin

15、g to students ability.To assign students to the appropriate level of classes they should take.,Aptitude Test,Measuring the extent to which an individual possesses specific language learning abilityBeing usually used for selection and diagnosis and for prediction of language learning success.Componen

16、ts of language aptitude:phonetic coding ability(sound discrimination and memory),grammatical sensitivity(recognizing the grammatical function of words),rote learning ability for new sound and meaning inductive learning ability for language patterns,Diagnostic Test,To show students strengths and weak

17、nesses.,2.5 Discrete-point&Integrative,Discrete-point Test:multiple-choice questionsIntegrative Test:Dictation,translation,composition,etc.When he saw his mother,the little boy stopped _.A.crying B.cry C.to cry D.cried One day,the wife of a Chinese king sat watching a worm as it ate some mulberry le

18、aves.Soon it stopped _.Then,as it slowly turned its head from side _ side,a very fine thread came out of its _.It wrapped the thread around and around itself until it was shut _ a little cocoon.,2.6 High-&Low-stakes Tests,A relative conceptHigh-stakes:a test with important consequences for the test

19、taker.Examples:CEE,TEMLow-stakes:End-term Exam,测试种类总结,分类标准 测试类别学习阶段不同 形成性测试,终结性测试评分方式 客观性测试,主观性测试分数解释参照标准不同 标准参照测试,常模参照测试测试目的 水平测试/成绩测试/学能测试/分班测试/诊断测试测试语言技能的分合 分立式测试,综合式测试测试对用户影响的大小 低风险测试,高风险测试,3.Testing Techniques,Testing Techniques,Multiple-choice 多项选择题(单选或复选)Gap-filling 填充题 Matching 配对题Transforma

20、tion 句型转换题Cloze 完形填空题(填充或选择题)True/False 是非题,判断正误题 Error Correction 改错题Dictation 听写Open Questions 开放式问题Short Answer Qs简答题Essay Writing写作Translation翻译,3.1 Multiple-choice Questions,An example:Noise made by a snake is called _.A mew B bark C hiss D quackStem+Choices/Alternatives(the correct choice and

21、distractors),Advantages:Efficiency NeutralityUniversalityResponse clarityDisadvantages:AmbiguityNo partial creditguessingTime-consuming for item consrtuction,3.2 Gap-filling,An example:Eating too much fast food is not _.A hint:a root word(health),the first letter of the word(h_).,Advantages:Testing

22、grammar or vocabularyEssy to gradeRelatively easy to construct.Disadvantages:Ambiguity:more than one possible correct answers.Parents owe their children a set of solid values _ which to build their lives.(around/on),3.3 Matching,Match the word on the left to the word with the opposite meaning.fat ol

23、d young tallactive thinshort quietThis could be individual words,words and definitions,pictures to words etc.,Advantages:Testing vocabularyEasy to construct and gradeDisadvantages:Students may get the right answers without knowing all the words.,3.4 Transformation,This is an interesting book.(转为感叹句)

24、What an interesting book this is!I went to bed after I finished my homework.I didnt go to bed until I finished my homework.It was not until I finished my homework that I went to bed.Not until I finished my homework did I go to bed.A student has to rewrite a sentence based on an instruction or a key

25、word given.,Advantages:Testing grammar and understanding of formFairly easy to gradeDisadvantages:A student may rewrite sentences to a formula.,3.5 Cloze 完形填空,Complete the text by adding a word to each gap.One day,the wife of a Chinese king sat watching a worm as it ate some mulberry leaves.Soon it

26、stopped _.Then,as it slowly turned its head from side _ side,a very fine thread came out of its _.It wrapped the thread around and around itself until it was shut _ a little cocoon.,Advantages:Much more integrative;Effective for testing grammar,vocabulary and intensive reading;A good indicator of ov

27、erall language proficiency.Disadvantages:There may have multiple correct answers.,3.6 True/False,Decide if the statement is true or false.England won the world cup in 1966.T/FThe candidate must decide if a statement is true or false.,Advantages:Test listening&reading comprehensionEasy to grade Disad

28、vantages:Guessing can result in many correct answers.,3.7 Error Correction,Find the mistake in the sentence and correct them.He dont know why Tom refused to speak to him.Errors must be found and corrected in a sentence or passage.It could be an extra word,words missed,mistakes with verb forms,etc.,A

29、dvantages:Useful for testing grammar and vocabulary as well as reading and listening comprehension.Disadvantages:Some errors can be corrected in more than one way.,The Internet is playing a important part in 56 our daily life.On the net,we can learn about 57 news both home and abroad and some other

30、58 informations as well.We can also make phone calls,59 send messages by e-mails,go to net schools,and 60 learn foreign languages by ourselves.Beside,we 61 can enjoy music,watch sports matches,and play the 62 chess or cards.The net even help us do shopping,63 make a chat with others and make friends

31、 with them.64 In a word,the Internet has made our life more easier.65,3.8 Dictation,One of the oldest techniques known for the teaching and testing of foreign languages;Being closely related to grammar translation method;Testing spelling,listening and recognition.,Standard dictation 标准听写Partial dict

32、ation 部分听写Dictation with competing noise 干扰听写 Dictation-composition 听写作文Elicited imitation 复述听写,3.9 Open Questions,Answer the questions.Why did John steal the money?Here the candidate must answer simple questions after reading or listening or as part of an oral interview.,Advantages:Useful for testi

33、ng any of the four skills,but less useful for testing grammar or vocabulary.Disadvantages:More difficult and time consuming to gradeAn element of subjectivity involved in judging how complete the answer is.,3.10 Short Answer Questions,Requiring the learner to write a word,phrase,number or symbol;oft

34、en based on a passage;Sometines with a limit of words in one answer(3-5 words),3.11 Essay Writing,Being widely usedOften being criticized for their lack of objectivity.Requirements for writing:,用词正确语句通顺结构合理内容符合要求文体得当(措辞和行文 正式-非正式)The two new senators have proved themselves exceptionally able(guys/me

35、n).Writing a letter to a close friend or writing a job application letter,Types of Essay Writing,单句写作He doesnt like dogs as much as his wife does.His wife likes dogs better than him.组句成章()They are students.()Mr and Mrs White have two sons.()Now Ben and Jerry are playing football with their father.()

36、Alice is only three.()The boys have a sister,Alice.()Their names are Ben And Jerry.()Alice is sitting on the grass with her mother.,Advantages:InegrativeDisadvantages:Difficult to score reliably and time-consuming to grade Often affected by handwriting,presence or spelling errors,grammar used the su

37、bjective judgments of the grader.Training of graders:time-consuming and needs to be repeated at frequent intervals throughout the grading.,3.12 Translation,Used method of testing in both classroom assessment and formal test.Criteria of good translation vary,4.General Criteria of Language Testing,4.1

38、 Practicality,Factors to consider:Financial limitations;Time constraints;Ease of administration;Scoring.,A test that is prohibitively expensive is impractical.A test that takes a students ten hours to complete is impractical.A test that requires individual one-to-one proctoring is impractical.A test

39、 that takes a few minutes for a student to take and several hours for an examiner to evaluate is impractical.A test that can be scored only by computer is impractical if the test takes place a thousand miles away from the nearest computer.,3.2 Reliability 信度,A consistent measure of performance.可靠性/稳

40、定性Sources of unreliability:the test itself or the scoring of the test,that is,test reliability and rater reliability.Test reliability:the consistency of results if giving the same test to the same subject on two different occasions.Scoring or rater reliability:the consistency of scoring by two or mo

41、re scorers or by the same scorer on different occasions.,3.3 Validity 效度,The degree to which the test actually measures what is intended to measure;Test what is important to test,not what is easy to test;The most complex and important criterion of a good test.,Types of Validity,Content validity 内容效度

42、Construct validity 构念效度Face validity 表面效度Not what a test actually measures,but what it superficially appears to measureCriterion validity 标准效度The extent to which the tests are related to concrete criteria in the real world,The extent to which a test is relevant and representative of what it is used

43、to measure.内容与测试目标是否有关测试内容是否具有代表性测试内容是否适合测试对象,The degree to which a test measures what it claims to be measuring based on a theoretical guidance试题是否以有效的语言观为依据;“结构或构念”指整个考试的理论基础。,How to improve validity of a test:Specification of what is to be measured based on course syllubus;Construction of the tes

44、t items;Review by experienced teachers and experts,4.4 Backwash 反拨作用,Backwash:the effect of testing on teaching and learning.Backwash can be harmful(teaching to the test)or beneficial(diagnostic and promoting improvement).,4.5 Difficulty and Discrimination,Index of difficulty 难度系数Discrimination 区分度(

45、区分考生能力的程度),5.Developing Multiple-Choice Questions,What to measure,To measure knowledge recall as well as higher order thinking.Four types of content(facts,concepts,principles,and procedures)and five types of cognitive behaviors(recalling,understanding,predicting,evaluating,and problem solving).,Fact

46、ual information,True FalseThe capital of Kentucky is Louisville.Multiple ChoiceWhich city is the capital of Kentucky?A.FrankfortB.LexingtonC.LouisvilleD.Paducah,Higher order thinking,What is likely to happen to mortgage interest rates when interest rates on savings go up?A.IncreaseB.DecreaseC.No cha

47、ngeD.Unpredictable,True/False&Multiple Choice Questions,More time-consuming for the teacher to construct good multiple-choice items than true/false or completion items.The difficulty of finding suitable distractors,which are plausible.Plausible:the distractor must have the potential for being select

48、ed as the correct answer.Two distractors are as effective as three if one of the three is not plausible.,Reading level and reading speed of the students must be considered when constructing the items.To insure that one question or its distractors do not provide clues to the answer of another questio

49、n.,Best answer items(measuring understanding,or interpretation)are usually more difficult than correct answer items.Which one of the following was the most important consideration in locating cities during frontier times in America?A.good farmlandB.access to waterways C.moderate temperature.D.easy t

50、o defend against attack by IndiansMay test the ability to compare and evaluate;or may test knowledge or ability to recall.,A.The stem 题干,Be meaningful and provide a definite problemInclude as much of the item as possible.1.The talk show host can _ the president brilliantly.A take on B take after C t

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 生活休闲 > 在线阅读


备案号:宁ICP备20000045号-2

经营许可证:宁B2-20210002

宁公网安备 64010402000987号