《大学英语教改口语考试.ppt》由会员分享,可在线阅读,更多相关《大学英语教改口语考试.ppt(45页珍藏版)》请在三一办公上搜索。
1、大学英语教改口语考试,中国科学技术大学 李萌涛,大学英语教改口语考试,一、大学英语教改和口语考试二、口语考试数据及分析三、口语试题库四、口语试题库的信度和效度研究五、口语评分标准,一、大学英语教改和口语考试,大学英语改革改革的目标是要全面提高大学生的英语综合实用能力,特别是强化听说能力。利用英语口语考试的反驳效应,促进口语教学。学生基数太大,采用口语考试系统实施大规模的口语测试成为可能。,中国科学技术大学04级网络多媒体英语教改试点班英语口语考试现场,二、口语考试数据及分析2004年10月,中国科学技术大学2004级英语教学改革试点班学生英语口语摸底考试,实考517人。2005年1月,中国科学
2、技术大学2004级全体本科生英语口语期末考试,实考1815人。2005年6月,中国科学技术大学2004级全体本科生英语口语期末考试,实考1798人。2005年12月 1813人2006年6月 1798人2007年1月 3707人(二个年级),1.对05、06级学生进行问卷调查05级90名学生,06级89名学生 各有51.1、77.5的学生选择重视口语机考各有53.3、49.2的学生支持采用机考06级学生有67.2的学生认为,机考能体现其口语水平的60-80,2.对教师的问卷调查数据分析,收回14份老师问卷13位老师支持机考,认为能反映出学生口语水平92.9%的老师认为,用机考成绩来解释学生口语
3、能力的可信赖程度达到7090%对照机考口语成绩与期末笔试成绩,有70-80的相关度,三、口语试题库,四所院校共同编写:中国科技大学复旦大学苏州大学西南交通大学,多种题型、与教材配套、内容丰富的英语口语试题库系统支持文本、图形、语音和视频等多种媒体,具有主题和级别等试题指标。试题由多所高校优秀英语教师和测试专家开发。,口语试题库的定位,学业考试(Performance Test),所测即所学。主题式(topic-based),有近似主题,难度、要求不同。综合听说教材里所涉及的主题。涵盖全新版和修订版所有主题。部分题型模拟四、六级口语考试形式。,口语考试题型,目前有7个题型,Reading alo
4、ud用于Band 1-2。每套题至少要包括以下45个题型。Reading aloudListening and speakingQuestions and answersComment on English sayings and quotationsDescribing picturesTalking about moviesGroup discussion,Group Discussion,Part 4 Discussion 5 minutes,20 points Each group is composed of three students Student A,Student B an
5、d Student C.Topic:UFOs are alien spacecraft from outer space,The UFO phenomenon may have existed for over a half-century and UFO sightings are becoming more and more frequent around the world.Some people think these objects are alien spacecraft,although there is no conclusive evidence yet.There is n
6、o doubt that unidentified flying objects in the sky do exist.But the question is what are they?Are UFOs alien spacecraft from other planets?,You will have about 5 minutes to discuss on this topic.Each person can only talk for one minute at a time.During the discussion,you may voice your opinion,argu
7、e with each other,or ask somebody to clarify his or her points.Your performance will be judged according to your contribution to the discussion.,Hints:Numerous UFO photos in the world can show UFOs are alien spacecraft.Many planets outside the solar system are similar to the Earth.So there is a poss
8、ibility that aliens do exist.Nearly all photographs are blurry and many have been proved to be forgeries.There is no evidence of intelligent aliens living anywhere in our solar system or in outer space.准备时间:20 seconds答题时间:60 seconds,2006.2.完成第一批48套试卷,即每一级配12套试卷,每年淘汰2套,并可升级更新。2007.2.完成第二批48套试卷,即每一级配1
9、2套试卷,每年淘汰2套,并可升级更新。可以自由组建试题可以增减考试内容,四、口语试题库的信度和效度研究,为什么要口语考试?Assign gradesImprove instructionMotivate students to workProvide feedback to students,如何进行口语考试?TestingObjectiveSubjectiveFormativeSummative,Formative Testing,Using measurement tools to conduct evaluation for the purpose of improving studen
10、t performanceStudent receives feedback of resultsTeacher considers results in planning subsequent instructionGrades are not recorded!,Summative Testing,Using measurement tools to conduct evaluation for the purpose of assigning student gradesStudent receives feedback of resultsTeacher considers resul
11、ts in planning subsequent instructionGrades are recorded,口语试题库分级,分为四级,对应教学内容难度控制参照课本难度参照国内外口语考试难度确定难度上限,略高于四、六级口语考试难度依据教学实践、课堂口语活动,合作研究口语试题库和考试系统,与国内大学和机构合作(出版社、讯飞)与国外大学合作Bath University(UK)对比研究,机考vs面试试题库科学分级、信度、效度等研究人机交互研究,智能(半智能)阅卷学生在机考时的心理变化研究,五、口语评分标准,准则参考性口试(criterion-referenced oral test)常模参考性
12、口试(norm-referenced oral test),Criterion-Referenced Test,Measures student performance against predetermined standardsStudent meets or does not meet the standardCompetition is between the student and the skill,knowledge,or abilityGrade is based on accomplishmentEverybody can earn a passing grade if th
13、ey meet the standard,Norm Referenced Test,Make test intentionally difficultAverage score should be about 50%Strong students should tend to score high and weak students should tend to score lowAward As for highest scores,Fs for lowest scores,Cs for average scores,Characteristics of a Test,ValidityRel
14、iabilityObjectivityDiscrimination(applies to norm-referenced test only)ComprehensivenessScore-Ability,Validity,A valid test measures:what it is intended to measurewhat the teacher intended for the students to learnwhat the teacher actually taughtA valid test is FAIR,Questions about Validity,Does the
15、 test actually measure what you intend it to measure?Did you teach the content and skills that are being tested?Does the test require the student to know or do something other than what you intended and/or taught?Does some aspect of the test prevent the student who may know the material from respond
16、ing correctly?,Reliability,A reliable test provides accurate and consistent results Test reliability can be viewed from two perspectives:Student reliabilityScorer reliability,Student Reliability,Test items are readable and clearInstructions are simple and unambiguousResponses test only knowledge of
17、the subject matter and not test wiseness,reading ability,or other unrelated trait,Scorer Reliability,Items can be scored consistentlySame scorer would produce similar results on repeated evaluationsDifferent scorers would produce similar results if working independently,Objectivity,Objectively writt
18、enitems are reliableitems are validObjectively administeredObjectively Scored,Discrimination,Important ONLY for norm-referenced testingTest separates more knowledgeable students from less knowledgeable studentsDiscriminating test is intended to reward best students and punish weakest studentsIdeal f
19、or using normal curve to interpret score,Normal distribution,On most measures of human behavior,graphing individual results will result in a“bell-shaped,”or normal distribution curveMost individual scores will fall toward the middle(mean)Fewer scores will fall toward the upper and lower ends,Bell-sh
20、aped Curve,Lowest Scores,AverageScores,Highest Scores,口语考试评分,测量出某个考生在整个群体中的相对位置相对公平,不受相貌、肢体语言的干扰可重复阅卷等级的确立(15)综合评分(Holistic marking)分析评分(analytic marking),教师培训,参照标准确定样本教师分工分数调整,1.参照标准,评分采用准则参考性和常模参考性相结合的评分方法,从内容、语音和流利程度三个方面分析评价考生的英语口语水平。,评分卡:内容语音流利度平均分:(1,2,3,4,5,可用小数点,总数相加),2.确定样本,从往年学生录音中选取有代表性的口语样本每个分数段取2个样本25分的样本样本录音,3.教师分工,由系统指派班级给阅卷老师人工调整,分散班级专门指派有经验的老师抽查阅卷,4.分数调整(正态分布),Thank you!,