Introduction to the World Bank eArchives Memory Project.ppt

Uploaded by: 文库蛋蛋多 | Document ID: 2976774 | Uploaded: 2023-03-07 | Format: PPT | Pages: 24 | Size: 2.14 MB


eArchives: Memory of the World Bank Project
Digital Archives: Memory of the World Bank
Arleen Cannata Seed and Jeanne Kramer-Smyth
Shi Qian (施倩), 15 October 2012

Contents
A. Background
B. Methodology
C. Resource Challenge
D. Evaluation of the Scanning Methods (three scanning methods compared)
E. Technology Choices and Challenge
F. Infrastructure Recommendations
G. Benefits of the Project
Appendix: Words & Expressions

A. Background

The World Bank
The World Bank is the common name for the World Bank Group (WBG), established in 1945. It is an international organization whose original mission was to help rebuild countries damaged in the Second World War. Today its task is to help countries overcome poverty by providing low-interest loans, interest-free credits and grants to developing countries, and it plays a unique role in the mission of reducing poverty and raising living standards.

Archive digitization
Archive digitization produces a new form of archival information. Archival resources held on various media are converted into digital archival information, stored in digital form, interconnected over networks and managed by computer systems. The result is an ordered archival information base that can be made available promptly and shared.

The World Bank Access to Information Policy
The policy took effect on 1 July 2010. It makes public all information held by the World Bank that is not on the list of exceptions. The public can therefore obtain more information, including information on projects under preparation and implementation, analytical and advisory activities, and Board proceedings. The policy also sets out clear procedures for disclosing information to the public. Requesters may appeal if they believe their request was improperly or unreasonably denied, or if an exception restricting disclosure of certain information is contrary to the public interest.

Source: Baidu Baike (百度百科), World Bank website

A. Background: Memory of the World Bank Project

1. Comply with the World Bank's Access to Information Policy
2. Make the holdings more accessible to the public

Objective: to transform the Archives from a predominantly paper-based collection of materials into a modern, proactive and innovative archive which actively pushes content out to the public in a manner which engages and informs.

B. Methodology

Do it or not?
1. Make some assumptions
2. Create a concept note: the basic idea of the eArchive
3. Rally support and funds from senior management
4. Prove the concept

How to do it? (after the decision)
1. Find the most efficient and effective scanning method
2. Prepare digitization guidelines
3. Prioritise certain key collections
4. Start with a pilot
5. Review the records against the Access to Information Policy
6. Implement a scanning plan
7. Prepare a marketing and awareness campaign

B. Methodology: Do it or not?

Points shown alongside the four steps:
Protect the original documents; preserve the digital images in the original order; image quality: faithful enough.
Effort required; cost; benefits.
Best practices of digitization from other organizations; mock-up of our proposed approach to posting content online.

B. Methodology: How to do it?

Points shown alongside the steps:
Scanning options: using the World Bank's copy center; using an in-house contractor in our own offices; with our vendor for the Mine.
Prioritisation criteria: relevance to the current work of the World Bank; repeated requests by researchers; topics which the Archives has judged to be exemplary of the new open agenda.
Collections named: records of Robert McNamara, former World Bank President; records on food security.
Preserve both metadata and original order; form the basis for associating the records with other online offerings.

C. Resource Challenge

Human resources: We set up the scanner and the settings and kept a box of materials handy. Anyone with a few hours to spare was put to work scanning.
Funding: An initial USD 100,000 to undertake a pilot; additional funds would be sought to scale up the project.
Policy: Collaborate with the legal division at the World Bank to define a standard copyright and disclaimer clause.
Processes: A complete process workflow to ensure that all the requisite steps would be covered.

C. Resource Challenge: terms used

A2I: Access to Information Policy
Declassified document: a document that is no longer confidential
TIFF: an image file format; it is complex, stores a lot of content and takes up a lot of storage space
QA: quality assessment
OCR: Optical Character Recognition
Compress: reduce file size
Linearize: a technique that lets the first pages of a file appear during online viewing even before the file has fully downloaded
TRIM system: an electronic records management system (HP TRIM, originally developed by TOWER Software)

Some other innovative ideas about digitization

D. Evaluation of the Scanning Methods

Types of material tested: handwritten correspondence; diagrams and maps; onion skin paper; mimeograph pages; black-and-white photographs; documents with embossed stamps.

Methods evaluated: World Bank's Copy Center; vendor at the Mine; World Bank Group Archives staff.

Evaluation criteria:
1. Quality of the images
2. Accuracy of scans
3. Adherence to the requirements
4. Service and equipment availability
5. Cost
6. Workflow and QA processes in place
7. OCR quality

Options as listed on the result slide: World Bank's Copy Center; World Bank Group Archives staff; vendor at the Mine (slide marks: 1, 2, Reject).
Due to resource constraints, the final decision was to use the internal Archives staff to perform the scanning.

E. Technology Choices and Challenge

1. Choose a suitable scanner and accompanying software based on requirements. Requirements: large flatbed, 400 dpi, 24-bit color.
2. Decide the file format for access copies: PDF, compressed to one third of the size without affecting quality, and linearized.
3. Numbering: for example 123456001.pdf, which combines a number assigned automatically by the TRIM system with a sequence number.
4. Files take up a lot of storage space and grow quickly; keep backups to guard against loss.
5. Publish the files: http://web.worldbank.org/WBSITE/EXTERNAL/EXTABOUTUS/EXTARCHIVES/0,contentMDK:20033213menuPK:64319235pagePK:36726piPK:437378theSitePK:29506,00.html

F. Infrastructure Recommendations

Storage of master TIFFs and PDFs
Lightweight approach: hard drive with mirrored backup, with the folder ID number in all names.
Full implementation: document repository based on Documentum. See Repository Business Requirements Document.

Extraction of folder metadata
Lightweight approach: manual as needed. Possible use of a script to generate a skeleton folder structure.
Full implementation: automated synchronization of folder metadata between TRIM and the document repository.

Web publication
Lightweight approach: manual upload of PDFs to the web CMS. Manual creation of links to PDFs from folder lists within the finding aid.
Full implementation: integrated publishing support with push from the document repository to the web platform. See UI Business Requirements Document.

Image QA
Lightweight approach: spot-checking manual QA via review of the PDF.
Full implementation: simple workflow implemented within the document repository.

Other rows listed: Web QA; A2I review; condition assessment before and after digitization; basic metadata; extended metadata.

G. Benefits of the Project

Adherence to the Access to Information Policy
Communicates the context and original order
Provision of easy access to public records
Provision of a best practice
A better informed world citizenry
Increased openness

Our dream is to make accessible and public many high-value records which the World Bank has amassed over the years in every sector, every network, every country, and put this great, deep and broad collection of development knowledge online for the world to access free of charge.

Appendix: Words & Expressions

Digitization
In a records and archives environment, the conversion of analogue materials (such as paper documents) into digital form so that they can be stored and accessed electronically. The process of digitization involves converting characters or images into binary digits to create digital files.
Source: International Research on Permanent Authentic Records in Electronic Systems (InterPARES) 2

Linearize
A linearized PDF file is a PDF file that is structured in a way that allows the first page of the file to be displayed in a user's Web browser before the entire file is downloaded from the Web server. Users might become frustrated and impatient if PDF files are not linearized, because it could take 30 seconds or even longer for a user's Web browser to display a large PDF file that is not linearized.

OCR
OCR (Optical Character Recognition) is the recognition of printed or written text characters by a computer. This involves analysis of the scanned-in image, and then translation of the character image into character codes. OCR is being applied by libraries, businesses and government agencies to create text-searchable files for digital collections.
Source: http://en.wikipedia.org/wiki/Optical_character_recognition

Compression
The process of condensing digital information to reduce its space requirements for storage or transmission.
Lossless compression: the reduction of data with no loss of information, allowing an exact recovery of the original.
Lossy compression: the reduction of data with some loss of information, preventing an exact recovery of the original.

Finding Aids
A finding aid is a tool created by records professionals that provides contextual information about the subjects covered in archival materials. Finding aids usually contain detailed inventories that help researchers locate relevant materials efficiently.
Finding aids often consist of numerous hierarchies. These hierarchies are used to illustrate the relationships of items or files to higher levels of organization. For example, consider the following image:
[Figure: example of a finding-aid hierarchy]

Thank you for your time!
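The slides mention file names such as 123456001.pdf, keeping the folder ID number in all names, and the possible use of a script to generate a skeleton folder structure. The following is a minimal Python sketch of that idea, not the project's actual script. The six-digit folder ID, the three-digit sequence number, the `tiff`/`pdf` subfolders and the function names `scan_filename` and `create_skeleton` are all assumptions made for illustration.

```python
from pathlib import Path
import tempfile


def scan_filename(folder_id: str, sequence: int, ext: str = "pdf") -> str:
    """Build a file name from a folder ID and a zero-padded sequence number.

    Assumption: the sequence number is always written with three digits.
    """
    if not folder_id.isdigit():
        raise ValueError("folder_id must contain digits only")
    if not 1 <= sequence <= 999:
        raise ValueError("sequence must be between 1 and 999")
    return f"{folder_id}{sequence:03d}.{ext}"


def create_skeleton(root: Path, folder_ids: list[str]) -> list[Path]:
    """Create one directory per folder ID, with hypothetical 'tiff' and 'pdf' subfolders."""
    created = []
    for folder_id in folder_ids:
        for kind in ("tiff", "pdf"):
            path = root / folder_id / kind
            path.mkdir(parents=True, exist_ok=True)  # safe to re-run
            created.append(path)
    return created


if __name__ == "__main__":
    # Work in a temporary directory so the example leaves nothing behind.
    with tempfile.TemporaryDirectory() as tmp:
        for path in create_skeleton(Path(tmp), ["123456", "123457"]):
            print(path.relative_to(tmp))
        print(scan_filename("123456", 1))
```

Keeping the folder ID inside every file name means a file that is moved or copied out of its folder can still be traced back to the folder-level metadata.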

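The slides note that the files take up a lot of storage space and grow quickly. A rough worked estimate, using the 400 dpi and 24-bit color requirements from the slides, shows why. The page size (US Letter, 8.5 by 11 inches) and the function name `uncompressed_bytes` are assumptions for illustration, and real master TIFFs may be smaller if they are stored with compression.

```python
def uncompressed_bytes(width_in: float, height_in: float,
                       dpi: int = 400, bits_per_pixel: int = 24) -> int:
    """Raw image size in bytes for a scan of the given physical size."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    return width_px * height_px * bits_per_pixel // 8


# Assumed page size: US Letter, 8.5 x 11 inches.
page = uncompressed_bytes(8.5, 11)
print(f"{page:,} bytes per page")   # 3400 x 4400 pixels x 3 bytes
print(f"{page / 1e6:.1f} MB per page")
```

At roughly 45 MB of raw image data per page, even a single box of records can reach tens of gigabytes, which is consistent with the slides' emphasis on backups and on compressed PDF access copies.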