A Novel Approach to On-Line Handwriting Recognition Based on
ANovelApproachtoOn-LineHandwritingRecognitionBasedon
BidirectionalLongShort-TermMemoryNetworks
MarcusLiwicki1
1
AlexGraves2
J¨urgenSchmidhuber2,3
HorstBunke1
Inst.ofComputerScienceandAppliedMathematics,
UniversityofBern,Neubr¨uckstr.10,3012Bern,Switzerland2
IDSIA,Galleria2,6928Manno-Lugano,Switzerland
3
TUMunich,Boltzmannstr.3,85748Garching,Munich,GermanyAbstract
Inthispaperweintroduceanewconnectionistapproachtoon-linehandwritingrecognitionandaddressinpartic-ulartheproblemofrecognizinghandwrittenwhiteboardnotes.Theapproachusesabidirectionalrecurrentneu-ralnetworkwiththelongshort-termmemoryarchitecture.Weusearecentlyintroducedobjectivefunction,knownasConnectionistTemporalClassi cation(CTC),thatdirectlytrainsthenetworktolabelunsegmentedsequencedata.Ournewsystemachievesawordrecognitionrateof74.0%,comparedwith65.4%usingapreviouslydevelopedHMM-basedrecognitionsystem.
1.Introduction
Althoughtheproblemofhandwritingrecognitionhasbeenconsideredformorethan30years[1,12,16],therearestillmanyopenissues,especiallyinthetaskofun-constrainedhandwrittensentencerecognition.Handwritingrecognitionistraditionallypidedintoon-lineandoff-linerecognition.Inon-linerecognitionatimeorderedsequenceofcoordinates,representingthemovementofthetipofpen,iscaptured,whileintheoff-linemodeonlytheimageofthetextisavailable.
Inthispaperweconsideranon-linerecognitionprob-lem,namelytherecognitionofnoteswrittenonawhite-board.Thisisarelativelynewtask.Aspeoplestand,ratherthansit,duringwritingandthearmdoesnotrestonata-ble,handwritingrenderedonawhiteboardisdifferentfromhandwritingproducedwithapenonawritingtablet.De-spitesomeadditionaldif culty,thewhiteboardmodalityisimportantinseveralapplications,suchasthedocumenta-tionoflecturesormeetings.Intheparticularapplication
underlyingthispaperweaimatdevelopingahandwritingrecognitionsystemtobeusedinasmartmeetingroomsce-nario[17],inourcasethesmartmeetingroomdevelopedintheIM2project[11].Smartmeetingroomsusuallyhavemultipleacquisitiondevices,suchasmicrophones,cam-eras,electronictablets,andawhiteboard.Inordertoallowforindexingandbrowsing[18],automatictranscriptionoftherecordeddataisneeded.
Inthispaper,weintroduceanovelapproachtoon-linehandwritingrecognition,usingasinglerecurrentneuralnet-work(RNN)totranscribethedata.ThekeyinnovationisarecentlyintroducedRNNobjectivefunctionknownasCon-nectionistTemporalClassi cation(CTC)[5].Whereaspre-viousobjectivefunctionsonlytrainRNNstolabelinpid-ualdatapointswithinasequence,CTCtrainsthenetworktolabeltheentireinputsequenceatonce.Thismeansthenet-workcanbetrainedwithunsegmentedinputdata(animpor-tantrequirementforon-linehandwriting,wherecorrectseg-mentationofinpiduallettersisoftendif culttoachieve),andthe nallabelsequence(inthiscase,thecharacterleveltranscription)isgivendirectlybythenetworkoutput.
InourwriterindependentexperimentsontheIAM-OnDB[9]1,awordrecognitionrateofupto74.0%hasbeenachieved.Theseresultsaresigni cantlyhigherthenthosefrompreviousexperimentswithanHMM-basedsys-tem[10].
Therestofthepaperisorganizedasfollows.Section2givesanoverviewoftheproposedsystem.InSection3themainstepsforpreprocessingthedataandextractingthefeaturesarepresented.Section4introducesthenewclassi cationapproachforhandwritingrecognition.Exper-imentsandresultsarepresentedinSection5,and nallySection6drawssomeconclusionsandgivesanoutlooktofuturework.
1http://www.iam.unibe.ch/ fki/iamondb/
Figure1.Illustrationoftherecording
2.SystemOverview
TheeBeaminterface2isusedforrecordingthehandwrit-ing.Itallowstheusertowriteonawhiteboardwithanor-malpeninaspecialcasing,whichsendsinfraredsignalstoatriangularreceivermountedinoneofthecornersofthewhiteboard.Theacquisitioninterfaceoutputsasequenceof(x,y)-coordinatesrepresentingthelocationofthetipofthepentogetherwithatimestampforeachlocation.Theframerateoftherecordingsvariesfrom30to70framespersecond.AnillustrationisshowninFig.1.
Thesystemdescribedinthispaperconsistsofthreemainmodules:theon-linepreprocessing,wherenoiseintherawdataisreducedandthetextlineisnormalizedwithre-specttoskew,slant,widthandheight;thefeatureextrac-tion,wherethesequenceofpointsistransformedintoase-quenceoffeaturevectors;andtherecognition,whereanASCIItranscriptionofthehandwritingisgenerated.
3.Preprocessing
Beforefeatureextractioncanbeapplied,therecordeddatahastobenormalized.Thisisaveryimportantstepinhandwritingrecognitionsystems,becausethestylesofthewritersdifferwithrespecttoskew,slant,heightandwidthofthecharacters.Ifwedonotapplyanypreprocessingandusetherawfeatures,therecognitionrateissigni cantlylower.Thepreprocessingstepsappliedinthecurrentsystemhavebeenintroducedin[10],butforthepurposeofcomplete-ness,wegiveashortoverviewbelow.
Therecordedon-linedatausuallycontainnoisypointsandgapswithinstrokes,whicharecausedbylossofdata.Hence,weapplysomenoise lteringoperations rst.Thecleanedtextdataisthenautomaticallypidedintolinesus-ingsomesimpleheuristics.Astheskewoftensigni cantly
2eBeam
SystembyLuidia,Inc.-
http://doc.guandang.net
Figure2.Splittingatextlineintosubpartsandskew
correction
Figure3.Baselineandcorpuslineofanex-amplepartofatextline
varieswithinthesameline,wesplitlinesintosubparts.AnexampleofsplittingisshowninFig.2(upperline).
Nextthesubpartsarecorrectedwithrespecttotheirskewusingalinearregression.ThisprocessisillustratedinFig.2withtheresultingtextlineshowninthelowerpart.Forslantnormalization,wecomputethehistogramoverallanglesbetweenthelinesconnectingtwosuccessivepointsofthetrajectoryandthehorizontalline[8].Subsequently,thehis-togramisprocessedtorecovertheskewangle.Aftertheseoperations,weremovedelayedstokes,e.g.thecrossingofa“t”orthedotofan“i”,usingsimpleheuristics.Thenextim-portantstepisthecomputationofthebaselineandthecor-puslinebycomputingtwolinearregressionlinesthroughtheminimaandmaximaofthey-coordinatesofthestrokes.Figure3illustratestheestimatedbaselineandthec …… 此处隐藏:15233字,全部文档内容请下载后查看。喜欢就下载吧 ……
相关推荐:
- [高中教育]电子线路高频非线性部分2.1
- [高中教育]中班美术活动——我的小手
- [高中教育]常用三极管参数大全
- [高中教育]计算机常见故障及解决办法
- [高中教育]风机基础环水平度控制方法探讨
- [高中教育]机械安全工程(专升本)阶段性作业3
- [高中教育]2009年安徽省高考语文考试说明刍议
- [高中教育]unit5 let's eat公开课教案设
- [高中教育]计算机网络原理课后习题答案
- [高中教育]2016-2022年中国新能源市场研究与投资
- [高中教育]2015-2020年中国会议行业市场评估及投
- [高中教育]经销商大会峰会主持人串词开场白
- [高中教育]2014新版北师大数学三年级上册小熊购物
- [高中教育]七年级第一学期体育与健康全套教案
- [高中教育]第三章:国际金融市场
- [高中教育]六年级下册数学单元测试-2.比例 北师大
- [高中教育]2016年上海海事大学法学院624刑法之《
- [高中教育]中国碳化钙产业竞争现状及未来五年投资
- [高中教育]网络时代,我们怎么玩
- [高中教育]圆锥曲线——高中数学基础知识与典型例
- 高集医院世界艾滋病宣传日活动方案
- 苏教版六年级英语上册期末试卷含答案
- 全民枪战生化英雄模式幽灵怎么玩 生化
- 灿烂的宋元文化一导学案
- 第2章货币资金与应收款项
- 北师大版八年级下册数学第三章《分式》
- 浅析高分子材料成型加工技术
- 华南理工大学2013年度共青团先进集体及
- 教师资格科目二小学教案模板(共合集)
- 工程扩建可研报告
- 中华人民共和国海事局2014年度招录公务
- 提高农村小学生作文能力的教学尝试
- 徒手心肺复苏术操作步骤
- 毛概试题库7-15章
- 2014-2015学年度(上)初中班主任工作计
- 企业驾驶员安全生产责任书
- 第07章 不等式测试题-2016年高考文科数
- 医疗器械经营企业工作程序
- 考研英语必背36篇_彩版_精华
- 初中9月13-15假期作业 (1)




