教学文库网 - 权威文档分享云平台
您的当前位置:首页 > 文库大全 > 高中教育 >

A Novel Approach to On-Line Handwriting Recognition Based on

来源:网络收集 时间:2026-07-03
导读: ANovelApproachtoOn-LineHandwritingRecognitionBasedon BidirectionalLongShort-TermMemoryNetworks MarcusLiwicki1 1 AlexGraves2 JurgenSchmidhuber2,3 HorstBunke1 Inst.ofComputerScienceandAppliedMathematics, UniversityofBern,Neubruckstr.10,3012B

ANovelApproachtoOn-LineHandwritingRecognitionBasedon

BidirectionalLongShort-TermMemoryNetworks

MarcusLiwicki1

1

AlexGraves2

J¨urgenSchmidhuber2,3

HorstBunke1

Inst.ofComputerScienceandAppliedMathematics,

UniversityofBern,Neubr¨uckstr.10,3012Bern,Switzerland2

IDSIA,Galleria2,6928Manno-Lugano,Switzerland

3

TUMunich,Boltzmannstr.3,85748Garching,Munich,GermanyAbstract

Inthispaperweintroduceanewconnectionistapproachtoon-linehandwritingrecognitionandaddressinpartic-ulartheproblemofrecognizinghandwrittenwhiteboardnotes.Theapproachusesabidirectionalrecurrentneu-ralnetworkwiththelongshort-termmemoryarchitecture.Weusearecentlyintroducedobjectivefunction,knownasConnectionistTemporalClassi cation(CTC),thatdirectlytrainsthenetworktolabelunsegmentedsequencedata.Ournewsystemachievesawordrecognitionrateof74.0%,comparedwith65.4%usingapreviouslydevelopedHMM-basedrecognitionsystem.

1.Introduction

Althoughtheproblemofhandwritingrecognitionhasbeenconsideredformorethan30years[1,12,16],therearestillmanyopenissues,especiallyinthetaskofun-constrainedhandwrittensentencerecognition.Handwritingrecognitionistraditionallypidedintoon-lineandoff-linerecognition.Inon-linerecognitionatimeorderedsequenceofcoordinates,representingthemovementofthetipofpen,iscaptured,whileintheoff-linemodeonlytheimageofthetextisavailable.

Inthispaperweconsideranon-linerecognitionprob-lem,namelytherecognitionofnoteswrittenonawhite-board.Thisisarelativelynewtask.Aspeoplestand,ratherthansit,duringwritingandthearmdoesnotrestonata-ble,handwritingrenderedonawhiteboardisdifferentfromhandwritingproducedwithapenonawritingtablet.De-spitesomeadditionaldif culty,thewhiteboardmodalityisimportantinseveralapplications,suchasthedocumenta-tionoflecturesormeetings.Intheparticularapplication

underlyingthispaperweaimatdevelopingahandwritingrecognitionsystemtobeusedinasmartmeetingroomsce-nario[17],inourcasethesmartmeetingroomdevelopedintheIM2project[11].Smartmeetingroomsusuallyhavemultipleacquisitiondevices,suchasmicrophones,cam-eras,electronictablets,andawhiteboard.Inordertoallowforindexingandbrowsing[18],automatictranscriptionoftherecordeddataisneeded.

Inthispaper,weintroduceanovelapproachtoon-linehandwritingrecognition,usingasinglerecurrentneuralnet-work(RNN)totranscribethedata.ThekeyinnovationisarecentlyintroducedRNNobjectivefunctionknownasCon-nectionistTemporalClassi cation(CTC)[5].Whereaspre-viousobjectivefunctionsonlytrainRNNstolabelinpid-ualdatapointswithinasequence,CTCtrainsthenetworktolabeltheentireinputsequenceatonce.Thismeansthenet-workcanbetrainedwithunsegmentedinputdata(animpor-tantrequirementforon-linehandwriting,wherecorrectseg-mentationofinpiduallettersisoftendif culttoachieve),andthe nallabelsequence(inthiscase,thecharacterleveltranscription)isgivendirectlybythenetworkoutput.

InourwriterindependentexperimentsontheIAM-OnDB[9]1,awordrecognitionrateofupto74.0%hasbeenachieved.Theseresultsaresigni cantlyhigherthenthosefrompreviousexperimentswithanHMM-basedsys-tem[10].

Therestofthepaperisorganizedasfollows.Section2givesanoverviewoftheproposedsystem.InSection3themainstepsforpreprocessingthedataandextractingthefeaturesarepresented.Section4introducesthenewclassi cationapproachforhandwritingrecognition.Exper-imentsandresultsarepresentedinSection5,and nallySection6drawssomeconclusionsandgivesanoutlooktofuturework.

1http://www.iam.unibe.ch/ fki/iamondb/

Figure1.Illustrationoftherecording

2.SystemOverview

TheeBeaminterface2isusedforrecordingthehandwrit-ing.Itallowstheusertowriteonawhiteboardwithanor-malpeninaspecialcasing,whichsendsinfraredsignalstoatriangularreceivermountedinoneofthecornersofthewhiteboard.Theacquisitioninterfaceoutputsasequenceof(x,y)-coordinatesrepresentingthelocationofthetipofthepentogetherwithatimestampforeachlocation.Theframerateoftherecordingsvariesfrom30to70framespersecond.AnillustrationisshowninFig.1.

Thesystemdescribedinthispaperconsistsofthreemainmodules:theon-linepreprocessing,wherenoiseintherawdataisreducedandthetextlineisnormalizedwithre-specttoskew,slant,widthandheight;thefeatureextrac-tion,wherethesequenceofpointsistransformedintoase-quenceoffeaturevectors;andtherecognition,whereanASCIItranscriptionofthehandwritingisgenerated.

3.Preprocessing

Beforefeatureextractioncanbeapplied,therecordeddatahastobenormalized.Thisisaveryimportantstepinhandwritingrecognitionsystems,becausethestylesofthewritersdifferwithrespecttoskew,slant,heightandwidthofthecharacters.Ifwedonotapplyanypreprocessingandusetherawfeatures,therecognitionrateissigni cantlylower.Thepreprocessingstepsappliedinthecurrentsystemhavebeenintroducedin[10],butforthepurposeofcomplete-ness,wegiveashortoverviewbelow.

Therecordedon-linedatausuallycontainnoisypointsandgapswithinstrokes,whicharecausedbylossofdata.Hence,weapplysomenoise lteringoperations rst.Thecleanedtextdataisthenautomaticallypidedintolinesus-ingsomesimpleheuristics.Astheskewoftensigni cantly

2eBeam

SystembyLuidia,Inc.-

http://doc.guandang.net

Figure2.Splittingatextlineintosubpartsandskew

correction

Figure3.Baselineandcorpuslineofanex-amplepartofatextline

varieswithinthesameline,wesplitlinesintosubparts.AnexampleofsplittingisshowninFig.2(upperline).

Nextthesubpartsarecorrectedwithrespecttotheirskewusingalinearregression.ThisprocessisillustratedinFig.2withtheresultingtextlineshowninthelowerpart.Forslantnormalization,wecomputethehistogramoverallanglesbetweenthelinesconnectingtwosuccessivepointsofthetrajectoryandthehorizontalline[8].Subsequently,thehis-togramisprocessedtorecovertheskewangle.Aftertheseoperations,weremovedelayedstokes,e.g.thecrossingofa“t”orthedotofan“i”,usingsimpleheuristics.Thenextim-portantstepisthecomputationofthebaselineandthecor-puslinebycomputingtwolinearregressionlinesthroughtheminimaandmaximaofthey-coordinatesofthestrokes.Figure3illustratestheestimatedbaselineandthec …… 此处隐藏:15233字,全部文档内容请下载后查看。喜欢就下载吧 ……

A Novel Approach to On-Line Handwriting Recognition Based on.doc 将本文的Word文档下载到电脑,方便复制、编辑、收藏和打印
本文链接:https://www.jiaowen.net/wenku/1732669.html(转载请注明文章来源)
Copyright © 2020-2025 教文网 版权所有
声明 :本网站尊重并保护知识产权,根据《信息网络传播权保护条例》,如果我们转载的作品侵犯了您的权利,请在一个月内通知我们,我们会及时删除。
客服QQ:78024566 邮箱:78024566@qq.com
苏ICP备19068818号-2
Top
× 游客快捷下载通道(下载后可以自由复制和排版)
VIP包月下载
特价:29 元/月 原价:99元
低至 0.3 元/份 每月下载150
全站内容免费自由复制
VIP包月下载
特价:29 元/月 原价:99元
低至 0.3 元/份 每月下载150
全站内容免费自由复制
注:下载文档有可能出现无法下载或内容有问题,请联系客服协助您处理。
× 常见问题(客服时间:周一到周五 9:30-18:00)