视频监控外文翻译.doc

资源描述

视频监控外文翻译.doc

《视频监控外文翻译.doc》由会员分享，可在线阅读，更多相关《视频监控外文翻译.doc（7页珍藏版）》请在冰点文库上搜索。

视频监控外文翻译.doc

京江学院

JINGJIANGCOLLEGEOFJIANGSUUNIVERSITY

外文文献翻译

学生学号：

3081155033

学生姓名：

缪成鹏

专业班级：

J电子信息工程0802

指导教师姓名：

李正明

指导教师职称：

教授

2012年6月

ASystemforRemoteVideoSurveillanceandMonitoring

ThethrustofCMUresearchundertheDARPAVideoSurveillanceandMonitoring（VSAM）projectiscooperativemulti-sensorsurveillancetosupportbattlefieldawareness.UnderourVSAMIntegratedFeasibilityDemonstration（IFD）contract,wehavedevelopedautomatedvideounderstandingtechnologythatenablesasinglehumanoperatortomonitoractivitiesoveracomplexareausingadistributednetworkofactivevideosensors.Thegoalistoautomaticallycollectanddisseminatereal-timeinformationfromthebattlefieldtoimprovethesituationalawarenessofcommandersandstaff.Othermilitaryandfederallawenforcementapplicationsincludeprovidingperimetersecurityfortroops,monitoringpeacetreatiesorrefugeemovementsfromunmannedairvehicles,providingsecurityforembassiesorairports,andstakingoutsuspecteddrugorterroristhide-outsbycollectingtime-stampedpicturesofeveryoneenteringandexitingthebuilding.

Automatedvideosurveillanceisanimportantresearchareainthecommercialsectoraswell.Technologyhasreachedastagewheremountingcamerastocapturevideoimageryischeap,butfindingavailablehumanresourcestositandwatchthatimageryisexpensive.Surveillancecamerasarealreadyprevalentincommercialestablishments,withcameraoutputbeingrecordedtotapesthatareeitherrewrittenperiodicallyorstoredinvideoarchives.Afteracrimeoccurs–astoreisrobbedoracarisstolen–investigatorscangobackafterthefacttoseewhathappened,butofcoursebythenitistoolate.Whatisneedediscontinuous24-hourmonitoringandanalysisofvideosurveillancedatatoalertsecurityofficerstoaburglaryinprogress,ortoasuspiciousindividualloiteringintheparkinglot,whileoptionsarestillopenforavoidingthecrime.

Keepingtrackofpeople,vehicles,andtheirinteractionsinanurbanorbattlefieldenvironmentisadifficulttask.TheroleofVSAMvideounderstandingtechnologyinachievingthisgoalistoautomatically“parse”peopleandvehiclesfromrawvideo,determinetheirgeolocations,andinsertthemintodynamicscenevisualization.Wehavedevelopedrobustroutinesfordetectingandtrackingmovingobjects.Detectedobjectsareclassifiedintosemanticcategoriessuchashuman,humangroup,car,andtruckusingshapeandcoloranalysis,andtheselabelsareusedtoimprovetrackingusingtemporalconsistencyconstraints.Furtherclassificationofhumanactivity,suchaswalkingandrunning,hasalsobeenachieved.Geolocationsoflabeledentitiesaredeterminedfromtheirimagecoordinatesusingeitherwide-baselinestereofromtwoormoreoverlappingcameraviews,orintersectionofviewingrayswithaterrainmodelfrommonocularviews.Thesecomputedlocationsfeedintoahigherleveltrackingmodulethattasksmultiplesensorswithvariablepan,tiltandzoomtocooperativelyandcontinuouslytrackanobjectthroughthescene.Allresultingobjecthypothesesfromallsensorsaretransmittedassymbolicdatapacketsbacktoacentraloperatorcontrolunit,wheretheyaredisplayedonagraphicaluserinterfacetogiveabroadoverviewofsceneactivities.Thesetechnologieshavebeendemonstratedthroughaseriesofyearlydemos,usingatestbedsystemdevelopedontheurbancampusofCMU.

Detectionofmovingobjectsinvideostreamsisknowntobeasignificant,anddifficult,researchproblem.Asidefromtheintrinsicusefulnessofbeingabletosegmentvideostreamsintomovingandbackgroundcomponents,detectingmovingblobsprovidesafocusofattentionforrecognition,classification,andactivityanalysis,makingtheselaterprocessesmoreefficientsinceonly“moving”pixelsneedbeconsidered.

Therearethreeconventionalapproachestomovingobjectdetection:

temporaldifferencing;backgroundsubtraction;andopticalflow.Temporaldifferencingisveryadaptivetodynamicenvironments,butgenerallydoesapoorjobofextractingallrelevantfeaturepixels.Backgroundsubtractionprovidesthemostcompletefeaturedata,butisextremelysensitivetodynamicscenechangesduetolightingandextraneousevents.Opticalflowcanbeusedtodetectindependentlymovingobjectsinthepresenceofcameramotion;however,mostopticalflowcomputationmethodsarecomputationallycomplex,andcannotbeappliedtofull-framevideostreamsinreal-timewithoutspecializedhardware.

UndertheVSAMprogram,CMUhasdevelopedandimplementedthreemethodsformovingobjectdetectionontheVSAMtestbed.Thefirstisacombinationofadaptivebackgroundsubtractionandthree-framedifferencing.Thishybridalgorithmisveryfast,andsurprisinglyeffective–indeed,itistheprimaryalgorithmusedbythemajorityoftheSPUsintheVSAMsystem.Inaddition,twonewprototypealgorithmshavebeendevelopedtoaddressshortcomingsofthisstandardapproach.First,amechanismformaintainingtemporalobjectlayersisdevelopedtoallowgreaterdisambiguationofmovingobjectsthatstopforawhile,areoccludedbyotherobjects,andthatthenresumemotion.Onelimitationthataffectsboththismethodandthestandardalgorithmisthattheyonlyworkforstaticcameras,orina”stepandstare”modeforpan-tiltcameras.Toovercomethislimitation,asecondextensionhasbeendevelopedtoallowbackgroundsubtractionfromacontinuouslypanningandtiltingcamera.Throughcleveraccumulationofimageevidence,thisalgorithmcanbeimplementedinreal-timeonaconventionalPCplatform.Afourthapproachtomovingobjectdetectionfromamovingairborneplatformhasalsobeendeveloped,underasubcontracttotheSarnoffCorporation.Thisapproachisbasedonimagestabilizationusingspecialvideoprocessinghardware.

ThecurrentVSAMIFDtestbedsystemandsuiteofvideounderstandingtechnologiesaretheendresultofathree-year,evolutionaryprocess.Impetusforthisevolutionwasprovidedbyaseriesofyearlydemonstrations.Thefollowingtablesprovideasuccinctsynopsisoftheprogressmadeduringthelastthreeyearsintheareasofvideounderstandingtechnology,VSAMtestbedarchitecture,sensorcontrolalgorithms,anddegreeofuserinteraction.Althoughtheprogramisovernow,theVSAMIFDtestbedcontinuestoprovideavaluableresourceforthedevelopmentandtestingofnewvideounderstandingcapabilities.Futureworkwillbedirectedtowardsachievingthefollowinggoals:

1.betterunderstandingofhumanmotion,includingsegmentationandtrackingofarticulatedbodyparts;

2.improveddataloggingandretrievalmechanismstosupport24/7systemoperations;

3.bootstrappingfunctionalsitemodelsthroughpassiveobservationofsceneactivities;

4.betterdetectionandclassificationofmulti-agenteventsandactivities;

5.bettercameracontroltoenablesmoothobjecttrackingathighzoom;and

6.acquisitionandselectionof“bestviews”withtheeventualgoalofrecognizingindividualsinthescene.

远程视频监控系统

在美国国防部高级研究计划局，视频监控系统项目下进行的一系列监控装置研究是一项合作性的多层传感监控，用以支持战场决策。

在我们的视频监控综合可行性示范条约下，我们已经研发出自动视频解析技术，使得单个操作员通过动态视频传感器的分布式网络来监测一个复杂区域的一系统活动。

我们的目标是自动收集和传播实时的战场信息，以改善战场指挥人员的战场环境意识。

在其他军事和联邦执法领域的应用包括为部队提供边境安防，通过无人驾驶飞机监控和平条约及难民流动，保证使馆和机场的安全，通过收集建筑物每个进口和出口的印时图片识别可疑毒品和恐怖分子藏匿场所。

自动视频监控在商业领域同样也是一个重要的研究课题。

随着科技的发展，安装摄像头捕捉视频图像已经非常廉价，但是通过人为监视图像的成本则非常高昂。

监视摄像头已经在商业机构中普遍存在，与相机输出记录到磁带或者定期重写或者存储在录像档案。

在犯罪发生后---比如商店被窃或汽车被盗后，再查看当时录像，往往为时已晚。

尽管避免犯罪还有许多其他的选择，但现在需要的是连续24小时的监测和分析数据，由视频监控系统提醒保安人员，及时发现正在进行的盗窃案，或游荡在停车场的可疑人员。

在城市或战场环境中追踪人员、车辆是一项艰巨的任务。

VSAM视频解析技术视频，确定其geolocations，并插入到动态场景可视化。

我们已经开发强有力的例程为发现和跟踪移动的物体。

被测物体分为语义类别，如人力，人力组，汽车和卡车使用形状和颜色分析，这些标签是用来改善跟踪一致性使用时间限制。

进一步分类的人类活动，如散步，跑步，也取得了。

Geolocations标记实体决心从自己的形象坐标使用广泛的基准立体声由两个或两个以上的重叠相机的意见，或查看射线相交的地形模型由单眼意见。

这些计算机的位置饲料进入了更高一级的跟踪模块，任务多传感器变盘，倾斜和缩放，以合作，不断追踪的对象，通过现场。

所有产生的对象假设所有传感器转交作为象征性的数据包返回到一个中央控制单元操作者，他们都显示在图形用户界面提供了广泛概述了现场活动。

这些技术已证明，通过一系列每年演示，使用的试验系统上发展起来的城市校园的债务工具中央结算系统。

检测移动物体的视频流被认为是一个重要和困难，研究问题。

除了固有的作用能够部分进入移动视频流和背景的组成部分，移动块检测提供了一个关注的焦点识别，分类，分析和活动，使这些后来过程更有效率，因为只有“移动”像素需要加以考虑。

有三种常规方法来进行移动物体的检测：

时间差分法;背景减法;和光流法。

时间差分非常适应动态环境，但通常是一个贫穷的工作中提取所有相关的功能像素。

背景减除提供最完整的功能数据，但极为敏感，动态场景的变化，由于灯光和不相干的活动。

光流可以用来检测独立移动的物体，在场的摄像机运动，但大多数的光流计算方法的计算复杂，不能适用于全帧视频流的实时没有专门的硬件。

根据VSAM计划，债务工具中央结算系统制定并实施了三种方法的运动目标检测的VSAM试验。

首先是结合自适应背景减除与三帧差分。

这种混合算法是非常快，令人惊讶的有效的-事实上，它是主要的算法所使用的大多数SPUs在VSAM系统。

此外，两个新的原型已经开发的算法来解决这一缺陷的标准办法。

首先，一个机制，保持颞对象层次开发，使更多的歧义的移动物体，可以有效地阻止了一会儿，是闭塞的其他物体，而且然后恢复动议。

一个限制，影响到该方法和标准算法是他们唯一的工作静态相机，或在“stepand凝视”模式泛倾斜相机。

为了克服这一局限，第二次延长了beendeveloped让背景减法从不断平移和倾斜相机。

通过巧妙的积累形象的证据，该算法可以实现实时的传统PC平台。

第四个办法来探测移动物体从空中移动平台，也得到了发展，根据分包合同的Sarnoff公司。

这种方法是基于图像稳定使用特殊的视频处理硬件。

目前VSAM通用试验系统和一套视频理解技术的最终结果是一项为期三年的，渐进的过程。

推动这一演变提供了一系列每年示威。

下列表格提供了一个简明的大纲方面所取得的进展在过去三年中在视频领域的理解，技术，VSAM试验架构，传感器控制算法，并一定程度的用户交互。

虽然该计划是在现在，VSAM通用试验继续提供宝贵的资源开发和测试新的视频理解能力。

今后的工作将致力于实现以下目标：

1、更好地理解人类的议案，其中包括分割和跟踪阐明身体部位;

2、改善数据记录和检索机制，以支持24/7系统的运作

3、引导功能的网站模式，通过被动观察现场活动;

4、更好的检测和分类Multi-lAgent的事件和活动

5、更好的相机控制，实现了流畅的目标跟踪高变焦;和

6、购置和选择的“最佳意见”的最终目标是承认个人在现场。

展开阅读全文