JP3511278B2

JP3511278B2 - Video anchor setting device

Info

Publication number: JP3511278B2
Application number: JP01498597A
Authority: JP
Inventors: 博信阿倍; 準史郎神田; 浩司脇本; 聡田中
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1996-01-31
Filing date: 1997-01-29
Publication date: 2004-03-29
Anticipated expiration: 2017-01-29
Also published as: JPH10187759A

Description

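[Detailed Description of the Invention]

[0001]
[Technical Field of the Invention] This invention relates to a video anchor setting device. In particular, it relates to a device that takes a moving image as input and sets anchors on targets and the like contained in it.

[0002]
[Prior Art] In conventional hypermedia devices, information retrieval has mainly worked as follows: a logical unit for information linking is defined on text or a still image, related information is linked to that logical unit in advance, and the related information is displayed when the user clicks the logical unit. In recent years, however, various technologies that handle moving images as well as still images have been proposed, as typified by MPEG for the encoding and decoding of moving images. Handling moving images opens up content-creation uses for such hypermedia devices, including CAI, presentations of various kinds, and electronic catalogs. Video editing has so far been used in a fairly limited set of industries such as broadcasting, but it is expected to spread rapidly in the form of personal-computer-based systems for individuals.

[0003] Japanese Unexamined Patent Application Publication No. H4-163589 discloses an image processing device that can set logical units (called nodes in that specification) on a moving image. Whereas a node in a still image can be set simply by specifying a display range, that device focuses on the point that for a moving image it suffices to specify the node's valid duration and its region from the two aspects of (1) display range and (2) time, and makes both specifiable. For (1), the node at a given time is set by indicating, with a mouse or the like, a region enclosing a subject that appears in the moving image. For (2), the node's valid duration is specified by the elapsed time from the start of output of the moving image. The node is therefore uniquely determined by the region and the elapsed time, and related information can be linked to each node. After linking, when the moving image is actually played back and the user clicks a region on the screen with a mouse or the like, the node is identified from the position and time and the related information is displayed.

[0004]
[Problem to Be Solved by the Invention] In the above device, the region of each logical unit was set by hand. Unlike a still image, however, a moving image naturally has many frames, and the position and shape of a subject change from moment to moment.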
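NTSC requires 30 frames per second, so by simple calculation, processing one second of video takes 30 setting operations per logical unit. For example, when creating 5 minutes of content with 5 logical units set per frame, the number of setting operations reaches 45,000.

[0005] The present invention was made in view of this problem. Its object is to provide a device that makes the work of setting logical units (called anchors in this specification) less laborious and simpler, and more specifically an anchor setting device that automatically calculates or automatically sets the anchor information that previously had to be set for every frame.

[0006]
[Means for Solving the Problem] The video anchor setting device of the present invention comprises anchor information setting means that selects reference frames at a predetermined interval from the plurality of frames constituting a moving image and sets anchor information for each of those reference frames, and anchor information calculation means that calculates anchor information for non-reference frames based on the anchor information so set. The device further comprises determination means that determines whether the anchor information set for one or more reference frames can be calculated, within a predetermined error range, from the anchor information set for other reference frames, and reference frame deletion means that changes those reference frames to non-reference frames when they are determined to be calculable.

[0007]
[Embodiments of the Invention] Preferred embodiments of the video hypermedia device of the present invention are described below. The video anchor setting device of the invention is incorporated in this device. With this device, for example, using a video of an aquarium tank as material, interactive CAI software that displays a fish's name, a supplementary explanation, and so on when any swimming fish is clicked can be created easily and efficiently. In the following embodiments "user" mainly means the creator of such content, but may of course also be someone who personally edits videotape they shot themselves.

[0008] Embodiment 1.
In this embodiment, the user explicitly specifies a start frame and an end frame, and anchors are set using these two frames, the first frame and the second frame, as the initial reference frames. A "frame" is a display unit of an image and includes, for example, a picture in MPEG terms. From the anchors set on the reference frames, the device automatically calculates, by interpolation, anchor information such as the position, shape, and color of the anchor region in the other frames. "Anchor information" means, for example, the position and shape of the anchor region and, when the anchor region is displayed explicitly, its color. The invention also works with a single initial reference frame; an example is described later.

[0009] The overall system including the video hypermedia device of this embodiment consists of a personal computer (hereinafter PC) that controls anchor and link setting, and a video playback device that supplies video to the PC. The PC contains a video capture board that captures and digitizes the images supplied by the video playback device. Besides ordinary play, stop, fast-forward, frame advance, and so on, the video playback device can start playback from a specified frame or time. Such playback devices are widely used in broadcasting, but the device need not be of that kind. The functions of the video playback device are controlled through a user interface (hereinafter UI) presented on the PC, for example a "play" button displayed on the screen. When the user clicks the button, the action is transmitted from the PC to the video playback device over a signal cable. The video playback device itself is not an essential component of the video hypermedia device, but it is described here as part of the system. Fig. 1 is a block diagram of a system including the video hypermedia device of this embodiment.

[0010] The device broadly consists of a data operation unit 1 that manipulates data on anchor information and link information, a data storage unit 2 that stores these data, a display unit 3 that displays them in a meaningful form, a user operation unit 4 that accepts and manages user operations, and a video input unit 6 that inputs video played by a video playback device 5.

[0011] (1) Data operation unit 1
Instructions from the user to the data operation unit 1 are given through the UI described later. That is, the internal components below are software modules.

[0012] The frame determination unit 10 determines the start frame and the end frame. In this embodiment, the frames specified by the user become the start and end frames as they are. An example is the first and last frames of the scene showing the tank in the aquarium video mentioned above. If the scene moves to footage of the aquarium entrance, there is no need to set anchors on fish from then on, so the end frame is specified before the scene changes.

[0013] The anchor setting unit 11 actually sets anchors between the start frame and the end frame. For example, to set an anchor on a fish, the user first draws a rectangle enclosing the fish with the mouse in the start frame and registers it as the anchor region. The video is paused at this point. The user then advances the video to the end frame, encloses the same fish again, and registers the anchor region. Between the start and end frames the fish moves and turns, so its position and shape usually change.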
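The shape and position of the anchor region registered in the start frame therefore generally differ from those registered in the end frame. The anchor setting unit 11 includes an anchor information editing unit 110, used for the anchor correction described later, and a text anchor setting unit 111, which sets anchors on text such as character strings.

[0014] The anchor estimation unit 12 performs interpolation based on the first and second anchor information set in the start frame and the end frame, and estimates the position and size of the anchor in any frame (non-reference frame).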
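
As a rough illustration of the interpolation performed by the anchor estimation unit 12, the Python sketch below linearly interpolates a rectangular anchor between two reference frames. The function name `interpolate_anchor`, the tuple layout (x1, y1, x2, y2) for the upper-left and lower-right corners, and the sample coordinates are assumptions made for illustration; the specification does not prescribe an implementation.

```python
def interpolate_anchor(t, t0, a0, t1, a1):
    """Linearly interpolate anchor values between two reference frames.

    a0 and a1 are tuples of numeric anchor values (here x1, y1, x2, y2)
    set at frame numbers t0 and t1; t is the frame to estimate.
    """
    if t1 == t0:
        raise ValueError("reference frames must be distinct")
    dt = t1 - t0
    # A(t) = {A(t1) - A(t0)} * t / dt + {A(t0) * t1 - A(t1) * t0} / dt
    return tuple(((v1 - v0) * t + (v0 * t1 - v1 * t0)) / dt
                 for v0, v1 in zip(a0, a1))


# Hypothetical rectangles registered at frame 1 and frame 100.
start = (10, 20, 50, 40)
end = (109, 20, 149, 40)

# Halfway between the two reference frames the rectangle sits halfway
# between the two registered positions: about (59.5, 20.0, 99.5, 40.0).
mid = interpolate_anchor(50.5, 1, start, 100, end)
```

Because each value is interpolated independently, the same function would also serve for any other numerically expressed anchor attribute, for instance a centroid or a color number.
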
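The estimation performed by the anchor estimation unit 12 is described in detail later.

[0015] The anchor search unit 15 searches for anchors based on the motion characteristics of an anchor or on the anchor's identification information, both of which are part of the anchor information. Identification information is information that helps distinguish an anchor from other anchors, such as the anchor's name, the object it is set on, and the date and time it was set.

[0016] The hyperlink setting unit 13 sets hyperlinks on the configured anchors and creates the data structure for these settings in the form of a table. The hyperlink search unit 14 searches the configured link information.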
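
One way to picture the table built by the hyperlink setting unit 13 and consulted by the hyperlink search unit 14 is as a mapping from anchor IDs to linked resources. The Python sketch below is only an assumption for illustration: the dictionary layout, the `kind` field, and the function names `set_link` and `find_link` are hypothetical, while the sample entries follow the "anchor1.txt" and "anchor2.bmp" example given for Fig. 8.

```python
# Link table: anchor ID -> linked resource (hypothetical layout).
link_table = {}


def set_link(anchor_id, resource, kind):
    """Create or overwrite the link for one anchor."""
    link_table[anchor_id] = {"resource": resource, "kind": kind}


def find_link(anchor_id):
    """Return the link entry for an anchor, or None if nothing is linked."""
    return link_table.get(anchor_id)


set_link("anchor1", "anchor1.txt", "text")
set_link("anchor2", "anchor2.bmp", "image")
```
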
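In the example above, the anchor for a fish and, for instance, text data giving the fish's name are associated by a hyperlink.

[0017] (2) Data storage unit 2
The data storage unit 2 may be a database, or any of various file devices or memory devices. This part is mainly hardware.

[0018] The video data storage unit 20 stores the video data captured and digitized by the video input unit. The anchor information storage unit 21 and the link information storage unit 22 store the configured anchor information and link information, respectively.

[0019] (3) Display unit 3
The display control unit 30 includes a display system program that centrally controls the display of images such as the UI and the video being edited, a display circuit such as a VGA controller, and its driver. The display control unit 30 has a cursor changing unit 300, which changes the appearance of the cursor when the cursor enters an anchor region. Output data from the display control unit 30 is supplied to a display device 31 such as the PC monitor, which produces the intended display.

[0020] (4) User operation unit 4
This unit enables command input by the user and consists of hardware such as a keyboard, a mouse, and other pointing devices, plus a command dispatcher. Example commands are anchor setting, anchor region correction, linking, and link search.

[0021] (5) Video input unit 6
This is hardware corresponding to a video capture board. It has an A/D converter and a frame memory (not shown) and digitizes the input video, then supplies the data to the video data storage unit 20.

[0022] Based on this configuration, the procedures for setting anchors and links are described first, followed by the UI for anchor setting.

[0023] [1] Setting anchors
Fig. 2 is a flowchart of the anchor setting and correction procedure of this embodiment, and Fig. 3 shows a table of configured anchor information. As shown in Fig. 2, the device first initializes the hardware and so on (S21) and reads the video data stored in the video data storage unit 20 (S22). The first frame of the loaded video data is initially shown on the display device 31 as a still image. Next, anchor information already set for that video data is read from the anchor information storage unit 21 (S23). If anchor information exists, its anchor regions are drawn on the screen (hereinafter, the mode in which anchor regions are shown on screen is called "anchor display mode" and the mode in which they are not shown "anchor hidden mode").

[0024] The user then advances the video data to the start frame of the period in which a new anchor is to be set (S24) and, when the desired frame appears, presses the on-screen "Start frame" button to register the start frame. The device now waits for an anchor region to be set in this frame, and the user defines a rectangular region by mouse clicks, for example so as to enclose another fish. Once the rectangle is fixed, the coordinates of its upper-left point (x1, y1) and lower-right point (x2, y2) are obtained and recorded as the fish's anchor information together with the frame number of the start frame (the serial number counted from the first frame of the video) (S25).

[0025] The user then advances the video data again, stops at the desired end frame (S26), and defines a rectangular region enclosing the same fish. This completes anchor setting for the end frame (S27). "anchor1" in Fig. 3 is the anchor ID denoting this fish. The table stores the frame numbers of the start and end frames (frames 1 and 100, respectively) and the coordinates of the anchor regions.

[0026] Once the anchor information for the two end reference frames is fixed, third anchor information for a third frame (a non-reference frame) between them is obtained by interpolation (S28). Fig. 4 illustrates the interpolation method. Let
- A(t0) be the anchor information at the start frame (time t0),
- A(t1) be the anchor information at the end frame (time t1),
- A(t) be the anchor information at time t, and
- Δt = t1 − t0.
Then
A(t) = {A(t1) − A(t0)} t/Δt + {A(t0) t1 − A(t1) t0}/Δt    (Equation 1)
Substituting x1, y1, x2, and y2 in turn for A gives the outline of the anchor region at any time. Substituting the centroid coordinates of the anchor region gives the rough motion of the anchor region. Substituting a color number for A tracks changes in the anchor region's color. Any other information that can be expressed numerically can be interpolated in the same way by the proportional division of Equation 1. The anchor information obtained by interpolation for non-reference frames may be appended to "anchor1" in the table of Fig. 3, or the table may be left as is and Equation 1 evaluated for a frame each time that frame is to be displayed. This embodiment assumes on-the-fly evaluation from here on.

[0027] When S28 is complete, the anchor information is displayed so that its content can be checked (S29). The video data is played back from the start frame, and the anchor region is drawn as a rectangle in each frame. The rectangle moves continuously according to the calculated results.

[0028] For "anchor1", the result is very good if the fish moves in a straight line at constant speed, but if it changes direction on the way, the anchor region drifts off the fish in intermediate frames. The anchor information is therefore corrected (S30). The user first advances the video data to a frame with a large deviation and pauses there, then clicks the edge of the displayed anchor region and changes the region's shape or position with the mouse. The anchor estimation unit 12 promotes the frame corrected in this way to a reference frame (a frame promoted in this way is hereinafter also called an "intermediate reference frame") and adds its anchor information to the table of Fig. 3. Fig. 5 shows the table obtained by adding the intermediate reference frame's anchor information to Fig. 3. Fig. 6 shows how interpolation is performed from three frames: the intermediate reference frame and the two end reference frames. If the non-reference frame to be estimated lies between the start frame (the first frame) and the intermediate reference frame, interpolation is performed between those two frames; if it lies between the intermediate reference frame and the end frame, interpolation is performed between those two (S28). Thereafter, through display in S29 and re-correction in S30, when good anchor information is obtained (Y in S31) it is saved (S32) and anchor setting ends. If the anchor of another frame is corrected in S30, that frame naturally becomes an intermediate reference frame as well. When two or more anchors are set on the same frame in S25, the device can, for example, assign anchor IDs automatically, changing them in the order of setting, and draw the rectangles of these anchor regions in different colors.

[0029] The above procedure has the following effects.
1. Simply setting anchors on the two end reference frames removes the need to set them on the many frames in between.
2. If interpolation causes the anchor position to drift, the drift can be seen. Frames needing correction are therefore easy to recognize, and since a corrected frame is automatically promoted to an intermediate reference frame, the user need not think about whether to make it a reference frame.
3. Even when, for example, a fish with an anchor swims in an arc, sufficiently good anchor information can be obtained by correcting at most a few frames in addition to the two end reference frames.
This concludes the overview of the video anchor setting device within the video hypermedia device of this embodiment.

[0030] [2] Setting links
Next, links are set on the configured anchors. Fig. 7 is a flowchart of the link setting and search procedure of this embodiment, and Fig. 8 shows a table of configured link information.

[0031] Fig. 7 shows the procedure when anchor setting and link setting are performed entirely independently. As in Fig. 2, initialization (S40) and reading of the video data (S41) are performed first. Then the anchor information set in [1] is read from the anchor information storage unit 21 (S42), and the link information already set is read from the link information storage unit 22.

[0032] Next, while anchor information for the other frames is obtained by interpolation from the anchor information of the end reference frames and intermediate reference frames (S44), the anchor information is displayed continuously in step with video playback (S45). In this state the user operation unit 4 waits for input from the user (S46).

[0033] If the user now clicks an anchor region, either on the playing video or after pausing it, and presses the "Create/change link" button, link information is created for that anchor (S47). For example, when a fish in the tank is clicked, candidate text, images, and so on to link to the fish appear on the screen, and the item the user selects is linked to the fish's anchor (more precisely, to the fish object contained in the anchor). If there are no candidates, the user can type in a character string and link that. Fig. 8 shows a state in which text-format information "anchor1.txt" is linked to "anchor1" and, likewise, a bitmap image "anchor2.bmp" is linked to the anchor "anchor2". Once the link information is fixed, the content of the link is saved in the link information storage unit 22 and the device again waits for user input.

[0034] If, on the other hand, the user presses the "Search link" button in S46 and specifies an anchor, the link information for that anchor is retrieved and displayed (S49). In the case of Fig. 8, for example, the fish's name, length, characteristics, and so on are displayed as text for the fish of anchor1, and a photograph of the sea the fish actually inhabits is displayed for the fish of anchor2. Since this display confirms that the links work, the user can finish content creation at this point. The content can also be commercialized by saving it to a recording medium such as a CD-ROM. For shipment as a product, the content is generally switched to anchor hidden mode, in which anchor regions are not displayed.

[0035] Anchor setting and link setting were described here as independent processes, but providing, for example, a "Return to anchor setting" button on the link-setting screen lets the user move freely between the two and makes editing easier still.

[0036] [3] UI for anchor setting
Fig. 9 shows an example UI screen for anchor setting. The image display area 50 shows the video being processed. The group of black buttons 52 in the upper row are object buttons that directly command video play, stop, and so on. Next to them are a rectangle button 54 for setting an anchor region on the frame shown in the image display area, and a start frame button 56 and an end frame button 58 for designating the displayed frame as the start or end frame. In the figure, an anchor region 60 has been set on one fish.

[0037] To the right of the center of the screen is a group of anchor-related boxes 62 showing the name, ID, start frame number, and end frame number of the anchor being set or corrected. Below the image display area 50 is a group of scene-related boxes 64 showing the number of the scene containing the currently displayed frame and that frame's serial number within the scene. Below that is a box 66 for moving the video forward or backward by a very small amount for editing. Pressing the button at its right end advances the video; pressing the left end moves it back. The position within the scene of the currently displayed frame is shown by a vertical line 70 in box 66. Below this box is a box 68 showing the positions of the start and end frames within the scene. The positions of the start and end frames are shown by double vertical lines 72 and 74, respectively, and the positions of intermediate reference frames between them by triangular symbols 76.

[0038] In the figure, the user first uses the scene number as a guide and advances the videotape to the beginning of the scene in which anchors are to be set. Here, out of an aquarium video made up of several scenes, the user has moved to the tank footage of scene number "5". The user then presses the button at the right end of box 66 and advances the video one frame at a time. When the first frame on which an anchor is to be set appears, the user presses the start frame button 56 to register it. A double vertical line 72 indicating the position of the start frame then appears at the corresponding place in box 66.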
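The user then presses the rectangle button 54 and clicks, with the mouse, the upper-left and lower-right points of the anchor region to be set in the image display area 50. This completes anchor setting for the start frame. The user then advances the video, registers the end frame, and sets its anchor in the same way.

[0039] On detecting that setting is complete for the two end reference frames, the anchor estimation unit 12 automatically substitutes the anchor information into Equation 1 and starts calculating. If the user then returns to, say, the start frame and advances the video one frame at a time, the anchor estimation unit 12 determines the time corresponding to the currently displayed frame and displays the anchor region based on the estimate for that time. If the displayed anchor region is off, the user presses the rectangle button 54 again and corrects the region. After the correction, a triangular symbol 76 appears at the place corresponding to that frame.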
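Because this UI displays the anchor information on the actual video data, editing results can be checked in real time and corrected easily.

[0040] That is the overview of this embodiment. The following improvements and variations are conceivable.

[0041] (1) Setting text anchors
This is done by the text anchor setting unit 111 of Fig. 1. Text data is first edited on the screen and overlaid on the video, and an anchor is then set on it. The difference from ordinary anchor setting is that, instead of specifying part of the played-back image, the created text is first placed on the image and an anchor region is then set so as to enclose the text. Conventionally, annotations were commonly written directly into the video image, but that is inconvenient for re-editing, for instance when an annotation has to be removed later. This embodiment removes that inconvenience.

[0042] When a text anchor is set, its anchor information is also stored in the anchor information table. In the table of Fig. 3, however, the "frame" field becomes "text" and holds the text name.

[0043] Related information can be linked to text anchors as well. For example, the text "Fish of the southern seas" can be pasted onto the tank scene of Fig. 9, and a text such as "The southern seas are home to many brightly colored fish..." linked to it.

[0044] (2) Changing the cursor's appearance
This is done by the cursor changing unit 300 of Fig. 1. The function is particularly useful in anchor hidden mode, for example when the content is used in the market. For this function the cursor changing unit 300 has a position acquisition program that constantly obtains the cursor position, a determination program that determines whether the obtained position falls within any anchor region, and a change program that, when the cursor enters an anchor region, decides how to change the cursor's appearance and actually changes the cursor's shape or other attributes accordingly.

[0045] The cursor change may be the same for every anchor or may differ from anchor to anchor. In the former case, for example, a cursor that is normally a + sign can be changed to ◎, or the cursor's brightness can be raised. This mode is especially beneficial when the target moves or changes shape quickly and the anchor region changes rapidly.

[0046] In the latter case, the change program can look up the anchor ID of the anchor region the cursor has entered and display it at the cursor position in place of the cursor. For example, when the cursor enters the anchor region of a fish, the cursor can be replaced by something like "shark" that indicates what the anchor's target is. In this mode the user learns the fish's name without having to click on it.

[0047] (3) Explicit designation of intermediate reference frames
In this embodiment only the two end reference frames are set at first, but the need for correction can sometimes be anticipated, for example when the target moves irregularly. In that case, anchor regions are accepted from the outset on frames other than the start and end frames. For example, the UI of Fig. 9 can provide an intermediate frame button in addition to the start frame button 56 and the end frame button 58. Since such a frame is used as a reference frame from the beginning, interpolation can be regarded as starting from the state of Fig. 6.

[0048] (4) Non-rectangular anchor regions
Anchor regions need not be rectangular. For a circle or an ellipse, for example, the region can be specified by the coordinates of three points giving the major axis, the minor axis, and the center. For a polygon, the coordinates of the vertices suffice. If the target's outline itself is used as the anchor region, the region can also be specified by the coordinates of one point on the outline and a chain code expressed from that point.

[0049] (5) Use of nonlinear interpolation
This embodiment uses linear interpolation, the simplest choice, but nonlinear interpolation may of course be used. The interpolation formula can be chosen by experiment or otherwise to suit the characteristics of the video being processed.

[0050] (6) Determining the start and end frames
In this embodiment the user specifies these frames explicitly, but the following methods are also possible.
1. The user simply specifies frames and sets anchors without thinking about start and end frames. The specified frames become reference frames. Among the frames on which the user has set anchors, the frame determination unit 10 takes the one with the smallest frame number as the start frame and the one with the largest as the end frame. In this case the start frame button 56 and the end frame button 58 of Fig. 9 are unnecessary.

[0051] 2. The user specifies one frame, sets an anchor on it, and specifies the target the anchor is set on. This frame becomes a reference frame. By examining the frames before and after the reference frame, the frame determination unit 10 detects the frame in which the target appears and the frame in which it disappears, and takes these as the start frame and the end frame, respectively.

[0052] Whether the target is present is judged by image matching. That is, using the target specified in the reference frame as a model, matching is performed against the preceding and following frames. The range of frames searched is extended backward and forward as long as a match is found. When a match can finally no longer be found, the start and end frames are known. With this method, only one reference frame needs to be provided initially.

[0053] (7) Three-dimensional display of anchor regions
The anchor setting unit 11 is given a function that displays the configured anchor regions in three dimensions, laid out along the horizontal and vertical screen directions x and y and the time direction t. This can be thought of as Fig. 4 being shown on the screen as is while anchors are edited. The display lets the user grasp the anchor as a whole visually.

[0054] As an application of this technique, the three-dimensionally displayed anchor information may be made directly editable. For example, moving the anchor region at the intermediate reference frame of Fig. 4 to the left on the screen produces a display like Fig. 6. The user can see the effect of the edit in real time.

[0055] (8) Cross-sectional display of the video
The anchor setting unit 11 is given a function that creates a horizontal cross-section (Fig. 10(a)) and a vertical cross-section (Fig. 10(b)) along the trajectory of the anchor region from the start frame to the end frame, and displays them together with the anchor's trajectory. First, the x and y coordinates of the centroid G of the anchor region are obtained from the anchor information of the reference frames. In the case of Fig. 10(a), a straight line parallel to the x axis is drawn on the frame through the centroid. Between adjacent reference frames, a plane containing these lines is defined (the hatched part of the figure). The video is then cut along this plane, and the resulting cross-section is projected onto the x-t plane (the stippled part of the figure). For Fig. 10(b) the same processing is done with x and y exchanged. When the anchor's trajectory has been calculated correctly, that is, when the anchor information estimated for non-reference frames is sufficiently accurate, the anchor's path of movement should appear in the two projections. For example, if a red ball is the anchor target, a red streak marking its path appears in the cross-section of the video, just as the lead appears as a straight line when a pencil is split lengthwise. If the path breaks off or thickens partway, the anchor position at that point should be corrected.

[0056] (9) Grouping anchors
The anchor information editing unit 110 is given a function that groups separately configured anchor information and treats it virtually as one anchor. For example, when person A is on screen in frames N1 to N2 and frames N3 to N4 but not in frames N2 to N3, the anchors for person A in frames N1 to N2 and frames N3 to N4 are treated as one. This reduces the work of setting and correcting anchor information. Person A and person B appearing in the same frame can also be grouped.

[0057] (10) Listing anchor information
The anchor information editing unit 110 is given a function that lists the anchor information set for the video currently being processed. For example, anchor names such as "FISH1", "FISH2", and so on are listed on the screen together with the video title "AQUARIUM". When the user selects an anchor name to check, the device may return to that anchor's start frame and play the video.

[0058] (11) Searching anchors
A search UI for anchor information is provided. When text such as the name of the anchor information to be found is entered as a keyword, the anchor search unit 15 searches the anchor information storage unit 21 for entries having that keyword and displays them. The motion of the anchor region may also be used as a search key. For example, to find objects moving to the right, the user presses, say, a "→" button in the search UI. The anchor search unit 15 calculates the trajectory of each anchor region, then finds and displays anchors containing objects that move to the right.

[0059] (12) Listing anchor information display images
The anchor information editing unit 110 is given a function that creates anchor information display images by superimposing, on frames between the start and end frames (reference or non-reference frames alike), the anchor information set by the anchor setting unit 11 or estimated by the anchor estimation unit 12, and lists them in time order. For example, as shown in Fig. 11, the start frame 81 and the end frame 82 are placed at the two ends and frames are selected at a frame interval Δt. Anchor information 80 is then superimposed on each of these frames to create the anchor information display images, which are displayed in order from the earliest display time. With this arrangement the suitability of the anchor settings can be surveyed at a glance. The positioning work using box 66 of Fig. 9 can thus be omitted, and anchors become easier to correct. Frames need not be selected at a fixed interval; for example, only the reference frames may be selected. The anchor region may also be made directly editable in the displayed images, for instance by dragging with the mouse.

[0060] (13) Correcting anchor information during video playback
As shown in Fig. 12, the anchor information editing unit 110 is provided with a video playback unit 118 that plays video on the screen, an anchor information display unit 120 that displays the anchor information for the frame being displayed, and an anchor information correction unit 119. When an anchor information correction operation is performed at one or more times during playback, the anchor information correction unit 119 identifies the frame that was being played at the time of each operation, promotes those frames to reference frames if they are non-reference frames, and corrects their anchor information according to each operation. In this arrangement, the video playback unit 118 first shows the video in the image display area 50 of Fig. 9. At the same time, the anchor information display unit 120 shows the anchor information for the displayed frame as the anchor region 60. When the user finds a frame in which the anchor region 60 is off the target, the user clicks the center of the target in the video with the mouse. In response, the anchor information correction unit 119 identifies the frame that was displayed when the click occurred and corrects the anchor information by generating anchor information centered on the clicked point. The newly set anchor region may, for example, be the same size as the original. From then on this frame is treated as a reference frame. Because the target position can be specified as playback proceeds, the effort of checking and correcting afterwards is saved.

[0061] Embodiment 2.
In Embodiment 1, anchor information was mainly calculated automatically by interpolation and corrected by hand. In this embodiment, anchors are set automatically, based on analysis of the video, on a certain number of frames that serve as reference frames from the start, and the interpolation method of Embodiment 1 is applied between these reference frames. Since frames corresponding to the intermediate reference frames of Embodiment 1 exist from the outset, the effort of manual correction is reduced.

[0062] Fig. 13 is a block diagram of the anchor setting unit 11 of the video hypermedia device of this embodiment. The configuration other than the anchor setting unit 11 is the same as in Fig. 1.

[0063] In Fig. 13, the automatic anchor setting unit 112 has a neighboring frame extraction unit 117. The neighboring frame extraction unit 117 extracts non-reference frames at a fixed interval between adjacent reference frames and promotes them to reference frames. The automatic anchor setting unit 112 also has a motion-vector-based setting unit 113, a contour-information-based setting unit 114, and a pattern-matching-based setting unit 116. Implementing any one of these three would suffice, but this embodiment implements all of them and selects one according to the situation.

[0064] The reference frame deletion unit 115 returns redundant reference frames, among those set by the automatic anchor setting unit 112, to non-reference frames, as described later. The operation of this configuration is described below.

[0065] [1] Automatic anchor setting using motion vectors
This process has a two-stage structure: motion vectors of a block are first obtained from the start frame to the end frame, and the agreement between hypothetical paths of the target and the motion vectors is then evaluated, which improves the accuracy with which the target's position is tracked.

[0066] 1. Obtaining motion vectors
Let the times of the start frame and the end frame be t0 and t1. In addition to these, the neighboring frame extraction unit 117 first changes some non-reference frames into reference frames. Here this is simply done every 5 frames, and for simplicity the elapsed time between reference frames is normalized to 1 from here on. To obtain the motion vectors of a particular anchor over the period from the start frame to the end frame, block matching is performed using the image region near the anchor's centroid as the block. The frame corresponding to an arbitrary time t is written frame(t).

[0067] Fig. 14 is a flowchart of the motion vector acquisition procedure of this embodiment. As shown, the time counter t is first set to t0 (S100). Next, among the blocks set in the start frame, the block whose motion vectors are to be obtained is specified. The motion-vector-based setting unit 113 stores the region containing the centroid of the specified anchor as the block used in block matching (hereinafter the "anchor block") (S101). The image data I(t) of frame(t) and I(t+1) of frame(t+1) are then obtained (S102). I(t) is the set of pixel values p of the pixels in the frame.

[0068] The block is then moved around within frame(t+1) in search of the best match (S103). Since the block's own pixel values are known from I(t), the block is placed at an arbitrary position in frame(t+1), the squared error between overlapping pixel values is calculated, and this is summed over the whole block. The sum is computed while the block is shifted little by little, and the position where the sum is smallest is judged to be where the block has moved to.

[0069] Once the destination is known, the amount and direction of movement from the block in frame(t) to the block in frame(t+1) are determined, and this is obtained as the motion vector V(t) (S104). It is then determined whether t+1 has reached the end frame time t1 (S105); if not, t is incremented (S106) and motion vectors continue to be obtained. When t+1 equals t1, the V(t) obtained so far are saved (S107) and the process ends.

[0070] Fig. 15 shows an example of the motion vectors V(0) to V(2) obtained with t0 = 0 and t1 = 3. As shown, V(t) can be expressed by three components (x, y, t): x and y, defined by the horizontal and vertical screen directions, and t, defined by the time direction.

[0071] 2. Evaluating agreement
Each frame is divided into blocks of the same size as the block used for the motion vectors, and all paths the target could have followed are enumerated. Fig. 16 shows one such path. In the figure, each frame is divided into 16 blocks, and the path's starting point in the start frame and end point in the end frame coincide with the blocks of Fig. 14. Under this condition there are 16 × 16 paths in all. For each segment of such a path (hereinafter a "hypothetical path"), a vector v(t) (hereinafter a "path vector"), shown in Fig. 16, is defined. A path vector is determined by the direction taken when following the hypothetical path from one frame to the next. v(t) is likewise described by three components (x, y, t).

[0072] In each segment, let θt be the angle between V(t) and v(t), and calculate f(t) = cos θt from the inner product by the following equation.

[0073]
f(t) = (V(t), v(t)) / (|V(t)|・|v(t)|)    (Equation 2)
Fig. 17 adds the V(t) of Fig. 15 to the v(t) of Fig. 16 and shows the meaning of θt. The larger f(t) of Equation 2, the better the hypothetical path and the motion vector agree in that segment. Even if agreement is maximal in one segment, however, overall agreement must be regarded as low if agreement in the other segments is very low. To evaluate overall agreement while taking each segment's agreement into account, the following evaluation formula is introduced.

[0074]
g(t) = max{f(t−1) + g(t−1)}    (Equation 3)
Evaluating Equation 3 recursively always yields the hypothetical path with the highest agreement up to that time. Carrying the calculation through to the end frame yields the hypothetical path with the highest agreement overall, and this path is taken to be the target's path of movement. Anchors are then set automatically on the assumption that the target at each time is located where this path intersects each reference frame.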
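
The path evaluation described above, a cosine agreement score per segment accumulated by a recursive maximization, can be read as a dynamic-programming search over block sequences. The Python sketch below shows one such reading. The function names `cosine` and `best_path`, the 4 × 4 grid of unit blocks, and the sample motion vectors are all assumptions for illustration, and keeping one accumulated score per block is an interpretation of the recursion rather than something the specification spells out.

```python
import math


def cosine(u, v):
    """Agreement f = (u . v) / (|u| |v|), the cosine of the angle between u and v."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def best_path(centers, start, end, motion):
    """Find the block sequence whose path vectors best agree with the motion vectors.

    centers: (x, y) block centers, the same grid in every reference frame
    start, end: block indices fixed in the first and last reference frame
    motion: measured vectors V(t) = (dx, dy, 1), one per frame interval
    Returns (score, path), where path holds one block index per reference frame.
    """
    n, steps = len(centers), len(motion)
    neg = float("-inf")
    g = [neg] * n          # g[b]: best accumulated agreement of paths ending in b
    g[start] = 0.0
    back = []              # back[t][b]: predecessor of b on the best path
    for t in range(steps):
        new_g, prev = [neg] * n, [None] * n
        for b in range(n):
            if t == steps - 1 and b != end:
                continue   # the last segment must land on the fixed end block
            for a in range(n):
                if g[a] == neg:
                    continue
                v = (centers[b][0] - centers[a][0],
                     centers[b][1] - centers[a][1], 1.0)
                score = g[a] + cosine(motion[t], v)
                if score > new_g[b]:
                    new_g[b], prev[b] = score, a
        g = new_g
        back.append(prev)
    path = [end]           # trace the best path back from the end block
    for prev in reversed(back):
        path.append(prev[path[-1]])
    path.reverse()
    return g[end], path


# Hypothetical 4 x 4 grid of unit blocks, indexed row by row.
centers = [(col + 0.5, row + 0.5) for row in range(4) for col in range(4)]
motion = [(1.0, 0.0, 1.0), (1.0, 1.0, 1.0), (1.0, 0.0, 1.0)]
score, path = best_path(centers, start=0, end=7, motion=motion)
```

With these sample vectors the selected path steps right, then diagonally, then right again, so each path vector is parallel to the measured motion vector for its segment. The cost grows with the square of the number of blocks per interval, which is one reason fewer reference frames lighten the load.
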
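The anchor information so set can be added to the anchor information table shown in Fig. 3. For frames other than reference frames, anchor information can be calculated on the fly by interpolation, as in Embodiment 1.

[0075] [2] Automatic anchor setting using contour information
Another method of automatic anchor setting estimates the target's position from the movement of its contour. The contour-information-based setting unit 114 generates a contour image for each frame by the same kind of iteration as in Fig. 14. A contour image is a binary image that is 1 on contour lines and 0 elsewhere, and can be generated by applying, for example, a compass-gradient filter to the image. Once the contour images are obtained, anchor information is set on the assumption that the anchor moves exactly as the target does.

[0076] [3] Automatic anchor setting using matching
Yet another method estimates the target's position by pattern matching, as shown in Fig. 18. In this method too, the neighboring frame extraction unit 117 first provides a certain number of reference frames in advance. Next, a model 134 for pattern matching is created from the anchor region 132 set in the start frame 130, and the region 138 with the best match is found in the adjacent reference frame 136. Pattern matching methods include template matching, in which the model's image data is overlaid as is, and structural matching, in which the overlay is based on the positional relationships of feature points extracted from the image. Pattern matching is performed mainly in the vicinity of the model.

[0077] Once the region 138 is found in the second reference frame, the same processing is repeated with this region 138 as the new model, and the target is tracked. A good match may not be obtained when the target moves or deforms violently or when the reference frames are spaced too widely. In that case the neighboring frame extraction unit 117 narrows the interval to provide more reference frames, and the matching is redone.

[0078] [4] Deleting unnecessary reference frames
In the example above, a reference frame was provided every 5 frames. If, however, the target moves in a straight line at constant speed, the start frame and the end frame suffice as reference frames. Even if the target does not move uniformly in a straight line all the way from the start frame to the end frame, for any period in which it does, only the reference frames at the two ends of that period are needed. Each reference frame removed lightens the computational load. In the case of [1] this is particularly effective because the number of hypothetical paths drops sharply.

[0079] From this viewpoint, the reference frame deletion unit 115 deletes unnecessary reference frames. Figs. 19 to 21 show reference frames being deleted. In these figures the horizontal axis is time and the vertical axis is the distance from the origin of the x-y coordinates defined on the frame. Each ○ in the figures schematically represents an anchor region. Deletion proceeds as follows.

[0080]
[Fig. 19]
Initially there are six reference frames, including the two end reference frames. The anchor of the start frame and the anchor of the end frame are joined by a straight line, and the distance between this line and each anchor is calculated. If any anchor's distance is at or below a predetermined value, the reference frame at that time is deleted. In this figure no anchor is deleted. Next, the anchor farthest from the line (hereinafter the farthest anchor) is found. Here the anchor at t = 3 is the farthest anchor.
[Fig. 20]
The line is discarded, the start frame anchor, the farthest anchor, and the end frame anchor are joined in that order by a polyline, and the distance between the polyline and each anchor is calculated again. The anchor at t = 4, whose distance is now at or below the predetermined value, is deleted. The farthest anchor becomes the anchor at t = 2.
[Fig. 21]
The polyline is revised to pass through the new farthest anchor. The reference frame at t = 1, whose distance from the new polyline is at or below the predetermined value, is deleted. The process ends here.

[0081] In this example two reference frames were deleted. When there are many reference frames initially, the following steps are repeated: 1. delete reference frames whose distance is at or below the predetermined value, 2. find the farthest anchor, and 3. revise the polyline.

[0082] That is the overview of this embodiment. The following improvements or variations are conceivable.
(1) Changing Equation 2
Equation 2 uses f(t) = cos θt, but another function may of course be used. Any function that rises and falls together with θt is a candidate for f(t).
(2) Choice of blocks
In Fig. 15 the block was chosen to include the vicinity of the anchor region's centroid, but it may be chosen otherwise. For example, the anchor region itself may be used as the anchor block. Likewise, in Fig. 16 the blocks may be chosen independently of the block size.

[0083] (3) Alternative to [3] above (variant 1)
Another form of automatic anchor setting by matching is as follows. When matching proceeds from one frame to the next with a region of a frame as the model, errors may accumulate and the result may gradually drift off the target. For this reason the decision takes into account matching results not only from the neighboring frame but also from frames some distance away in time.

[0084] As shown in Fig. 22, the start frame and the end frame are used here as the temporally distant reference frames. Suppose the new reference frame 404 in which the anchor position is to be determined is at time t+Δt. The model A 400 of the anchor region in the start frame, the model B 401 of the anchor region in the end frame, and the model C 403 of the anchor region in the reference frame 402 at time t are all known. Matching is therefore performed between each of these three reference frames and the reference frame 404 at time t+Δt. If all the matching results agree, the tracking result for the region is reliable. If they do not agree, the anchor position is determined, for example, as follows.
1. The three regions obtained on the reference frame 404 at time t+Δt by the three matchings are superimposed, and an anchor region of the same size as the original region is created, centered on the center of the overlap.

[0085] 2. Several matching results are obtained for one model, model C 403, and, in descending order of match quality, it is determined whether each resulting region contains the regions obtained from matching model A 400 and model B 401. If they are contained to at least a certain proportion, an anchor region of the same size as the original region is created, centered on the center of the overlap of the regions obtained from the matching results. The start and end frames were considered here in addition to the neighboring reference frame, but the combination is flexible. For example, the neighboring reference frame and any number of reference frames a fixed time distance away may be used.

[0086] (4) Alternative to [3] above (variant 2)
As shown in Fig. 23, the automatic anchor setting unit 112 is provided with an automatic setting reliability determination unit 130 and an automatic setting reliability display unit 131. The automatic setting reliability determination unit 130 judges the reliability of tracking from the degree of overlap between the anchor region obtained by matching in the forward direction up to the final frame and the anchor region specified in the end frame. For example, the tracking result is judged reliable if the overlapping area is 70% or more, and unreliable if it is 50% or less. The automatic setting reliability display unit 131 displays the tracking reliability (whether tracking is reliable, or the percentage).

[0087] For example, as shown in Fig. 24, when the tracking result 141 obtained by matching deviates from the target's true trajectory 140, the degree of overlap is low and tracking is judged unreliable. In that case, the image is matched in the reverse direction using the anchor region of the end frame as the model, and the target's estimated position is tracked again.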
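
The reliability judgment described above can be sketched in Python as an overlap test between two rectangles. The sketch rests on several assumptions: the specification does not say what the overlapping area is measured against, so the ratio here is taken relative to the area of the anchor region specified in the end frame; the band between the two thresholds is reported as "undetermined"; and the names `overlap_ratio` and `judge_tracking` are hypothetical.

```python
def overlap_ratio(tracked, specified):
    """Fraction of the specified rectangle covered by the tracked rectangle.

    Rectangles are (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
    """
    w = min(tracked[2], specified[2]) - max(tracked[0], specified[0])
    h = min(tracked[3], specified[3]) - max(tracked[1], specified[1])
    if w <= 0 or h <= 0:
        return 0.0  # the rectangles do not overlap at all
    area = (specified[2] - specified[0]) * (specified[3] - specified[1])
    return (w * h) / area


def judge_tracking(tracked, specified, ok=0.7, bad=0.5):
    """Classify a tracking result using the 70% and 50% example thresholds."""
    ratio = overlap_ratio(tracked, specified)
    if ratio >= ok:
        return "reliable", ratio
    if ratio <= bad:
        return "unreliable", ratio
    return "undetermined", ratio


# Hypothetical rectangles: forward-tracked result vs. user-specified end anchor.
verdict, ratio = judge_tracking((0, 0, 10, 10), (2, 0, 12, 10))
```

A caller could trigger reverse matching only on an "unreliable" verdict, or merely display the ratio and leave the decision to the user.
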
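During reverse tracking, at each reference frame, the anchor region set earlier by forward matching is compared with the anchor region obtained by reverse matching, and tracking ends when the two overlap by at least a predetermined proportion. Alternatively, tracking may be cut off at a specified frame. The frame at which tracking should end may be determined, for example, by the ratio of its distances from the start frame and the end frame. The overlap test and the specified end frame may also be used together.

[0088] With this method, even if an error occurs partway through tracking, deterioration of the subsequent tracking results can be avoided and the final correction work is reduced. Also, when reverse matching is ended based on the degree of overlap, only the necessary part is corrected, which shortens processing time.

[0089] Even when reliability is low, the device may simply display the reliability instead of starting reverse matching automatically. The user may then run reverse matching or make whatever corrections they wish. Either way, the automatic setting reliability display unit 131 lets the user know how well tracking went and respond appropriately.

[0090] (5) Alternative to [3] above (variant 3)
The components of Fig. 12, namely the video playback unit 118, the anchor information display unit 120, and the anchor information correction unit 119, are provided within the automatic anchor setting unit 112. Here the video playback unit 118 displays the frames making up the video in time order at suitable time intervals. When an anchor information correction operation is performed at any time during playback, the anchor information correction unit 119 corrects the anchor information of the frame displayed at that time. It also invalidates the automatic anchor setting results for the frames displayed during a predetermined number of frames or a predetermined period immediately before that frame.

[0091] The operation of this configuration is as follows. Assume that, as the video plays, frames are displayed while matching proceeds successively in the forward direction from the start frame. In this operation, as already noted, once matching goes bad the anchor information may drift further and further off. Watching the played video and the anchor information, the user clicks the screen when the anchor region comes off the target. Playback stops at that point. If the user then clicks, for example, the center of the target, the anchor position is corrected so that the clicked point becomes its center. Subsequent matching is based on the corrected anchor information and is therefore good.

[0092] In this method the user clicks the screen while the video is playing, so the delay in the user's action must be considered. That is, by the time the user notices that the anchor region has come off the target and clicks, the drift has probably been developing gradually over several frames. The anchor information correction unit 119 therefore invalidates the automatically set anchor information for a predetermined number of frames preceding the frame whose anchor information was corrected.

[0093] Fig. 25 shows the relationship between the target's true trajectory and the tracking result obtained by matching, and the operation of the anchor information correction unit 119. The solid line 150 indicates where the tracking result coincides with the target's trajectory, and the broken line 151 where the tracking result is off the trajectory. As shown, the tracking result is reliable from time t0 to t1 but begins to drift at time t1. The user notices this and clicks the screen at time t2. As a result, correct tracking resumes between times t2 and t3.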
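The incorrect tracking result between times t1 and t2 would otherwise remain, so it is invalidated. As shown in the figure, the missing tracking result for the invalidated part can be filled in by linearly interpolating the anchor information at times t1 and t2. The number of frames whose anchor information is invalidated may be specified in advance or at the time of correction.

[0094] Embodiment 3.
The video hypermedia devices of Embodiments 1 and 2 can be applied to build the following devices or systems.

[0095] 1. Interactive video teaching material production device
The video hypermedia device of the present invention is well suited to producing CAI content. Anchors are set on video teaching material with the device, and the necessary additional information is linked to them. Fig. 26 shows video teaching material produced with this device. As shown, explanation A is linked to anchor A, explanation B to anchor B, and so on. While playing the video, a student clicks on screen an object about which they want a more detailed explanation. If the clicked object is associated with anchor A, explanation A is displayed on the screen.

[0096] 2. Interactive video server system
The video hypermedia device of the present invention is also well suited to a video server system. Fig. 27 is a block diagram of this video server system. As shown, the system consists of a server 200 and a client 250, which between them roughly share the configuration of Fig. 1.

[0097] The server 200 has a data storage unit 204 that stores videos, their anchor information, and the related data linked to the anchors; an anchor estimation unit 206; and a hyperlink search unit 208 that retrieves the related data linked to any anchor. The anchor estimation unit 206 estimates anchor information for non-reference frames.

[0098] The client 250 has an anchor determination unit 252 that determines which anchor region was clicked when the user clicks a target in the video.

[0099] In this configuration, when the user clicks an object on the screen at the client 250, the anchor determination unit 252 identifies the clicked anchor. This information is sent to the server 200. The hyperlink search unit 208 of the server 200 retrieves the related data linked to that anchor from the data storage unit 204 and sends it to the client 250.

[0100] With this system, video data and anchor information are stored together on the server 200 side, and many users can view the videos they need and the information linked to them.

[0101]
[Effects of the Invention] With the video anchor setting device of the present invention, setting anchor information only on reference frames allows the anchor information of non-reference frames, that is, frames other than reference frames, to be estimated, so anchor information need not be set on non-reference frames. As a result, anchor setting work is less laborious.

[0102] In addition, because the device includes reference frame deletion means, the number of reference frames whose anchor information must be held is reduced, and the required storage capacity can be reduced.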
Related to the fixed device. The invention is particularly useful for
For setting anchors to targets etc. included in
About the installation. [0002] 2. Description of the Related Art Conventionally, a general hypermedia device is used.
Searching for information is mainly performed on text and still images.
Set a logical unit for the information link, and
Link the relevant information in advance and let the user
The form in which the related information is displayed when clicked
I was However, for example, for encoding and decoding moving images
In recent years, as represented by MPEG,
Various technologies to process not only images but also moving images are proposed
Have been. By handling moving images,
Media devices, CAI, various presentations,
Useful for creating content such as child catalogs. Video
Image editing is limited to a limited number of industrial fields such as broadcasting stations.
Although it has been used, personal computers will be
Is rapidly spreading as a personal system
it is considered as. [0003] Japanese Patent Application Laid-Open No. 4-163589 discloses a moving image.
A logical unit (called a node in the specification)
There is disclosed an image processing device that can be specified. This
In this device, the setting of the node in the still image is simply the display range.
For points that can be specified only by specifying
(1) Display range and (2) Time
In that you only need to specify the valid duration of the
Attention is paid to these specifications. Sand
Regarding (1), subjects appearing in a moving image
By pointing the surrounding area with a mouse or the like,
Is set for the node, while for (2),
Depending on the elapsed time from the start time of the moving image output,
Specify the valid duration of the code. So this node is
Uniquely defined by the two contents of the storage area and the elapsed time
In other words, related information can be linked to each node.
Wear. After the link, when actually playing the moving image, the user
Clicks an area on the screen with a mouse, etc.
Nodes are identified by location and time and relevant information is displayed
Is done. [0004] In the above apparatus, the logic
It was decided to set the unit area manually. Only
Naturally, moving images contain many files, unlike still images.
There are frames, and the position and shape of the subject change every moment.
For NTSC, 30 frames are required per second
Therefore, even if it is simply calculated, it processes a one-second moving image.
Requires 30 setting operations per logical unit
You. For example, when creating content for 5 minutes, one frame
Assuming that 5 logical units are set in the
45,000 times. [0005] The present invention has been made in view of this problem.
Its purpose is to define logical units (referred to herein as anchors).
A device that enables labor saving and simplification of setting work
More specifically, the anchor that had to be performed for each frame
Setting device that automatically calculates or automatically sets information
In the offer. [0006] SUMMARY OF THE INVENTION A moving image anchor of the present invention is provided.
Setting device, DynamicMultiple frames that make up the picturePrescribed for
At intervalsReference frameSelectionAndThoseEach of the reference frames
Anchor information setting method for setting anchor information for each
And a non-reference file based on the set anchor information.
Anchor information calculation means for calculating frame anchor information
And a moving image anchor setting device including
The anchor information set in the reference frame of
A predetermined error is determined based on the anchor information set in the sub-frame.
Judgment means for judging whether calculation is possible within the difference range
If it can be calculated,
Frame deletion means for changing a frame to a non-reference frame
And further includes [0007] BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, a moving image hyper-me
A preferred embodiment of the ear device will be described. This device
Is equipped with the video anchor setting device of the present invention.
You. According to this device, for example, a moving image of an aquarium tank
When you click on any fish swimming
Can display the name of the fish, supplementary explanations, etc.
Create interactive CAI software easily and efficiently
Can be. In the following embodiments, “user” is mainly
The creator of the content
Those who personally edit videotapes
It may be any. Embodiment 1 In this embodiment, the user specifies the start frame and end frame.
Frames, the first frame and the second
The two frames that are the frames of the first reference frame
Set the anchor as "Frame" is a table of images
It is a display unit and includes a picture referred to in MPEG. Book
Is the device an anchor set for the reference frame?
The anchor region in other frames
Automatically calculates anchor information such as area position, shape, and color
I do. “Anchor information” is, for example, the position of the anchor area.
When explicitly displaying the position, shape, or anchor area,
Refers to color. Note that even the first reference frame is a book.
The invention is established, but an example thereof will be described later. A moving image hypermedia device according to the present embodiment
The configuration of the entire system, including this apparatus, consists of a personal computer (hereinafter referred to as a PC) and a video playback device that supplies moving images to the PC. The PC has a built-in video capture board that captures and digitizes the images supplied from the video playback device. The video playback device has functions such as normal playback start, stop, fast forward, frame advance, and time-specified playback. Such video playback devices are widely used in broadcasting and similar fields, but the device need not be limited to this type. The functions of the video playback device are controlled through a user interface (hereinafter, UI) developed on the PC, for example a "play button" displayed on the screen. When the user clicks the button, the operation is transmitted from the PC to the video playback device over a signal cable. The video playback device itself is not an essential component of the moving image hypermedia device, but the system including it is described here. FIG. 1 is a configuration diagram of a system including the moving image hypermedia device according to the present embodiment.

[0010] This apparatus is broadly divided into a data operation unit 1 that operates on data relating to anchor information and link information, a data storage unit 2 that stores these data, a display unit 3 that displays the data in a meaningful form, a user operation unit 4 that accepts and manages user operations, and a moving image input unit 6 that inputs the moving image generated by the video playback device 5.

(1) Data operation unit 1
Instructions from the user to the data operation unit 1 are given through the UI described later. The internal components described below are software modules.

[0012] The frame determination unit 10 determines a start frame and an end frame. In this embodiment, the frames specified by the user become the start frame and the end frame. In the aquarium video mentioned above, for example, the start and end frames are the first and last frames of the scene that shows the fish tank. If the scene then moves to footage of the aquarium entrance, no anchor needs to be set on the fish from that point on, so the end frame is specified before the scene transition.

The anchor setting unit 11 actually sets anchors between the start frame and the end frame. For example, to set an anchor on a fish, the user draws a rectangle around the fish with the mouse.
This rectangle is registered as the anchor area. At this time, the video is in stop mode. Next, the video is advanced to the end frame, where the same fish is enclosed again and the anchor area is registered. Between the start frame and the end frame the fish moves or changes direction, so in general both its position and its shape change. The shape and position of the anchor area registered in the start frame therefore generally do not coincide with those registered in the end frame. The anchor setting unit 11 includes an anchor information editing unit 110, which is used when an anchor is modified as described later, and a text anchor setting unit 111, which sets an anchor on text such as a character string.

The anchor estimating unit 12 performs an interpolation calculation based on the first and second anchor information set in the start frame and the end frame, and estimates the position and size of the anchor in an arbitrary frame (non-reference frame). This processing is described in detail later.

[0015] The anchor search unit 15 searches for anchors based on anchor information, the movement characteristics of an anchor, or anchor identification information. Identification information is information that helps distinguish an anchor from other anchors, for example the anchor name, the object on which the anchor is set, and the date and time at which the anchor was set.

The hyperlink setting unit 13 sets hyperlinks on the anchors that have been set and creates a data structure in the form of a table. The hyperlink search unit 14 searches the link information that has been set.
In the above example, a fish anchor and text data or the like are associated by a hyperlink.

(2) Data storage unit 2
The data storage unit 2 may be a database, any of various file devices, or a memory device. This part is mainly hardware. The moving image data storage unit 20 stores video data that has been captured and digitized.
The anchor information storage unit 21 and the link information storage unit 22 store the anchor information and the link information that have been set, respectively.

(3) Display unit 3
The display control unit 30 comprises a display system program that controls the overall display of various images, such as the UI and the moving image being edited, together with a display circuit such as a VGA controller and its driver. The display control unit 30 has a cursor changing unit 300. The cursor changing unit 300 changes the display state of the cursor when the cursor enters an anchor area. The output data of the display control unit 30 is displayed on a display device 31 such as a PC monitor.

(4) User operation unit 4
This unit allows the user to enter commands. It comprises hardware such as a keyboard, a mouse, and various pointing devices, and a command dispatcher. Examples of commands include anchor setting, anchor area modification, and link search.

(5) Moving image input unit 6
This is hardware corresponding to a video capture board. It has an A/D converter and a frame memory (not shown) and digitizes the input moving image. The data is then supplied to the moving image data storage unit 20.

Based on the above configuration, the procedure for setting anchors and links is described first, followed by the UI for anchor setting.

[1] Setting of anchor
FIG. 2 is a flowchart showing the anchor setting and correction procedure according to this embodiment, and FIG. 3 is a diagram showing a table of the anchor information that is set. As shown in FIG. 2, various initialization processes are first performed on the software and the like (S21), and the moving image data stored in the moving image data storage unit 20 is read (S22). The first frame of the loaded video data is initially displayed as a still image on the display device 31. Next, anchor information already set for this video data is read from the anchor information storage unit 21 (S23). If anchor information exists, the anchor area is displayed on the screen (hereinafter, the mode in which the anchor area is displayed on the screen is called "anchor display mode" and the mode in which it is not displayed is called "anchor non-display mode").

Next, the video data is advanced to the start frame of the period in which a new anchor is to be set (S24). When the desired frame appears, the user presses the "Start Frame" button on the screen to register the start frame. The apparatus is then waiting for an anchor area to be set in this frame, and the user provides a rectangular area by, for example, clicking the mouse around a fish. Once the rectangular area is determined, the coordinates of its upper left point (x1, y1) and lower right point (x2, y2) are obtained and recorded, together with the frame number of the start frame (the serial number counted from the first frame of the video), as the anchor information for this fish (S25).

The moving image data is then advanced again and stopped when the end frame appears (S26), and a rectangular area is provided for the same fish. This completes the setting in the end frame (S27). "anchor1" in FIG. 3 is the anchor ID indicating this fish.
Here, the frame numbers of the start frame and the end frame (frames 1 and 100, respectively) and the coordinate information of the anchor area are stored in the table.

Once the anchor information in the reference frames at both ends has been fixed in this way, the anchor information of a third frame (a non-reference frame) is obtained by interpolation calculation (S28). FIG. 4 is a diagram showing the interpolation calculation method for anchor information.
Here, let
the anchor information at the start frame (time t0) be A(t0),
the anchor information at the end frame (time t1) be A(t1),
the anchor information at time t be A(t), and
t1 - t0 = Δt.
Then A(t) is calculated as

  A(t) = {A(t1) - A(t0)} t / Δt + {A(t0) t1 - A(t1) t0} / Δt    (Equation 1)

This is the internal division of A(t0) and A(t1): it gives A(t0) at t = t0 and A(t1) at t = t1. If x1, y1, x2, and y2 are each substituted for A, the outline of the anchor area at an arbitrary time is obtained. If the barycentric coordinates of the anchor area are substituted, the rough movement of the anchor area is obtained. If a color number is substituted for A, the color change of the anchor area can be tracked. Any other information that can be expressed numerically can likewise be interpolated by the internal division calculation of Equation 1. The anchor information of non-reference frames obtained by interpolation may be added to "anchor1" in the table, that is, the table shown in FIG. 3 may be extended, or it may be calculated frame by frame as needed. The present embodiment assumes the latter.

When S28 is completed, the anchor information is actually displayed so that its contents can be checked (S29). At this point the video data is returned to the start frame and played back, and the anchor area is displayed as a rectangle. This rectangular area moves continuously according to the calculation result.

In the case of "anchor1", a very good result is obtained if the fish moves in a straight line, but if the fish changes its swimming direction partway through, the anchor area drifts away from the fish. The anchor information is therefore corrected (S30). The user first advances the video data to the frame concerned and stops the image there. The user then clicks the edge of the anchor area displayed on the screen and changes the shape or position of the area. The anchor estimating unit 12 promotes the frame corrected in this way to a reference frame (hereinafter, a frame promoted to a reference frame is also called an "intermediate reference frame") and adds its anchor information to the table of FIG. 3.

FIG. 5 shows the table obtained by adding the anchor information of the intermediate reference frame to the table of FIG. 3. FIG. 6 shows how the interpolation calculation is performed based on three frames. If the non-reference frame to be estimated lies between the start frame, which is the first frame, and the intermediate reference frame, the interpolation calculation is performed between those two frames; if it lies between the intermediate reference frame and the end frame, the interpolation calculation is performed between those two frames (S28). The display in S29 and the re-correction in S30 are then repeated, and when good anchor information is obtained (Y in S31), it is saved (S32) and the anchor setting process ends.
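
The interpolation of Equation 1, applied between whichever two reference frames enclose the frame of interest, can be sketched as follows. This is a minimal illustration under stated assumptions, not the program of the apparatus: the names Rect, lerp, and estimate_anchor are hypothetical, and the table is assumed to map reference frame numbers to rectangles in the manner of FIG. 3 and FIG. 5.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Anchor area given by its upper left (x1, y1) and lower right (x2, y2) points."""
    x1: float
    y1: float
    x2: float
    y2: float


def lerp(a0: float, a1: float, t0: int, t1: int, t: float) -> float:
    """Equation 1: A(t) = {A(t1) - A(t0)} t / dt + {A(t0) t1 - A(t1) t0} / dt."""
    dt = t1 - t0
    return (a1 - a0) * t / dt + (a0 * t1 - a1 * t0) / dt


def estimate_anchor(reference: dict[int, Rect], frame: int) -> Rect:
    """Interpolate the anchor area of `frame` from the two enclosing reference frames."""
    frames = sorted(reference)          # start, intermediate and end reference frames
    if not frames[0] <= frame <= frames[-1]:
        raise ValueError("frame lies outside the start/end frames")
    if frame in reference:              # a reference frame needs no interpolation
        return reference[frame]
    i = bisect_right(frames, frame)     # frames[i - 1] < frame < frames[i]
    t0, t1 = frames[i - 1], frames[i]
    r0, r1 = reference[t0], reference[t1]
    return Rect(*(lerp(a, b, t0, t1, frame)
                  for a, b in zip((r0.x1, r0.y1, r0.x2, r0.y2),
                                  (r1.x1, r1.y1, r1.x2, r1.y2))))
```

In this sketch, promoting a corrected frame to an intermediate reference frame amounts to adding one more entry to the dictionary; later calls then interpolate on either side of it, as in FIG. 6.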
If the anchor of yet another frame is modified in S30, that frame naturally also becomes an intermediate reference frame. When two or more anchors are set in the same frame in S25, the apparatus may, for example, assign anchor IDs automatically in the order of setting and display the rectangles of the anchor areas in different colors.

The above procedure provides the following effects.
1. Anchors are set only in the reference frames at both ends, so the setting work for the many frames in between becomes unnecessary.
2. If the position of an anchor drifts in the interpolation calculation, this can be confirmed on the screen. The frames that need correction are therefore easy to recognize, and once a frame is corrected it is automatically promoted to an intermediate reference frame, so the user need not consider which frames should be made reference frames.
3. Even when, for example, a fish with an anchor swims along an arc, sufficiently good anchor information is obtained by making corrections in several frames in addition to the reference frames at both ends.

The above is an overview of the moving image hypermedia device, and in particular of the moving image anchor setting device.

[2] Link setting
Next, links are set for the anchors that have been set.
FIG. 7 is a flowchart showing the link setting and search procedure according to this embodiment, and FIG. 8 is a diagram showing a table of the link information that is set. FIG. 7 shows the processing procedure for the case where anchor setting and link setting are performed completely independently. As shown in the figure, various initialization processes (S40) and the reading of the moving image data (S41) are performed first. Next, the anchor information set in [1] is read from the anchor information storage unit 21 (S42), and link information that has already been set is read from the link information storage unit 22.

Next, the moving image is played back while the anchor information of the other frames is obtained by interpolation calculation from the anchor information of the reference frames at both ends and the intermediate reference frames (S44), and the anchor areas are displayed continuously along with the playback (S45). In this state, the apparatus waits for input from the user operation unit 4 (S46).

[0033] Here, if the user clicks on an anchor area, either while the moving image is playing or after pausing it, and presses the "Create / Change Link" button, link information is created for that anchor (S47). For example, when a fish in the tank is clicked, candidates for the text, images, and so on that should be linked to that fish are presented, and the text selected by the user is linked to the anchor of the fish (more precisely, to the object, namely the fish, contained in the anchor). If there is no suitable candidate, the user can also enter a character string directly and link it. FIG. 8 shows a state in which the text-format information "anchor1.txt" is linked to "anchor1" and, likewise, the bitmap image "anchor2.bmp" is linked to "anchor2". Once the link information is fixed in this way, the content of the link is stored in the link information storage unit 22, and the apparatus again waits for user input.

If, on the other hand, the user presses the "Search" button in S46 and specifies an anchor, the information linked to that anchor is retrieved and displayed (S49). In the case of FIG. 8, for example, the name, body length, and characteristics of the fish are displayed as a character string for the fish of anchor1, and a photograph of the sea that the fish actually inhabits is displayed for the fish of anchor2. This display lets the user confirm how the content behaves, and content creation can be completed at this point. The content can also be commercialized, for example by storing it on a recording medium such as a CD-ROM. When it is shipped as a product, it is generally switched to the anchor non-display mode, in which anchor areas are not displayed.

Anchor setting and link setting have been described here as independent processes, but if, for example, a button such as "Return to anchor settings" is provided on the screen during link setting, the user can move freely between the two and editing becomes easier.

[3] UI for anchor setting
FIG. 9 is a diagram showing an example of a UI screen for setting an anchor.
In the figure, the moving image being processed is displayed in an image display area 50. The black buttons 52 in the upper row are buttons for directly instructing playback, stop, and so on. Next to them is a rectangle button 54 for setting an anchor area in the frame displayed in the image display area. Likewise, a start frame designation button 56 and an end frame designation button 58 are provided for designating the displayed frame as the start frame or the end frame. In the figure, an anchor area 60 is set for one fish. At the center right of the screen is an anchor-related box group 62 showing the name, ID, start frame number, and end frame number of the anchor being set or modified.
Below the image display area 50 is a scene-related box group 64 showing the scene number of the currently displayed frame and the serial number of that frame within the scene. Below that is a box 66 for moving the video forward or backward by a small amount for editing. Pressing the right end of this box advances the video, and pressing the left end moves it back. The position within the scene of the currently displayed frame is indicated in the box 66 by a vertical line 70. Below this box is a box 68 indicating the positions of the start frame and the end frame within the scene. The positions of the start frame and the end frame are indicated by double vertical lines 72 and 74, respectively, and the position of an intermediate reference frame is indicated by a triangular symbol 76.

In the figure, the user first uses the scene number as a guide to advance the videotape to the beginning of the scene in which an anchor is to be set. Here, for example, the video consists of multiple scenes, and the aquarium footage is running as scene number "5". The user may press the right end of the box 66 to advance the video one frame at a time. When the first frame in which an anchor is to be set appears, the user presses the start frame designation button 56 to register it. At this time, a double vertical line 72 indicating the position of the start frame appears at the corresponding place in the box 66. The user then presses the rectangle button 54 and clicks the upper left and lower right points of the anchor area to be set in the image display area 50 with the mouse. This completes the anchor setting for the start frame.
Next, the user advances the video, registers the end frame in the same way, and sets the anchor. When the apparatus detects that the settings in the reference frames at both ends are complete, the anchor estimating unit 12 substitutes the anchor information into Equation 1 and starts the calculation. When the user then returns to the start frame and advances the frames, the anchor estimating unit 12 obtains the time corresponding to the currently displayed frame, and the anchor area is displayed based on the estimation result for that time. If the displayed anchor area is out of position, the user presses the rectangle button 54 and corrects the area. After the correction, a triangular symbol 76 appears at the position corresponding to that frame.
With this UI, the anchor information is displayed superimposed on the actual moving image data, so the editing result can be checked in real time and easily corrected.

The above is the outline of the present embodiment. The following improvements and modifications of the present embodiment are conceivable.

(1) Setting of text anchor
This is performed by the text anchor setting unit 111 in FIG. 1. Text data is first edited on the screen, superimposed on the video, and given an anchor. The difference from ordinary anchor setting is that, instead of designating a partial area of the reproduced image, the created text is first placed on the image and an anchor area is then set so as to surround the text. Conventionally it was common to write annotations directly into the video image, but this is inconvenient for later re-editing, such as deleting the annotation. The present embodiment eliminates this inconvenience.

When a text anchor is set, its anchor information is also stored in the anchor information table. In the table shown in FIG. 3, however, the corresponding entry is marked "text" and the text name is entered in that field.

Related information can also be linked to a text anchor. For example, the text "South Sea Fishes" can be pasted onto the aquarium scene of FIG. 9, and a text such as "The southern sea is full of brightly colored fish" can be linked to it.
(2) Changing the display state of the cursor
This is performed by the cursor changing unit 300 of FIG. 1. This function is particularly useful in the anchor non-display mode, for example when the content is used as a product. For this function, the cursor changing unit 300 comprises a position acquisition program that constantly acquires the cursor position, a judgment program that judges whether the acquired position is contained in an anchor area, and a change program that, when the cursor enters an anchor area, actually changes the cursor shape or the like according to a predetermined way of changing the cursor display state.

The cursor may be changed in the same way for every anchor or in a different way for each anchor. An example of the former is changing the cursor, normally a + sign, to ◎, or increasing the brightness of the cursor. This mode is especially beneficial when the target moves and the anchor area changes rapidly.

In the latter case, the change program may retrieve the anchor name or the like from the anchor ID of the anchor area that the cursor has entered and display it in place of the cursor. For example, when the cursor enters the anchor area of a fish, the content of the anchor's target, such as "Shark", can be shown. In this mode the user can learn the name of the fish without having to click on it.
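
The judgment and change steps of the cursor changing unit 300 described above can be pictured with the following sketch. It is only an illustration under assumptions: the names AnchorArea and cursor_label are hypothetical, the anchor areas passed in are taken to be those of the frame currently displayed, and rectangular areas are assumed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AnchorArea:
    anchor_id: str   # e.g. "anchor1"
    name: str        # anchor name looked up from the anchor ID, e.g. "Shark"
    x1: float
    y1: float
    x2: float
    y2: float

    def contains(self, x: float, y: float) -> bool:
        """Judgment step: is the cursor position inside this rectangular area?"""
        return self.x1 <= x <= self.x2 and self.y1 <= y <= self.y2


def cursor_label(areas: list[AnchorArea], x: float, y: float,
                 per_anchor: bool = True, default: str = "+") -> str:
    """Change step: return what to draw at the cursor position (x, y)."""
    hit: Optional[AnchorArea] = next((a for a in areas if a.contains(x, y)), None)
    if hit is None:
        return default                       # outside every anchor area
    return hit.name if per_anchor else "◎"   # per-anchor name, or one common mark
```

Setting per_anchor to False corresponds to the former case (the same change for every anchor), and leaving it True corresponds to the latter case, in which the anchor name replaces the cursor.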
(3) Explicit designation of intermediate reference frames
In this embodiment only the reference frames at both ends are determined at first, but when the movement of the target is irregular, the need for correction can sometimes be anticipated. In that case, the designation of an anchor area is accepted from the outset for frames other than the start frame and the end frame as well. For example, the UI shown in FIG. 9 can provide an intermediate frame designation button in addition to the start frame designation button 56 and the end frame designation button 58. Because such a frame is treated as a reference frame from the beginning, the interpolation can be regarded as starting with it already in place.

(4) Anchor areas other than rectangles
The anchor area need not be limited to a rectangle. A circle or an ellipse, for example, can be specified by its major axis, minor axis, and the coordinates of its center. A polygon can be specified by the coordinates of its vertices. When the outline of the target itself is used as the anchor area, the area can be specified by the coordinates of one point on the outline and a chain code traced from that point.

(5) Use of nonlinear interpolation
The present embodiment uses linear interpolation, which is the simplest, but nonlinear interpolation may of course be used. The equation used for interpolation can be determined, by experiment or otherwise, according to the characteristics of the moving image to be processed.

(6) Determination of start and end frames
In the present embodiment the user specifies these frames explicitly, but the following methods are also possible.
1. The user simply specifies frames and sets anchors, without being conscious of start and end frames. The specified frames become reference frames. The frame determination unit 10 takes, among the frames in which the user has set the anchor, the one with the smallest frame number as the start frame and the one with the largest as the end frame. In this case the start frame designation button 56 and the end frame designation button 58 in FIG. 9 become unnecessary.
2. The user specifies one frame, sets an anchor in it, and specifies the target on which the anchor was set. This frame becomes a reference frame. The frame determination unit 10 examines the frames before and after the reference frame, finds the frames in which the target appears and disappears, and takes these as the start frame and the end frame, respectively.

The presence or absence of the target is judged by image matching. That is, the target in the reference frame is used as a model and matching is performed against the frames before and after it. As long as a match is obtained, the range of frames searched is extended forward and backward. When a match can finally no longer be obtained, the start and end frames are determined. With this method, only one reference frame needs to be provided initially.
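
Method 2 above, in which the searched range is widened while matching still succeeds, can be sketched as follows. The function name find_start_end and the predicate matches are hypothetical; matches(frame) stands in for whatever image matching against the model target is used, which the description does not specify further.

```python
from typing import Callable


def find_start_end(reference: int, frame_count: int,
                   matches: Callable[[int], bool]) -> tuple[int, int]:
    """Extend the frame range around `reference` while the target still matches."""
    start = reference
    while start - 1 >= 0 and matches(start - 1):       # search backward in time
        start -= 1
    end = reference
    while end + 1 < frame_count and matches(end + 1):  # search forward in time
        end += 1
    return start, end
```

The sketch stops at the first frame on each side where matching fails, so a target that leaves the screen and later returns would need the grouping of anchors described in (9).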
(7) Three-dimensional display of anchor areas
The anchor setting unit 11 is given a function for displaying the set anchor areas three-dimensionally along the x, y, and time t directions. While an anchor is being edited, a display such as that of FIG. 4 may be shown on the screen as it is. This display gives the user a visual overview of the anchor. As an application of this technique, the anchor information may be edited directly on the three-dimensional display. For example, if the anchor area in the intermediate reference frame is moved to the left on the screen, the display becomes like that of FIG. 6. The user can understand the effect of editing in real time.

(8) Cross-section display of the moving image
The anchor setting unit 11 is given a function for creating a horizontal cross-section (FIG. 10(a)) and a vertical cross-section (FIG. 10(b)) of the moving image along the locus of the anchor area from the start frame to the end frame, and displaying them together with the path of the anchor. First, the x and y coordinates of the center of gravity G of the anchor area are obtained from the anchor information of each reference frame. In the case of FIG. 10(a), a straight line parallel to the x axis is drawn on the frame through the center of gravity. A plane containing these straight lines is then set between adjacent reference frames (the shaded area in the figure). Next, the moving image is cut by this plane, and the resulting cross-section is projected onto the x-t plane (the dotted area in the figure). In the case of FIG. 10(b), the same processing is performed with x and y interchanged.

If the locus of the anchor has been calculated correctly, that is, if the estimation accuracy of the anchor information in the non-reference frames is sufficiently high, the movement path of the target should appear in the two projections. For example, if the anchor is set on a red ball, a red movement path appears. This is the same as the lead appearing as a straight line when a pencil is split lengthwise. If this path is broken or thickened at some point, it suffices to correct the anchor position at that point.
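
The horizontal cross-section of FIG. 10(a) can be approximated by taking, from each frame, the pixel row that passes through the anchor's center of gravity and stacking these rows along t. The sketch below assumes frames stored as nested lists and a centroid y value already estimated for every frame (for example by the linear interpolation of Equation 1); the name xt_cross_section is hypothetical.

```python
Frame = list[list[int]]   # frame[y][x] = pixel value


def xt_cross_section(frames: list[Frame], centroid_y: list[float]) -> list[list[int]]:
    """Stack, for each frame t, the pixel row passing through the anchor's centroid."""
    section = []
    for frame, gy in zip(frames, centroid_y):
        row = min(max(int(round(gy)), 0), len(frame) - 1)   # clamp to the image
        section.append(list(frame[row]))                    # one line of the x-t image
    return section
```

If the centroid estimates follow the target, its pixels form an unbroken trace from the first row of the result to the last; a gap in the trace marks a frame whose anchor position needs correction. The vertical cross-section of FIG. 10(b) is obtained in the same way with columns in place of rows.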
(9) Grouping of anchors
The anchor information editing unit 110 is given a function for grouping separately set pieces of anchor information and handling them as one virtual anchor. For example, when a person A is on screen in frames N1 to N2 and in frames N3 to N4 but not in frames N2 to N3, the anchor of person A in frames N1 to N2 and the anchor of person A in frames N3 to N4 are treated as one. This reduces the work of setting and correcting anchor information. A person A and a person B appearing in the same frame can also be grouped.
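
One possible representation of such a virtual anchor is a named group of member anchors, each with its own start and end frame. The sketch below is an assumption for illustration only; the class names Anchor and VirtualAnchor and the method active_member do not come from the description.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Anchor:
    anchor_id: str
    start_frame: int
    end_frame: int


@dataclass(frozen=True)
class VirtualAnchor:
    name: str
    members: tuple[Anchor, ...]   # separately set anchors handled as one

    def active_member(self, frame: int) -> Optional[Anchor]:
        """Return the member anchor covering `frame`, or None if no member does."""
        return next((a for a in self.members
                     if a.start_frame <= frame <= a.end_frame), None)
```

Links or names attached to the group then apply to every member, so person A need only be described once even though the anchor was set in two separate frame ranges.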
(10) List display of anchor information
The anchor information editing unit 110 is given a function for displaying a list of the anchor information already set. For example, anchor names such as "FISH1" and "FISH2" are listed on the screen together with the video title "AQUARIUM". When the user selects the name of the anchor to be checked, the apparatus may jump to the start frame of that anchor and play back the moving image.

(11) Search for anchors
A search UI for anchor information is provided. Text information such as the name of the anchor information to be searched for is entered as a keyword, and entries having that keyword are retrieved from the anchor information storage unit 21 and displayed. Alternatively, the movement of the anchor area may be used as the search key. To find an object that moves to the right, for example, a "→" button is pressed in the search UI. The anchor search unit 15 calculates the locus of each anchor area and finds and displays the anchors that contain an object moving to the right.
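
A search by movement such as the "→" example can be sketched by comparing the center of gravity of each anchor area at its first and last reference frames. This is a simplified assumption: the description only says that the locus of each anchor area is calculated, and the names centroid_x and anchors_moving_right, the table layout, and the threshold min_shift are hypothetical.

```python
RectTuple = tuple[float, float, float, float]   # (x1, y1, x2, y2)


def centroid_x(rect: RectTuple) -> float:
    """x coordinate of the center of gravity of a rectangular anchor area."""
    x1, _y1, x2, _y2 = rect
    return (x1 + x2) / 2


def anchors_moving_right(table: dict[str, dict[int, RectTuple]],
                         min_shift: float = 0.0) -> list[str]:
    """Return IDs of anchors whose centroid x increases from start to end frame."""
    hits = []
    for anchor_id, refs in table.items():
        frames = sorted(refs)                       # reference frame numbers
        shift = centroid_x(refs[frames[-1]]) - centroid_x(refs[frames[0]])
        if shift > min_shift:
            hits.append(anchor_id)
    return hits
```

A stricter variant could require the centroid to move right across every pair of adjacent reference frames rather than only overall.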
(12) List display of anchor information display images
The anchor information editing unit 110 is given a function that selects frames (reference frames and non-reference frames) between the start frame and the end frame, creates anchor information display images by superimposing on them the anchor information set by the anchor setting unit 11 or estimated by the anchor estimating unit 12, and displays these images as a list in time order. For example, as shown in FIG. 11, frames are selected at an interval Δt, with the start frame 81 and the end frame 82 at the two ends. Next, an anchor information display image, in which the anchor information 80 is superimposed on the frame, is created for each of these frames, and the images are displayed side by side in order of display time, earliest first. With this configuration, the user can see at a glance whether the anchor settings are appropriate. The positioning work using the box 66 in FIG. 9 can therefore be omitted, which makes anchors easy to modify. The frames need not be selected at regular intervals; for example, only the reference frames may be selected. The anchor information may also be edited directly in the displayed anchor information display images by dragging with the mouse.
(13) Correction of anchor information during reproduction of the moving image
As shown in FIG. 12, a moving image reproducing unit 118 that reproduces the moving image on the screen, an anchor information display unit 120 that displays the anchor information of the frame being displayed, and an anchor information correction unit 119 are provided. When the user performs one or more anchor information correction operations during playback of the moving image, the anchor information correction unit 119 identifies the frame that was being played back at the time of each operation, promotes it to a reference frame if it is a non-reference frame, and corrects the anchor information based on each correction operation. With this configuration, the moving image reproducing unit 118 first displays the moving image in the image display area 50 of FIG. 9. At the same time, the anchor information display unit 120 displays the anchor information of the frame being displayed as the anchor area 60. When the user finds a frame in which the anchor area 60 is displaced from the target, the user clicks the center of the target in the moving image with the mouse. In response, the anchor information correction unit 119 identifies the frame that was displayed when the click occurred and corrects the anchor information by generating anchor information centered on the clicked point.
The size of the newly set anchor area may, for example, be the same as that of the original anchor area. From then on, this frame is treated as a reference frame. With this configuration, the target position can be specified successively while the moving image is playing, which saves the trouble of checking and correcting it later.

Embodiment 2
In the first embodiment, the anchor information was mainly calculated automatically by interpolation and corrected manually. In this embodiment, anchors are set automatically on the basis of analysis of the moving image in a certain number of frames, which are taken as reference frames, and the interpolation method of the first embodiment is used between these reference frames. Because frames corresponding to the intermediate reference frames of the first embodiment exist from the beginning, the correction effort is reduced.

FIG. 13 is a configuration diagram of the anchor setting unit 11 of the moving image hypermedia device according to this embodiment. The configuration other than the anchor setting unit 11 is the same as in FIG. 1.

In FIG. 13, the automatic anchor setting unit 112 has a proximity frame extraction unit 117. The proximity frame extraction unit 117 extracts non-reference frames at fixed intervals between adjacent reference frames and promotes them to reference frames. The automatic anchor setting unit 112 also has a motion vector use setting unit 113, a contour information use setting unit 114, and a pattern matching use setting unit 116. In principle it would suffice to implement any one of these three setting units, but in this embodiment all of them are implemented and one is selected according to the situation.

As described later, the reference frame deletion unit 115 returns to non-reference frames those reference frames set by the automatic anchor setting unit 112 that are redundant. The operation of this configuration is described below.

[1] Automatic setting of anchors using motion vectors
The feature of this processing is its two-stage structure: the motion vectors of a block from the start frame to the end frame are first obtained, and the accuracy of estimating the target position is then raised by judging the degree of coincidence between virtual movement paths of the block and the motion vectors.

1. Acquisition of motion vectors
Let the times of the start frame and the end frame be t0 and t1, respectively. In addition to these, the proximity frame extraction unit 117 first changes some of the non-reference frames into reference frames. Here, for simplicity, every fifth frame is changed, and the elapsed time between reference frames is normalized to 1. To obtain the motion vector of a specific anchor between the start frame and the end frame, block matching is performed with the image area near the center of gravity of the anchor as the block. The frame at an arbitrary time t is written frame(t).

FIG. 14 is a flowchart showing the procedure for acquiring motion vectors according to the present embodiment. As shown in the figure, the time counter t is first set to t0 (S100). Next, from among the blocks set in the start frame, the block whose motion vector is to be obtained is determined.
You. The motion vector usage setting unit 113 specifies the specified anchor.
Area including the center of gravity of the
Block (hereinafter referred to as “anchor block”)
It is stored (S101). Next, the picture of frame (t)
The image data I (t) and I (t +
1) is acquired (S102). I (t) is included in the frame.
This is a set of pixel values p of each pixel. Thereafter, the block is placed in the frame (t + 1).
Search for the best matching while moving with (S10)
3). Each pixel value of the block itself is found from I (t).
Therefore, the block is arranged at an arbitrary position in the frame (t + 1).
And calculate the squared error of the pixel value between overlapping pixels
This is integrated over the entire block. A little block
This integration is performed while moving
The position is determined to be the destination of the block. If the destination is determined, the frame (t)
From block to block in frame (t + 1)
Since the amount and direction of movement to
It is acquired as tor V (t) (S104). Where t
+1 has reached the end frame time t1.
(S105), and if not reached, increment t.
(S106) to repeatedly acquire a motion vector.
You. If t + 1 is equal to t1, the V obtained so far
After saving (t) (S107), the process ends. FIG. 15 shows a case where t0 = 0 and t1 = 3.
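The block-matching loop of FIG. 14 (S100 to S107) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: frames are small grayscale images held as nested lists, the search is exhaustive over the whole frame, and the names `ssd`, `motion_vectors` and `make_frame` are hypothetical. Each returned pair is the (x, y) part of V(t); the time component is 1 because the elapsed time between frames is normalized.

```python
from typing import List, Tuple

Frame = List[List[int]]  # grayscale image as rows of pixel values


def ssd(prev: Frame, nxt: Frame, top: int, left: int,
        cand_top: int, cand_left: int, size: int) -> int:
    """Sum of squared pixel differences between the block at (top, left)
    in `prev` and the candidate block at (cand_top, cand_left) in `nxt`."""
    total = 0
    for dy in range(size):
        for dx in range(size):
            diff = prev[top + dy][left + dx] - nxt[cand_top + dy][cand_left + dx]
            total += diff * diff
    return total


def motion_vectors(frames: List[Frame], top: int, left: int,
                   size: int) -> List[Tuple[int, int]]:
    """Follow one block through `frames` and return (dx, dy) for each step."""
    height, width = len(frames[0]), len(frames[0][0])
    vectors = []
    for t in range(len(frames) - 1):            # S105/S106: repeat until t + 1 == t1
        best = None
        for cy in range(height - size + 1):     # S103: try every block position
            for cx in range(width - size + 1):
                err = ssd(frames[t], frames[t + 1], top, left, cy, cx, size)
                if best is None or err < best[0]:
                    best = (err, cy, cx)
        _, new_top, new_left = best
        vectors.append((new_left - left, new_top - top))  # S104: V(t)
        top, left = new_top, new_left           # the block continues from its destination
    return vectors


def make_frame(top: int, left: int) -> Frame:
    """6x6 black frame with a bright 2x2 target at (top, left)."""
    frame = [[0] * 6 for _ in range(6)]
    for dy in range(2):
        for dx in range(2):
            frame[top + dy][left + dx] = 9
    return frame


# Four frames (t0 = 0, t1 = 3) give three motion vectors V(0) to V(2).
frames = [make_frame(1, 1), make_frame(1, 2), make_frame(2, 3), make_frame(3, 3)]
vectors = motion_vectors(frames, top=1, left=1, size=2)
```

With this synthetic target the minimum of the summed squared error is unique in every frame. On real images a flat or repetitive block can match equally well at several positions, which is one reason the text goes on to judge the coincidence of whole routes.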
In FIG. 15, with x and y defined on the frame and t defined in the time direction, each V(t) can be represented by three components (x, y, t).
2. Judging the degree of coincidence
Each frame is divided into blocks of the same size as the block set for motion vector acquisition, and all routes that the target may have followed are found. FIG. 16 shows one of these routes. In the figure, each frame is divided into 16 blocks. The start point of the route in the start frame is matched to the anchor block of FIG. 14, and the end point of the route in the end frame is likewise matched to the anchor block in that frame. Under these conditions there are 16 × 16 possible routes.
Next, for such a route (hereinafter referred to as a "virtual route"), the vector of each section of the route (hereinafter referred to as a "route vector") v(t), shown in FIG. 16, is defined as follows. A route vector is determined by the direction taken when the virtual route is followed from one frame to the next. v(t) is also described by three components (x, y, t). Here, the angle formed by V(t) and v(t) is defined as θt, and f(t) = cos θt is calculated by the following equation using the inner product.
[0073]   f(t) = (V(t), v(t)) / (|V(t)| · |v(t)|) (Equation 2)
FIG. 17 adds V(t) of FIG. 15 to the diagram of FIG. 16 and shows the meaning of θt. The larger the value of Equation 2, the higher the degree of coincidence between the virtual route and the motion vector in that section. However, even if the degree of coincidence in one section is high, the degree of coincidence as a whole must be regarded as low if that of the other sections is very low. Therefore, to evaluate the overall degree of coincidence while taking the degree of coincidence of each section into account, the following evaluation formula is introduced.
[0074]   g(t) = max {f(t-1) + g(t-1)} (Equation 3)
By calculating Equation 3 recursively, the virtual route with the highest degree of coincidence up to time t is found. If this calculation is carried through to the end frame, the virtual route with the highest overall degree of coincidence is obtained, and this route is regarded as the route of the target. After that, the anchor is set automatically on the assumption that it exists at the intersection of this route with each reference frame. The anchor information set in this way need only be added to the anchor information table of the first embodiment. Note that for frames other than the reference frames, the anchor information may be calculated sequentially by interpolation, using the same method as in the first embodiment.
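The route search of Equations 2 and 3 can be read as a dynamic-programming recursion over the blocks of successive frames. The sketch below is one such reading, offered under assumptions rather than as the method of the patent: the maximum in Equation 3 is taken over the candidate blocks of the preceding frame, motion vectors are expressed in block units, the time component of every vector is 1, and the names `cos_angle` and `best_route` are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Cell = Tuple[int, int]                 # (x, y) index of a block in the grid
Vec3 = Tuple[float, float, float]      # (x, y, t) components


def cos_angle(a: Vec3, b: Vec3) -> float:
    """Equation 2: f = (a, b) / (|a| * |b|)."""
    dot = sum(p * q for p, q in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def best_route(motion: List[Tuple[float, float]], start: Cell, end: Cell,
               grid: int) -> List[Cell]:
    """Return the block sequence with the largest summed coincidence."""
    steps = len(motion)                           # number of frame-to-frame sections
    cells = [(x, y) for y in range(grid) for x in range(grid)]
    # g maps a block of the current frame to (best score so far, route to it)
    g: Dict[Cell, Tuple[float, List[Cell]]] = {start: (0.0, [start])}
    for t in range(steps):
        big_v = (motion[t][0], motion[t][1], 1.0)  # measured motion vector V(t)
        targets = [end] if t == steps - 1 else cells  # the end point is fixed
        nxt: Dict[Cell, Tuple[float, List[Cell]]] = {}
        for cell in targets:
            for prev, (score, route) in g.items():
                small_v = (cell[0] - prev[0], cell[1] - prev[1], 1.0)  # route vector v(t)
                cand = score + cos_angle(big_v, small_v)               # f + g (Equation 3)
                if cell not in nxt or cand > nxt[cell][0]:
                    nxt[cell] = (cand, route + [cell])
        g = nxt
    return g[end][1]


# 4 x 4 = 16 blocks per frame, three sections (t0 = 0, t1 = 3).
route = best_route([(1, 0), (1, 1), (0, 1)], start=(0, 0), end=(2, 2), grid=4)
```

Because only the best score per block is carried forward, the work grows with the number of frames times the square of the number of blocks, instead of enumerating all 16 × 16 routes explicitly.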
[2] Automatic setting of anchors by using contour information
As another method of automatic anchor setting, the position of the target can be estimated from the movement of the target's contour. The contour information use setting unit 114 generates a contour image for each frame by repeated processing of the same kind as in the flowchart referred to above. The contour image is a binarized image whose value is 1 on the contour and 0 elsewhere, and it is obtained by applying a compass-gradient type filter to the image. Once the contour images have been obtained, anchor information may be set on the assumption that the anchor moves in exactly the same way as the contour.
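A compass-gradient filter of the kind mentioned for the contour image can be sketched as below. The text names only the filter type, so the specific masks (the eight Kirsch masks), the threshold and the names `compass_masks` and `contour_image` are assumptions chosen for illustration.

```python
from typing import List

Image = List[List[int]]

# Outer-ring positions of a 3x3 mask, clockwise from the top-left corner.
RING = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
KIRSCH_RING = [5, 5, 5, -3, -3, -3, -3, -3]   # ring weights of the "north" mask


def compass_masks() -> List[List[List[int]]]:
    """Build eight directional masks by rotating the ring in 45-degree steps."""
    masks = []
    for shift in range(8):
        mask = [[0] * 3 for _ in range(3)]     # the centre weight stays 0
        for i, (r, c) in enumerate(RING):
            mask[r][c] = KIRSCH_RING[(i - shift) % 8]
        masks.append(mask)
    return masks


def contour_image(img: Image, threshold: int) -> Image:
    """Binary image: 1 where the strongest directional response exceeds
    `threshold`, 0 elsewhere (border pixels are left at 0)."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    masks = compass_masks()
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            strongest = max(
                sum(m[r][c] * img[y - 1 + r][x - 1 + c]
                    for r in range(3) for c in range(3))
                for m in masks
            )
            out[y][x] = 1 if strongest > threshold else 0
    return out


# Dark left half, bright right half: a vertical edge between columns 2 and 3.
img = [[0, 0, 0, 10, 10, 10] for _ in range(6)]
edges = contour_image(img, threshold=100)
```

In this example the response is 150 at column 2, 90 at column 3 and 0 in the flat regions, so a threshold of 100 marks a one-pixel-wide contour.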
[3] Automatic setting of anchors by using matching
As yet another method of automatic anchor setting, the position of the target can be estimated by pattern matching, as shown in FIG. 18. In this method too, the adjacent frame extraction unit 117 first provides a certain number of reference frames in advance. Next, a model 134 for pattern matching is created from the anchor area 132 set in the start frame 130, and the region 138 with the highest degree of matching is determined in the adjacent reference frame 136. Pattern matching methods include the template matching method, which superimposes the image data of the model itself, and the structural matching method, which performs the superposition based on the positional relationship of feature points extracted from the image. Pattern matching is performed around the position of the model. Once the region 138 in the second reference frame has been found in this way, the target can be tracked by using this region 138 as a new model and repeating the process. Note that a good match may not be obtained if the movement or deformation of the target is large, or if the interval between reference frames is set too wide. In such a case, the adjacent frame extraction unit 117 narrows the interval between the reference frames to provide a larger number of reference frames, and the setting is then corrected.
[4] Deletion of unnecessary reference frames
In the above example, a reference frame was provided every five frames. However, if the target is in uniform linear motion, for example, the start frame and the end frame suffice as reference frames. Even when the target is not in uniform linear motion all the way from the start frame to the end frame, only the reference frames at both ends of a period need be provided for any period during which it is. Each reduction in the number of reference frames reduces the computational load. This is particularly effective in [1], because the number of virtual routes is drastically reduced.
From this viewpoint, the reference frame deletion unit 115 deletes unnecessary reference frames. FIGS. 19 to 21 show how reference frames are deleted. In these figures, the horizontal axis is time and the vertical axis is the distance from the origin of the x-y coordinates provided on the frame. In each figure, ○ schematically indicates an anchor area. Deletion follows the procedure below.
[FIG. 19] Initially, six reference frames are provided, including the reference frames at both ends. The anchor of the start frame and the anchor of the end frame are connected by a straight line, and the distance between this line and each anchor is calculated. If there is an anchor whose distance is no more than a predetermined value, the reference frame at that time is deleted. In the figure there is no anchor to be deleted. Next, the anchor farthest from the straight line (hereinafter referred to as the "farthest anchor") is found. Here, the anchor at t = 3 is the farthest anchor.
[FIG. 20] The straight line is removed, and the anchor of the start frame, the farthest anchor, and the anchor of the end frame are connected in this order by a polygonal line. The distance between the polygonal line and each anchor is then calculated again. Because its distance is now smaller than the predetermined value, the anchor at t = 4 is deleted. The farthest anchor changes to the anchor at t = 2.
[FIG. 21] The polygonal line is modified to pass through the new farthest anchor. The reference frame at t = 1, whose distance to the new polygonal line is now no more than the predetermined value, is deleted. This ends the process.
In this example, two reference frames are deleted. If there are many initial reference frames, the three steps of 1. deleting reference frames whose distance is no more than the predetermined value, 2. searching for the farthest anchor, and 3. correcting the polygonal line may be repeated.
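The deletion procedure of FIGS. 19 to 21 can be sketched as follows. The sketch makes two assumptions that the text does not fix: each anchor is reduced to a single value per reference frame (the distance from the origin plotted on the vertical axis), and the "distance" to the polygonal line is measured as the vertical gap between that value and the linearly interpolated value. The names `interpolate` and `prune_reference_frames` and the sample values are hypothetical.

```python
from typing import Dict, List


def interpolate(kept: List[int], value: Dict[int, float], t: int) -> float:
    """Value at time t on the polygonal line through the kept reference frames."""
    for a, b in zip(kept, kept[1:]):
        if a <= t <= b:
            return value[a] + (value[b] - value[a]) * (t - a) / (b - a)
    raise ValueError("t lies outside the polygonal line")


def prune_reference_frames(value: Dict[int, float], tol: float) -> List[int]:
    """Return the times of the reference frames that must be kept."""
    times = sorted(value)
    vertices = [times[0], times[-1]]       # start with one straight line
    candidates = set(times[1:-1])          # frames neither kept nor deleted yet
    while candidates:
        error = {t: abs(value[t] - interpolate(vertices, value, t))
                 for t in candidates}
        # 1. delete reference frames that the line already reproduces
        candidates -= {t for t, e in error.items() if e <= tol}
        if not candidates:
            break
        # 2. find the farthest anchor, 3. bend the line through it
        farthest = max(candidates, key=lambda t: error[t])
        candidates.remove(farthest)
        vertices = sorted(vertices + [farthest])
    return vertices


# Six reference frames, chosen so that the run mirrors FIGS. 19 to 21:
# nothing is deleted at first, t = 3 is the farthest anchor, then t = 4
# is deleted, t = 2 becomes the farthest anchor, and finally t = 1 is deleted.
anchors = {0: 0.0, 1: 1.2, 2: 3.0, 3: 10.0, 4: 5.2, 5: 0.0}
kept = prune_reference_frames(anchors, tol=0.5)
```

The reference frames at t = 0, 2, 3 and 5 remain, so two of the six are returned to non-reference frames, as in the example of the text.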
The above is the outline of the present embodiment. The following improvements or modifications to this embodiment are conceivable.
(1) Modification of Equation 2
In Equation 2, f(t) = cos θt is adopted, but another function may be used. Any function that increases as θt decreases can be a candidate for f(t).
(2) How to take a block
In FIG. 15, the block was defined so as to include the vicinity of the center of gravity of the anchor area, but it may be defined differently. For example, the anchor area itself may be used as the anchor block. Similarly, in FIG. 16, the blocks may be determined regardless of the size of that block.
(3) Another method of the above [3] (No. 1)
This is another mode of the automatic anchor setting by matching described above. When matching is performed frame after frame with the area of one frame as the model, errors accumulate and the result may gradually drift away from the target. For this reason, the judgment takes into account not only the matching from the adjacent reference frame but also the matching results from other frames.
As shown in FIG. 22, the start frame and the end frame are adopted here as the reference frames that are separated in time. Suppose that the reference frame 404, in which a new anchor position is to be specified, is at time t + Δt. The model A 400 of the anchor area of the start frame, the model B 401 of the anchor area of the end frame, and the model C 403 of the anchor area of the reference frame 402 at time t are all known. Matching is therefore performed between each of these three reference frames and the reference frame 404 at time t + Δt. If all the matching results agree, the tracking result for that area is reliable. If the matching results do not agree, the anchor position is determined, for example, by one of the following methods.
1. The three regions obtained on the reference frame 404 at time t + Δt by the three matchings are overlapped, and an anchor area of the same size as the original area is created, centered on the center of the overlapping part.
2. Multiple matching results are found for one model, model C 403, and for each of them it is determined whether it contains the regions obtained from the matching results of model A 400 and model B 401. If they are contained at or above a certain percentage, an anchor area of the same size as the original area is created, centered on the center of the overlapping part of the regions obtained from the matching results.
Note that the start and end frames are used here in addition to the adjacent reference frame, but there is freedom in the combination. For example, the adjacent reference frame and any number of reference frames separated by a fixed time distance may be adopted.
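Method 1 above, in which the three matched regions are overlapped and a new anchor area is centered on the overlap, can be sketched as follows. Regions are assumed to be axis-aligned rectangles given as (left, top, width, height); that representation and the names `intersection` and `consensus_anchor` are assumptions made for the example.

```python
from typing import Optional, Sequence, Tuple

Rect = Tuple[float, float, float, float]   # (left, top, width, height)


def intersection(rects: Sequence[Rect]) -> Optional[Rect]:
    """Common part of all rectangles, or None if they do not all overlap."""
    left = max(r[0] for r in rects)
    top = max(r[1] for r in rects)
    right = min(r[0] + r[2] for r in rects)
    bottom = min(r[1] + r[3] for r in rects)
    if right <= left or bottom <= top:
        return None
    return (left, top, right - left, bottom - top)


def consensus_anchor(matches: Sequence[Rect],
                     size: Tuple[float, float]) -> Optional[Rect]:
    """Anchor area of the original `size`, centered on the overlap of `matches`."""
    common = intersection(matches)
    if common is None:
        return None                          # the matchings disagree completely
    cx = common[0] + common[2] / 2
    cy = common[1] + common[3] / 2
    return (cx - size[0] / 2, cy - size[1] / 2, size[0], size[1])


# Regions found on the frame at time t + Δt with models A, B and C.
from_a = (10, 10, 20, 20)
from_b = (14, 12, 20, 20)
from_c = (12, 16, 20, 20)
anchor = consensus_anchor([from_a, from_b, from_c], size=(20, 20))
```

Here the three regions share the rectangle (14, 16, 16, 14), whose center is (22, 23), so the new 20 × 20 anchor area starts at (12, 13).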
(4) Another method of the above [3] (No. 2)
As shown in FIG. 23, an automatic setting reliability determination unit 130 and an automatic setting reliability display unit 131 are provided in the automatic anchor setting unit 112. The automatic setting reliability determination unit 130 determines the reliability of tracking from the degree of overlap between the area obtained by matching in the forward direction from the start frame and the anchor area specified in the end frame. For example, the tracking result is judged reliable if the overlapping area is 70% or more, and unreliable if it is 0% or less. The automatic setting reliability display unit 131 displays the reliability of tracking (whether tracking is reliable, or the percentage).
For example, as shown in FIG. 24, when the tracking result 141 obtained by matching deviates from the original trajectory 140, the degree of overlap is low and tracking is judged unreliable. In that case, matching is next performed in the reverse direction, using the anchor area of the end frame as the model, and the estimated position of the target is tracked again. At this time, in each reference frame, the area obtained by the earlier forward matching is compared with the anchor area obtained by the current reverse matching, and tracking ends if they overlap by a predetermined ratio or more. Alternatively, tracking may be ended at a specified frame. The frame at which tracking ends may be specified by the ratio of its distances from the start frame and the end frame. The method of judging the overlap and the method of specifying the frame at which tracking ends may be used together.
As described above, with this method, even if an error occurs during tracking, subsequent deterioration of the tracking result is avoided and the final correction work is reduced. In addition, when reverse matching is ended according to the degree of overlap, only the necessary part is corrected, which shortens the processing time. Note that even if the reliability is low, the reliability may simply be displayed instead of reverse matching being started automatically. In that case, the user may execute the reverse matching or may make the correction manually. In either case, the automatic setting reliability display unit 131 lets the user know whether tracking is good and take appropriate measures.
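The overlap test of the automatic setting reliability determination unit 130 can be sketched as follows. Two points are assumptions: the overlap is expressed as a fraction of the anchor area specified in the end frame, and a single threshold of 70% is used, because the lower figure in the text is not clear. The names `overlap_ratio` and `is_reliable` are hypothetical.

```python
from typing import Tuple

Rect = Tuple[float, float, float, float]   # (left, top, width, height)


def overlap_ratio(tracked: Rect, specified: Rect) -> float:
    """Overlapping area as a fraction of the area of `specified`."""
    width = (min(tracked[0] + tracked[2], specified[0] + specified[2])
             - max(tracked[0], specified[0]))
    height = (min(tracked[1] + tracked[3], specified[1] + specified[3])
              - max(tracked[1], specified[1]))
    if width <= 0 or height <= 0:
        return 0.0                          # the two areas do not overlap
    return (width * height) / (specified[2] * specified[3])


def is_reliable(tracked: Rect, specified: Rect, threshold: float = 0.7) -> bool:
    """True if forward tracking ends close enough to the specified anchor area."""
    return overlap_ratio(tracked, specified) >= threshold


specified_end = (100, 100, 20, 20)          # anchor area set by the user in the end frame
good_track = (102, 101, 20, 20)             # forward matching ended nearby
drifted_track = (115, 110, 20, 20)          # forward matching drifted away

good_ratio = overlap_ratio(good_track, specified_end)        # 18 * 19 / 400
drifted_ratio = overlap_ratio(drifted_track, specified_end)  # 5 * 10 / 400
```

The same ratio can be displayed as the percentage mentioned for the display unit 131, and reused in each reference frame to decide when reverse matching may stop.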
(5) Another method of the above [3] (No. 3)
The configuration of FIG. 12, that is, the moving image reproducing unit 118, the anchor information display unit 120, and the anchor information correction unit 119, is provided in the automatic anchor setting unit 112. The moving image reproducing unit 118 arranges the frames constituting the moving image in chronological order and displays them at appropriate time intervals. When an anchor information correction operation is performed at any point during playback, the anchor information correction unit 119 corrects the anchor information of the frame displayed at that time. At the same time, it invalidates the results of automatic anchor setting for the frames displayed during the immediately preceding specified number of frames or specified period.
The operation of this configuration is as follows. It is assumed here that display proceeds while matching is performed in the forward direction from the start frame, in step with playback. In this operation, as described above, once matching goes wrong the anchor information may gradually drift. The user watches the reproduced video and anchor information and clicks the screen when the anchor area has deviated from the target. Playback stops at this point. If, for example, the user clicks the center of the target, the anchor position is corrected so that the clicked point becomes its center. Subsequent matching is based on the corrected anchor information, so it proceeds well.
With this method, because the user clicks the screen while the moving image is playing, the delay in the user's action must be taken into account. That is, when the user clicks on recognizing that the anchor area is off the target, the deviation has presumably been progressing gradually over several frames. The anchor information correction unit 119 therefore invalidates the automatically set anchor information for a predetermined number of frames preceding the frame whose anchor information was corrected.
FIG. 25 shows the relationship between the original trajectory of the target and the tracking result obtained by matching, together with the operation of the anchor information correction unit 119. In the figure, the solid line 150 indicates that the trajectory of the target and the tracking result coincide, while the broken line 151 indicates that the tracking result is off the trajectory. As shown in the figure, the tracking result is reliable from time t0 to t1 but starts to deviate at time t1. The user notices this and clicks the screen at time t2. As a result, correct tracking is performed again from time t2 to t3. The incorrect tracking result that remains for times t1 to t2 is invalidated. For the invalidated part, the missing tracking result can be filled in, for example, by linearly interpolating the anchor information at times t1 and t2. The number of frames whose anchor information is invalidated may be specified in advance or at the time of correction.
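The correction described for the anchor information correction unit 119 can be sketched as follows: the clicked position replaces the anchor of the clicked frame, a fixed number of preceding frames is invalidated, and the gap is refilled by linear interpolation between the last trusted frame (t1) and the clicked frame (t2). Anchors are reduced to center points for brevity; that simplification and the name `apply_correction` are assumptions.

```python
from typing import Dict, Tuple

Point = Tuple[float, float]


def apply_correction(track: Dict[int, Point], clicked_frame: int,
                     clicked_pos: Point, invalidate: int) -> Dict[int, Point]:
    """Return a corrected copy of `track`.

    The `invalidate` frames before `clicked_frame` are discarded and refilled
    by linear interpolation between the last trusted frame and the click.
    """
    fixed = dict(track)
    fixed[clicked_frame] = clicked_pos
    t1 = clicked_frame - invalidate - 1      # last frame that is still trusted
    x1, y1 = fixed[t1]
    x2, y2 = clicked_pos
    span = clicked_frame - t1
    for t in range(t1 + 1, clicked_frame):   # the invalidated frames
        a = (t - t1) / span
        fixed[t] = (x1 + (x2 - x1) * a, y1 + (y2 - y1) * a)
    return fixed


# Tracking drifts after frame 1; the user clicks the target in frame 5.
track = {0: (0.0, 0.0), 1: (10.0, 0.0), 2: (22.0, 3.0),
         3: (35.0, 9.0), 4: (50.0, 16.0), 5: (60.0, 20.0)}
fixed = apply_correction(track, clicked_frame=5, clicked_pos=(50.0, 0.0),
                         invalidate=3)
```

Frames 2 to 4 are replaced by points on the straight line from (10, 0) to (50, 0), while frames 0 and 1 keep their automatically set anchors.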
Embodiment 3
By applying the moving image hypermedia devices of the first and second embodiments, the following devices or systems can be built.
1. Interactive video teaching material production device
The moving image hypermedia device according to the present invention is also well suited to producing CAI content. That is, anchors are set on video teaching material and the necessary additional information is linked to them. FIG. 26 shows video teaching material produced by this device. As shown in the figure, explanation A is linked to anchor A, explanation B is linked to anchor B, and so on. While the video is playing, the student clicks an object on the screen for which more explanation is wanted. If the clicked object is associated with anchor A, explanation A is displayed on the screen.
2. Interactive video server system
The moving image hypermedia device according to the present invention is also well suited to video server systems. FIG. 27 is a block diagram of this video server system. As shown in the figure, the system consists of a server 200 and a client 250, which share between them the configuration described earlier.
The server 200 includes a data storage unit 204 that stores the moving images, their anchor information, and the related data linked to the anchors; an anchor estimation unit 206; and a hyperlink search unit 208 that searches for the related data linked to an arbitrary anchor. The anchor estimation unit 206 estimates the anchor information in non-reference frames.
The client 250, on the other hand, includes an anchor determination unit 252 that determines which anchor area was clicked when the user clicks a target in the video. In this configuration, when the user at the client 250 clicks an object on the screen, the anchor determination unit 252 identifies the clicked anchor. This information is sent to the server 200. The hyperlink search unit 208 of the server 200 retrieves the related data linked to that anchor from the data storage unit 204 and sends it to the client 250.
As described above, with this system the moving image data and the anchor information are managed collectively on the server 200 side, so many users can view the video and the information linked to it.
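The division of work between the anchor determination unit 252 on the client and the hyperlink search unit 208 on the server can be sketched as two small functions. This is only an illustration: anchor areas are assumed to be rectangles for the frame being displayed, the network transfer between client and server is omitted, and the names `find_anchor` and `lookup_link` and the sample data are hypothetical.

```python
from typing import Dict, Optional, Tuple

Rect = Tuple[int, int, int, int]            # (left, top, width, height)


def find_anchor(anchors: Dict[str, Rect], x: int, y: int) -> Optional[str]:
    """Client side: name of the anchor whose area contains the click, if any."""
    for name, (left, top, width, height) in anchors.items():
        if left <= x < left + width and top <= y < top + height:
            return name
    return None


def lookup_link(links: Dict[str, str], anchor: Optional[str]) -> Optional[str]:
    """Server side: related data linked to the anchor, if any."""
    if anchor is None:
        return None
    return links.get(anchor)


# Anchor areas of the frame being displayed, and the links held by the server.
anchors_in_frame = {"A": (10, 10, 40, 30), "B": (100, 50, 20, 20)}
links = {"A": "explanation A", "B": "explanation B"}

hit = find_anchor(anchors_in_frame, x=25, y=20)     # the click falls inside anchor A
shown = lookup_link(links, hit)
miss = lookup_link(links, find_anchor(anchors_in_frame, x=0, y=0))
```

Only the identity of the clicked anchor needs to travel to the server, which keeps the moving image data, anchor information and linked data in one place.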
[0101] According to the moving image anchor setting device of the present invention, simply setting anchor information for the reference frames allows the anchor information of the non-reference frames, that is, the remaining frames, to be estimated, so there is no need to set anchor information for the non-reference frames. As a result, the labor of setting anchors can be saved.
[0102] In addition, because reference frame deletion means is included, the number of reference frames for which anchor information must be held is reduced, which reduces the storage requirement.

BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a configuration diagram of a system including a moving image hypermedia device according to Embodiment 1.
FIG. 2 is a flowchart showing a procedure for setting and correcting an anchor according to Embodiment 1.
FIG. 3 is a diagram showing a table of anchor information set in Embodiment 1.
FIG. 4 is a diagram showing an interpolation calculation method for anchor information according to Embodiment 1.
FIG. 5 is a diagram showing a table obtained by adding anchor information of an intermediate reference frame to FIG. 3.
FIG. 6 is a diagram showing a method of performing an interpolation calculation based on three frames, an intermediate reference frame and the reference frames at both ends, in Embodiment 1.
FIG. 7 is a flowchart showing a link setting and search procedure according to Embodiment 1.
FIG. 8 is a diagram showing a table of link information set in Embodiment 1.
FIG. 9 is a diagram showing an example of a UI screen for anchor setting.
FIG. 10(a) is a horizontal cross-sectional view of the trajectory of the anchor area from the start frame to the end frame, and FIG. 10(b) is likewise a vertical cross-sectional view.
FIG. 11 is a diagram showing anchor information display images displayed in chronological order by the anchor information editing unit in Embodiment 1.
FIG. 12 is a diagram showing an example of the internal configuration of the anchor information editing unit of the moving image hypermedia device according to Embodiment 1.
FIG. 13 is a configuration diagram of the anchor setting unit of the moving image hypermedia device according to Embodiment 2.
FIG. 14 is a flowchart showing a procedure for acquiring motion vectors in Embodiment 2.
FIG. 15 is a diagram showing an example of the motion vectors V(0) to V(2) obtained when t0 = 0 and t1 = 3 in FIG. 14.
FIG. 16 is a diagram showing one of the routes that the anchor may have followed.
FIG. 17 is a diagram in which V(t) of FIG. 15 is added to v(t) of FIG. 16.
FIG. 18 is a diagram showing a method for automatically setting an anchor based on pattern matching in Embodiment 2.
FIG. 19 is a diagram showing how reference frames are deleted in Embodiment 2.
FIG. 20 is a diagram showing how reference frames are deleted in Embodiment 2.
FIG. 21 is a diagram showing how reference frames are deleted in Embodiment 2.
FIG. 22 is a diagram showing a method of performing matching in Embodiment 2 using, in addition to the adjacent reference frame, reference frames separated by a certain time distance.
FIG. 23 is a diagram showing a configuration example of the automatic anchor setting unit of the moving image hypermedia device according to Embodiment 2.
FIG. 24 is a diagram showing an erroneous tracking result that could occur without the configuration of FIG. 23, in order to explain the effect of that configuration.
FIG. 25 is a diagram showing the relationship between the original trajectory of a target and the tracking result obtained by matching, and the operation of the anchor information correction unit.
FIG. 26 is a diagram showing the configuration of the interactive video teaching material production device of Embodiment 3.
FIG. 27 is a diagram showing the configuration of the interactive video server system of Embodiment 3.
[Description of Reference Signs]
1 data operation unit, 2 data storage unit, 3 display unit, 4 user operation unit, 6 moving image input unit, 10 frame determination unit, 11 anchor setting unit, 12 anchor estimation unit, 13 hyperlink setting unit, 14 hyperlink search unit, 15 anchor search unit, 20 moving image data storage unit, 21 anchor information storage unit, 22 link information storage unit, 30 display control unit, 31 display device, 50 image display area, 52 button group, 54 rectangle button, 56 start frame designation button, 58 end frame designation button, 60 anchor area, 62 anchor-related box group, 64 scene-related box group, 66, 68 boxes, 80 anchor information, 81 start frame, 82 end frame, 110 anchor information editing unit, 111 text anchor setting unit, 112 automatic anchor setting unit, 113 motion vector use setting unit, 114 contour information use setting unit, 115 reference frame deletion unit, 116 pattern matching use setting unit, 117 adjacent frame extraction unit, 118 moving image display unit, 119 anchor information correction unit, 120 anchor information display unit, 130 automatic setting reliability determination unit, 131 automatic setting reliability display unit, 140 original trajectory of target, 141 tracking result by matching, 200 server, 204 data storage unit, 206 anchor estimation unit, 208 hyperlink search unit, 250 client, 252 anchor determination unit, 300 cursor change unit, 400 model A, 401 model B, 402 reference frame at time t, 403 model C, 404 reference frame at time t + Δt.

Continuation of the front page
(51) Int. Cl.⁷, identification symbol, FI: G06T 13/00; G06T 13/00 B
(72) Inventor: Koji Wakimoto, c/o Mitsubishi Electric Corporation, 2-2-3 Marunouchi, Chiyoda-ku, Tokyo
(72) Inventor: Satoshi Tanaka, c/o Mitsubishi Electric Corporation, 2-2-3 Marunouchi, Chiyoda-ku, Tokyo
(56) References:
JP-A-3-52070 (JP, A)
JP-A-3-292571 (JP, A)
Takano, Matoba, Hara, "A Study of a Video Data Model for Hypermedia," Proceedings of the 46th National Convention of the Information Processing Society of Japan (first half of 1993), (4), pp. 221-222 (March 23, 1993)
Takano, Matoba, Hara, "A Hypermedia Construction Method Using Objects Appearing in Video Data as Nodes," Proceedings of the 7th Human Interface Symposium, pp. 301-306, 1991 (October 23, 1991)
Takano, Matoba, Hara, "Navigation Method for Video Hypermedia," Proceedings of the 8th Human Interface Symposium, pp. 607-612, 1992 (October 21, 1992)
Hirata, Kawasaki, Takano, Hara, "A Moving Image Hypermedia Implementation Method for Network Environments," Proceedings of the Information Processing Society of Japan Symposium, Vol. 94, No. 13, pp. 165-173, 1994 (December 7, 1994)
Kazuo Tanaka, Yuzuru Tanaka, "HyperMovie Architecture and Its Applications," Information Processing Society of Japan Research Report, Vol. 96, No. 119 (96-IM-28), pp. 73-77 (November 29, 1996)
(58) Fields searched (Int. Cl.⁷, DB name): G06F 17/30; G06T 13/00

Claims

(57) [Claims]
[Claim 1] A moving image anchor setting device comprising: anchor information setting means for selecting reference frames at predetermined intervals from a plurality of frames constituting a moving image and setting anchor information for each of the reference frames; and anchor information calculating means for calculating anchor information of non-reference frames based on the anchor information thus set, the moving image anchor setting device further comprising: determining means for determining whether the anchor information set in one or more reference frames can be calculated, within a predetermined error range, from the anchor information set in other reference frames; and reference frame deleting means for changing those reference frames to non-reference frames when it is determined that the calculation is possible.