Keynote & Special Talks

基調講演 - Keynote (Jul 29)

映像意味検索の未来

The Future of Video Semantic Indexing

篠田浩一（東京工業大学）

Koichi Shinoda (Tokyo Institute of Technology)

概要

現在，インターネット上には大量の映像が溢れています．それらは多様であり，かつ，必ずしも品質が高くありません．現在，そのような映像の集まりから，我々にとって意味のある情報を抽出する映像意味検索技術が急速に進歩しています．本講演では，映像意味検索の国際競争型ワークショップTREVIDにおける現在主流の技術を紹介した上で，今後のトレンドを大胆に予測します．多様性や低品質に対応するための機械学習に基づくアプローチ，実用において必須な高速化技術，画像・音声・テキストなどの様々な情報を統合するマルチモーダル認識技術に特に焦点を当て，それらにおける課題と予想される今後の発展について議論します．

Abstract

Now the internet teems with consumer videos which are highly diverse and often with low quality. Video semantic indexing, which extracts meaningful information from those Internet videos, has been extensively studied. In this paper, we introduce the research activities in TRECVID workshop, an international workshop, where many participants compete with each other on video semantic indexing techniques and forecast their prospect. We focus on machine learning approach robust against the variety and the low quality, fast computational techniques required for real applications, and multi-modal recognition techniques which combine various modes such as image, audio, and text. We discuss their problem, and predict their future development.

略歴

篠田浩一

1989年東京大学修士卒．2001年に東京工業大学より博士号(工学)取得．1989年日本電気(株)入社，1997年から1998年までルーセント・ベル研究所客員研究員．現在，東京工業大学教授．専門は音声認識，映像検索，統計的パターン認識，ヒューマンインタフェース．1998年電子情報通信学会論文賞受賞．現在，Elsevier Speech Communication誌およびComputer Speech and Language誌編集委員，情報処理学会音声言語情報処理研究会主査．

Biography

Koichi Shinoda received M.S. degrees from the University of Tokyo in 1989, and the D. Eng. Degree from Tokyo Institute of Technology in 2001. In 1989, he joined NEC Corporation. From 1997 to 1998, he was a Visiting Scholar with Bell Labs, Lucent Technologies. He is currently a Professor with the Tokyo Institute of Technology. His research interests include speech recognition, video information retrieval, statistical pattern recognition, and human interfaces. He received the Excellent Paper Award from the IEICE in 1998. He is currently an associate editor of Computer Speech and Language and Speech Communication, Elsevier, and the chief of SIG-SLP of IPSJ.

特別講演 - Special Talk (Jul 30)

ウェアラブルコンピューティングの実世界応用

Real-world Applications for Wearable Computing

寺田努（神戸大学）

Tsutomu Terada (Kobe University)

概要

コンピュータを常時装着して利用するウェアラブルコンピューティングという研究分野は，入出力機器などのハードウェア，センサ情報処理や状況認識などのデータ処理，情報提示，ミドルウェアなどの融合領域である．本講演では，我々のこれまでの取り組みを紹介しながら，ウェアラブルシステム(特に行動認識技術)を現場で実際に活用するにあたって生じる問題点とその解決について述べる．研究で作ったシステムを実際のイベントや業務の現場で利用することの楽しさを伝えられればと考えている．

Abstract

Wearable Computing consists of various research area including context recognition, user interfaces, middlewares, information visualization, and sensing. In this talk, I introduce our works that were actually used in the real-world, and clarify what kind of techniques are required in the real-world focusing on the wearable sensing and context recognition technologies.

略歴

寺田努

1997年大阪大学工学部情報システム工学科卒業．1999年同大学院工学研究科博士前期課程修了．2000年同大学院工学研究科博士後期課程退学．同年より大阪大学サイバーメディアセンター助手．2005年より同講師．2007年神戸大学大学院工学研究科准教授．現在に至る．2004年より特定非営利活動法人ウェアラブルコンピュータ研究開発機構理事，2004年には英国ランカスター大学客員研究員，2006年IPA未踏クリエータ，2010年よりJSTさきがけ研究員を兼務．博士（工学）．ウェアラブルコンピューティング，ユビキタスコンピューティング，エンタテインメントコンピューティングの研究に従事．

Biography

Tsutomu Terada is an Associate Professor at Graduate School of Engineering, Kobe University. He received B.Eng., M.Eng. and Ph.D. degrees from Osaka University in 1997, 1999, and 2003, respectively. Dr. Terada is currently working on wearable computing, ubiquitous computing, entertainment computing technologies. He is a member of IEEE.

特別講演 - Special Talk (Jul 31)

大規模テキスト処理を支える形態素解析技術

Advances in Japanese morphological analysis for large-scale text corpora

工藤拓（グーグル（株））

Taku Kudo (Google Inc.)

概要

ソーシャルメディアへ投稿される大規模テキストから知識獲得をしたり，それらをマーケティングに活用しようとする試みが増えている．単語境界が自明ではない日本語を処理するにあたり，単語に分割するという処理は極めて重要であり最も基本的な処理として認識されている．本講演では，日本語単語分割・形態素解析の代表的な手法とその中で使われているアルゴリズム・データ構造等を解説する．さらに，形態素解析における問題点と我々の取り組みについて紹介する．

Abstract

Text mining and social media marketing from large scale text corpora have evolved significantly over the past few years. Word segmentation (morphological analysis) is known to be a core technology that allows for Japanese textual information to be converted into machine-readable structured data. This talk presents basic algorithms, models and data structures used in Japanese morphological analyzers. We also show the current status of Japanese morphological analysis and what sort of issues we are coping with.

略歴

工藤拓

1976年生グーグル株式会社ソフトウェアエンジニア．京都大学工学部電気電子工学科卒業，奈良先端科学技術大学院大学情報科学研究科博士後期課程修了，NTTコミュニケーション科学基礎研究所リサーチアソシエイトを経て現職．専門は統計的自然言語処理，機械学習，データマイニング．オープンソース形態素解析システムMeCabの開発者．