Home

常識(多數人共享、一般非專業的知識)是人們間溝通、解決難題的基本要素。不幸的是, 雖

然現代電腦的運算能力與儲存容量均急遽成長,電腦的「沒常識」卻是一個眾所週知的缺陷。欲將數百萬筆人類知識轉換成機器可處理的格式的確是一件費時且昂貴的工作。經過二十五年的努力,OpenCyc 2.0甫於2009年七月正式推出,其知識庫含47,000個「概念」,以及306,000筆知識工程師悉心編撰的「事實」。

相對的,MIT媒體實驗室的「開放常識」計畫於十年內順利的從一萬五千名使用者貢獻了超過百萬筆英文句子。目前,兩個知識庫的內容均以英文為主,而且還極不完整。本研究計畫挑戰多語言常識知識庫的資料蒐集、驗證、與推理技術的開發,以期改善常識資料的涵蓋度、正確性、以及有效推理的能力。尤其是,本研究將旨在結合機器學習技術與具生產力社群遊戲來建構一個中文的嘗試知識庫。前者自動從非結構式與半結構式線上文件擷取出結構式知識;而後者則累積線上社群遊戲玩家的常識。所產出的知識庫可能含有錯誤或矛盾的語句。

Common sense (beliefs or propositions that most people consider prudent and of sound judgment, without reliance on esoteric knowledge or study or research, but based upon what they see as knowledge held by people "in common" – by Merriam-Webster Online) is the fundamental framework of communication and problem solving for human beings. A person who lack of common sense may be considered as dull, even ridiculous. Unfortunately, this is just how the computers nowadays looked like – when you try to interact with them in the “human way”.

Our research interests include knowledge collection, verification, and reasoning in multi-language common sense knowledge bases; and currently focus on developing technologies which enhance the coverage, correctness and reasoning ability of common sense knowledge bases.