Tenth International Workshop on SCIentific DOCument Analysis (SCIDOCA 2026)
June 8-9, 2026 G MESSE GUNMA (GUNMA Convention Center), Gunma, Japan and Online
associated with JSAI International Symposia on AI
The recent proliferation of scientific papers and technical documents has become an obstacle to the efficient acquisition of new information in various fields. It is almost impossible for individual researchers to check and read all related documents. Even retrieving relevant documents is becoming increasingly difficult. This workshop gathers researchers and experts who work on scientific document analysis from various perspectives, and invites technical paper presentations and system demonstrations that cover any aspect of scientific document analysis.
This workshop is associated with the JSAI International Symposia on AI, an event that hosts multiple international workshops together and is held in conjunction with the JSAI annual conference.
Relevant topics include, but are not limited to, the following:
text analysis
document structure analysis
logical structure analysis
figure and table analysis
citation analysis of scientific and technical documents
scientific information assimilation
summarization and visualization
knowledge discovery/mining from scientific papers and data
similar document retrieval
entity and relation linking between documents and knowledge base
survey generation
resources for scientific document analysis
document understanding in general
NLP systems for scientific documents, including tagging, parsing, coreference, etc.
Submission deadline: March 7, 2026 (extended from February 14, 2026)
Notification: March 18, 2026
Camera-ready: March 29, 2026
Workshop days: June 8-9, 2026
Registration link: https://isai2026-ai-gakkai.peatix.com/
More information regarding registration can be found at https://www.ai-gakkai.or.jp/isai/registration-2026.
14:25 Workshop opening
14:30 Invited talk
Prof. Fei Cheng Kyoto University, Japan
"Building Japanese Medical LLMs: Domain Pretraining, Structured Summarization, and Knowledge-enhanced RAG"
coffee break
15:45 Journal-Level Citation Impact of Articles with Dataset Links in Abstracts Identified Using a Generative AI Ensemble
Hiroyuki Tsunoda, Yuan Sun, Masaki Nishizawa, Xiaomin Liu and Kou Amano
16:15 An Approach for Improving Entity-Aware Machine Translation via Reinforcement Learning
An Trieu, Vu Tran and Le-Minh Nguyen
16:45 Advanced Legal Case Retrieval: Evaluating Generative LLM-Based Feature Extraction and Hybrid Reranking
Merouane Taleb, Amine-Samy Hedroug, Vu Tran and Minh Le Nguyen
17:15 Day 1 closing
10:00 Invited talk
Dr. Van-Khanh Tran GenAI Center, FPT Smart Cloud, and Thai Nguyen University of Information and Communication Technology, Vietnam
"From Personalized Tutors to Pedagogical Classrooms: Scaffolded Multi-Agent LLMs for Collaborative Learning"
11:00 lunch break
14:00 VietPS-Hallu: A Vietnamese Dataset for Hallucination Detection in Large Language Models within the Public Services
Dinh Bao Bui, Tien Nhat Nguyen, Tung Le and Huy Tien Nguyen
14:30 CiteData: A Large-Scale Dataset for Citation Discovery, Prediction, and Placement
An Dao, An Trieu, Vu Tran, Le Minh Nguyen, Akiko Aizawa and Yuji Matsumoto
15:00 Toward Efficient Entity-Focus RAG for Biomedical Question Answering
Vu Tran, Trung Vo and Le-Minh Nguyen
15:30 HoAstBench: A Method for evaluating LLMs in Smart Homes
Dong Peizhe and Vu Tran
coffee break
16:15 Organization Session
17:15 Workshop closing
There are two classes of submissions:
Long paper on original and completed work, including concrete evaluation and analysis wherever appropriate; and
Short paper on a small, focused contribution, work in progress, a negative result, or an opinion piece.
The page limits are up to 14 pages including references for long papers, and up to 7 pages including references for short papers. (Reviewers will be told that there is no penalty for writing a shorter submission.)
All submissions should be written in English and formatted as a PDF according to the Springer LNCS style, which can be obtained from https://www.springer.com/gp/computer-science/lncs/conference-proceedings-guidelines. The paper should be anonymized. If you use a Word file, please follow the formatting instructions, then convert the file to PDF and submit it at the paper submission page.
For both classes, in addition to original unpublished work, we also accept papers that have already been published or presented in other venues. Such submissions should also be anonymized and will be reviewed by the program committee.
You can submit your paper at https://easychair.org/conferences/?conf=scidoca2026 . If you cannot submit your paper through the EasyChair system for any reason, please send an email to "nguyenml[at]jaist.ac.jp".
If a paper is accepted, at least one author of the paper must register for the workshop and present it. Please register for the workshop at the registration page.
Le-Minh Nguyen, Japan Advanced Institute of Science and Technology
Yuji Matsumoto, RIKEN Center for Advanced Intelligence Project (Advisor)
Vu Tran, Japan Advanced Institute of Science and Technology (Co-Chair)
Le-Minh Nguyen, Japan Advanced Institute of Science and Technology
Yuji Matsumoto, RIKEN Center for Advanced Intelligence Project
Vu Tran, Japan Advanced Institute of Science and Technology
Noriki Nishida, RIKEN Center for Advanced Intelligence Project
Yusuke Miyao, The University of Tokyo
Yoshinobu Kano, Shizuoka University
Akiko Aizawa, National Institute of Informatics
Ken Satoh, Center for Juris-Informatics, ROIS
Junichiro Mori, The University of Tokyo
Kentaro Inui, Tohoku University
Nguyen Ha Thanh, National Institute of Informatics
Nguyen Minh Phuong, Japan Advanced Institute of Science and Technology
An Dao, RIKEN Center for Advanced Intelligence Project
May Myo Zin, Center for Juris-Informatics
Danilo Carvalho, University of Manchester
Hai-Long Trieu, University of Cambridge