WINNERS - TOP 5 TEAMS (SCOREBOARD)
Rank 1:
Team Name: Pre-Par-e
Affiliation: Vellore Institute of Technology, Chennai
Score: 77.26
Rank 2:
Team Name: Tokenwise
Affiliation: IGCAR, HBNI, Chennai
Score: 53.11
Rank 3:
Team Name: Ezhuthani
Affiliation: Velammal College of Engineering and Technology, Madurai; Thiagarajar College of Engineering, Madurai
Score: 50.08
Rank 4:
Team Name: Inkognito
Affiliation: Madras Institute of Technology, Chennai
Score: 42.45
Rank 5:
Team Name: ElecrtoNexus
Affiliation: Mar Baselios College of Engineering and Technology, Thiruvananthapuram, Kerala; APJ Abdul Kalam Technological University, Thiruvananthapuram, Kerala
Score: 34.74
In India, a vast number of official documents, especially application forms, are still filled out by hand. Manually processing these forms is time-consuming, error-prone, and inefficient, highlighting the need for an intelligent system that can accurately extract, interpret, and validate handwritten data. These handwritten forms, often written in English but exhibiting diverse handwriting styles and ink types, pose a significant challenge. The aim of the DeHaDo-AI Challenge is to develop robust AI models capable of accurately recognizing handwritten English text from scanned application forms filled in by Indian citizens. The challenge focuses on handling diverse handwriting styles, varying image quality, and missing or incomplete fields. Through this initiative, researchers, AI practitioners, and developers will have the opportunity to create state-of-the-art models that enhance handwritten character recognition accuracy, field validation, and automation. These models will ultimately reduce human effort and improve data integrity in real-world applications.
The DeHaDo-AI Challenge supports the development of AI-driven solutions for handwritten application forms in real-time settings. The main significance of this challenge is outlined below:
- Boosting Documentation in Public Sector Workflows: Large organizations handle millions of handwritten documents annually, including recruitment applications, legal agreements, financial applications, and HR records. Automating this process through AI-driven solutions can significantly reduce processing time, enhance accuracy, and improve workflow efficiency across global enterprises.
- Enhancing ICR Accuracy: Unlike printed text recognition, handwritten text poses challenges due to variations in handwriting styles, ink quality, and document conditions. This challenge fosters innovation in ICR (Intelligent Character Recognition) and NLP (Natural Language Processing) models to improve accuracy, making document processing more efficient for enterprises and large-scale operations.
- Ensuring Data Completeness & Validation: Many real-world applications require form completeness verification to detect missing or incorrect entries. DeHaDo-AI focuses on automated field validation, ensuring compliance with corporate and regulatory standards to reduce errors in MNC operations, financial records, and customer data processing.
By addressing these challenges, the DeHaDo-AI Challenge encourages researchers, AI practitioners, and industry experts to develop cutting-edge solutions that will shape the future of handwritten document processing in both corporate and public sectors.
Teams should consist of a maximum of four members each, with one member designated as the team lead for communication purposes:
- The dataset will be accessible only to teams that have completed the registration process.
- Participants are required to submit their algorithm's source code, written in Python and adequately commented. Teams must also provide a comprehensive written summary of their approach and algorithms. In addition, participants must disclose the inference time of their code, which is used as an evaluation metric, and provide details of the system specifications on which their code was developed (a minimal timing sketch is given after this list).
- Fair practice is essential. Violations may lead to the disqualification of the entire team.
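Because inference time must be reported, teams need a consistent way to measure it. The snippet below is a minimal, non-authoritative sketch of timing a placeholder run_inference function over a folder of scanned forms; the function name, image extension, and folder path are illustrative assumptions only.

# Minimal sketch for measuring average inference time per scanned form.
# run_inference is a placeholder for the team's own prediction function.
import time
from pathlib import Path

def run_inference(image_path: str) -> list[dict]:
    """Placeholder: return [{"text": ..., "bbox": [...]}, ...] for one image."""
    return []  # replace with the team's actual prediction code

def average_inference_time(image_dir: str) -> float:
    """Return the mean wall-clock inference time (seconds) per image."""
    image_paths = sorted(Path(image_dir).glob("*.png"))  # adjust the extension as needed
    start = time.perf_counter()
    for path in image_paths:
        run_inference(str(path))
    elapsed = time.perf_counter() - start
    return elapsed / max(len(image_paths), 1)

if __name__ == "__main__":
    print(f"Average inference time per image: {average_inference_time('data/sample_input'):.3f} s")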
Sample input image (the scanned form image is not reproduced in this text version)
Sample predicted output (JSON):
[
  {
    "text": "Guardian Fire Security Service",
    "bbox": [274, 67, 503, 86]
  },
  {
    "text": "M. Deepika",
    "bbox": [274, 94, 352, 113]
  },
  {
    "text": "Murugan",
    "bbox": [274, 119, 344, 138]
  }
]
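As an illustration only, the following sketch shows how a list of recognized text and bounding-box pairs for one image could be serialized into the JSON format above, with one output file per input image; the function name, output folder, and file-naming scheme are assumptions rather than part of the official specification.

# Minimal sketch: write one JSON prediction file per image in the format shown above.
# The predictions list is a placeholder for a model's actual output.
import json
from pathlib import Path

def save_predictions(image_name: str, predictions: list[dict], out_dir: str = "outputs") -> None:
    """predictions: [{"text": str, "bbox": [x1, y1, x2, y2]}, ...]"""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    out_path = Path(out_dir) / f"{Path(image_name).stem}.json"
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(predictions, f, ensure_ascii=False, indent=4)

# Example usage with values mirroring the sample above:
save_predictions("form_001.png", [
    {"text": "Guardian Fire Security Service", "bbox": [274, 67, 503, 86]},
    {"text": "M. Deepika", "bbox": [274, 94, 352, 113]},
])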
The DeHaDo-AI Challenge dataset consists of scanned handwritten application forms with varying handwriting styles, image qualities, and structural layouts. The dataset is designed to test the robustness of AI models in text recognition, field validation, and completeness verification.
Participants must not use any paid APIs for handwritten text recognition.
Participants are required to train their own models using deep learning, large language models (LLMs), or similar approaches.
Model inference must be run in Python (a minimal loading-and-inference sketch using PyTorch is given below). You may use any of the following libraries for training the model:
- PyTorch
- TensorFlow
- Keras
- Hugging Face Transformers
- JAX
- FastAI
- ONNX (for deployment)
The results must be reproducible.
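To make the reproducibility requirement concrete, here is a minimal, non-authoritative sketch of seeding the random number generators and running a trained PyTorch model on one preprocessed form image. The FormRecognizer class, the checkpoint path model/handwritten_model.pth, and the grayscale preprocessing are placeholders; teams would substitute their own architecture and pipeline (any of the listed libraries may be used instead).

# Minimal sketch: deterministic loading and single-image inference with PyTorch.
# FormRecognizer and the checkpoint path are placeholders for a team's own model.
import random

import numpy as np
import torch
from PIL import Image
from torchvision import transforms

def set_seed(seed: int = 42) -> None:
    """Fix random seeds so that reported results can be reproduced."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)

class FormRecognizer(torch.nn.Module):
    """Placeholder architecture; replace with the actual model definition."""
    def __init__(self):
        super().__init__()
        self.backbone = torch.nn.Identity()

    def forward(self, x):
        return self.backbone(x)

def load_model(checkpoint_path: str = "model/handwritten_model.pth") -> torch.nn.Module:
    """Load trained weights (a state dict) into the model and switch to eval mode."""
    model = FormRecognizer()
    model.load_state_dict(torch.load(checkpoint_path, map_location="cpu"))
    model.eval()
    return model

def predict(model: torch.nn.Module, image_path: str) -> torch.Tensor:
    """Preprocess one scanned form and run a forward pass without gradients."""
    preprocess = transforms.Compose([transforms.Grayscale(), transforms.ToTensor()])
    image = preprocess(Image.open(image_path)).unsqueeze(0)  # add a batch dimension
    with torch.no_grad():
        return model(image)

if __name__ == "__main__":
    set_seed(42)
    model = load_model()
    output = predict(model, "data/sample_input/form_001.png")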
Submission Requirements: Participants must provide the trained model and the testing script to allow for reproduction of results.
Code and Model Ownership: The submitted code and model will be owned by the organizers.
Evaluation Results: Validation and test phase results will be shared with participants via email; there will be no public leaderboard or online result checking.
Transparency: These guidelines help ensure a fair and consistent evaluation process for all participants.
Each participating team must submit its solution in the format below, emailed as a zipped file to ncvpripg2025dehadoai@gmail.com with the following specifications:
The final submission should include:
- The Python inference script
- The trained model file(s)
- Any required utility/helper scripts
- A clear README with instructions for running the code
Naming format: TeamName_DeHaDo-AI_Challenge.zip
Submission through the designated portal (to be announced).
Recognition Output Format: The text extracted from handwritten forms should be converted into a structured JSON format that includes both the recognized text and the corresponding coordinates. Each image should have an individual JSON output file.
Model Architecture & Code: The AI model implementation, including training scripts and the inference pipeline. Teams retain ownership of their code, and evaluation results will only be used with explicit consent.
Technical Report: A document outlining the approach, methodology, and performance analysis.
Evaluation Metrics Report: A summary of text recognition accuracy, field validation performance, and computational efficiency (an illustrative accuracy-metric sketch follows this list).
Executable Demo (Optional): A working prototype or API demonstrating real-time handwritten form processing.
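The official accuracy metric is defined by the organizers; purely as an illustration, the sketch below computes character error rate (CER), a common measure of handwritten text recognition accuracy, using a self-contained edit-distance implementation. Treat the metric choice and the example strings as assumptions, not the challenge's scoring formula.

# Illustrative only: character error rate (CER) between predicted and reference text.
# The official evaluation metric may differ; this is a common reference implementation.
def levenshtein(a: str, b: str) -> int:
    """Edit distance between strings a and b (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]

def cer(predicted: str, reference: str) -> float:
    """Character error rate: edit distance normalized by reference length."""
    if not reference:
        return 0.0 if not predicted else 1.0
    return levenshtein(predicted, reference) / len(reference)

print(cer("M. Deepika", "M. Deepika"))           # 0.0 (exact match)
print(round(cer("M. Deepka", "M. Deepika"), 3))  # one missing character -> 0.1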
*Note on the model/ folder
We have provided a model/ folder containing sample trained model files to help participants test their inference pipeline. You can use any one of the provided models for initial testing; this is sufficient to verify your code structure and functionality.
During final submission, you must replace the sample model with your own trained model, ensuring that the code can reproduce your submitted results using this model.
Sample Folder Structure for Submission:
submission/
│
├── model/
│   ├── handwritten_model.pth       # Sample trained model (PyTorch format)
│   ├── handwritten_model.h5        # OR TensorFlow/Keras format (if applicable)
│   └── handwritten_model.onnx      # Optional: exported ONNX model
│
├── src/
│   ├── inference.py                # Main Python script for running inference
│   ├── utils.py                    # Helper functions (e.g., image preprocessing)
│   └── model_architecture.py       # Model architecture definition
│
├── data/
│   └── sample_input/               # Optional: sample test data
│
├── README.md                       # Instructions to reproduce the results
└── requirements.txt                # Python dependencies
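To show how the pieces in this tree might fit together, below is a minimal, hypothetical skeleton of src/inference.py that walks an input folder and writes one JSON prediction file per image. The helper functions, default paths, and command-line flags are illustrative assumptions, not part of the official template.

# Hypothetical skeleton of src/inference.py: batch inference over a folder of scanned
# forms, writing one JSON file per image. load_model/predict_fields are placeholders
# for code that would normally live in utils.py and model_architecture.py.
import argparse
import json
from pathlib import Path

def load_model(model_path: str):
    """Placeholder: load and return the trained model."""
    return None  # replace with actual model loading

def predict_fields(model, image_path: Path) -> list[dict]:
    """Placeholder: return [{"text": ..., "bbox": [x1, y1, x2, y2]}, ...] for one image."""
    return []  # replace with actual recognition output

def main() -> None:
    parser = argparse.ArgumentParser(description="DeHaDo-AI inference (illustrative skeleton)")
    parser.add_argument("--input", default="data/sample_input", help="folder of scanned forms")
    parser.add_argument("--output", default="outputs", help="folder for per-image JSON files")
    parser.add_argument("--model", default="model/handwritten_model.pth", help="trained model file")
    args = parser.parse_args()

    model = load_model(args.model)
    out_dir = Path(args.output)
    out_dir.mkdir(parents=True, exist_ok=True)

    for image_path in sorted(Path(args.input).glob("*.*")):
        predictions = predict_fields(model, image_path)
        with open(out_dir / f"{image_path.stem}.json", "w", encoding="utf-8") as f:
            json.dump(predictions, f, ensure_ascii=False, indent=4)

if __name__ == "__main__":
    main()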
Participants may use the provided fact sheet template to submit their results during the final phase of the competition.
The top five teams will be invited to present their solutions in a dedicated session at NCVPRIPG 2025 and will also have the opportunity to contribute to a paper summarizing the challenge outcomes, which will be submitted to the NCVPRIPG 2025 proceedings.
Winner: ₹20,000
First Runner-up: ₹15,000
Second Runner-up: ₹10,000
Collaboration on writing the summary paper
VIT Chennai
Anna University Chennai
TCE Madurai
Couger Inc., Japan
VIT Chennai
Anna University, MIT Campus, Chennai
VIT Chennai
VIT Chennai