AVDN

Challenge

Overview:

Based on the Aerial Vision-and-Dialog Navigation (AVDN) dataset, we're organizing a challenge for the Aerial Navigation from Dialog History (ANDH) task. This event will be part of the 5th workshop on Closing the Loop Between Vision and Language (CLVL) workshop at ICCV 2023. To participate, visit the challenge page on Eval.ai: https://eval.ai/web/challenges/challenge-page/2049/overview.

Dates:

Challenge launch: May 18, 2023
EvalAI submission deadline: Aug 15, 2023, 4:59:59 PM PST (submission to leaderboard is reopened for further development after the challenge)
Report deadline: Aug 22, 2023, 4:59:59 PM PST

To submit the report, please send the following to yfan71@ucsc.edu:

Team name;
Team members (no maximum limit);
The username used in CodaLab submissions;
A link to your 2-4 page report on arXiv (please upload yourself), which describes your systems (including data processing, methods, experimental results, etc.) using the CLVL workshop paper template. The report can be considered as a submission to the non-archival track of the CLVL workshop. Code release is also encouraged to facilitate future research.

Winner (announcement date: Aug 28, 2023):

Congratulations to the winning team!

First place winner:

Team susanping (Yifei Su, Dong An, Yuan Xu, Kehan Chen, Yan Huang). Report: Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation.

Guideline:

The AVDN dataset is a dataset for aerial embodied AI, which includes human-human dialogs, drone navigation trajectories, and drone's visual observation (simulated using the xView dataset) with human attention. The dialog involves the user (commander) that provides instructions, and the aerial agent (follower) that followes the instruction and askes questions when needed. Based on the AVDN dataset, we introduce the Aerial Navigation from Dialog History (ANDH) task. The goal of the task is to let the agent predict aerial navigation actions that lead to goal areas, following the instructions in the dialog history.

How to participate:

Follow the instructions on AVDN project Github to download the xView dataset, AVDN dataset and baseline code.
Register an account on Eval.ai.
Create a new team or select an existing one to participate in the challenge on the participate page of AVDN challenge.
After participation, a submission page will show up with detailed submission instructions.

Challenge phases:

Dev Phase: Use the valid unseen split. This phase is for checking your submission format. The submission to the phase will be kept private (i.e., invisible on the leaderboard).
Test Phase: Use the test split. This phase is for your official submission. The number of submissions is limited to once per day and 10 times per month. The submission will be made public (i.e., visible on the leaderboard).

FAQ:

Is there a prize for the winner?
Yes. The winning teams will be invited to present their methods at the ICCV 2023 CLVL workshop. In addition, an Method Innovation Award may be awarded to the most innovative method team based on the evaluation of the technical reports.
Do you need to submit a report?
To be eligible for result archives and consideration for awards, you will need to submit a 2-4 page report which describes your systems (including data processing, methods, experimental results, etc.) using the ICCV 2023 paper template. Code release is also encouraged to facilitate future research.

Orgnizers:

Yue Fan

Xin Eric Wang

Have any questions or suggestions? Feel free to reach out at yfan71@ucsc.edu.

Please add [AVDN Challenge] to the email title.

Please cite our paper as below if you use our work.

@article{fan2022aerial,

title={Aerial vision-and-dialog navigation},

author={Fan, Yue and Chen, Winson and Jiang, Tongzhou and Zhou, Chun and Zhang, Yi and Wang, Xin Eric},

journal={arXiv preprint arXiv:2205.12219},

year={2022}

}

Page updated

Google Sites

Report abuse