NVBench is a large dataset for complex and cross-domain NL2VIS tasks, which covers 105 domains, supports seven common types of visualizations, and contains 25,750 (NL, VIS) pairs. This repository contains the corpus of NL2VIS, with JSON format and Vega-Lite format.
Publications:
Yuyu Luo, Nan Tang, Guoliang Li, et al. Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks. SIGMOD 2021 Conference.