Document Analysis

This technology comprises three stages: PDF analysis, table analysis, and NLP processing. In PDF analysis, object detection locates the text, images, and tables in a document. In table analysis, the structure of tables that appear as images is parsed and converted into structured data. Finally, in the NLP processing stage, the extracted text and structured data are used to train a model that generates text from data, accomplishing the data-to-text generation task.
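
The table-analysis and data-to-text steps can be illustrated with a minimal Python sketch. It assumes the table parser has already produced cells with row and column indices, and that row 0 holds the column headers. The `Cell` type and the functions `cells_to_records` and `record_to_text` are hypothetical names introduced here, not part of the described system. The trained generation model is replaced by a fixed template purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    """One recognized table cell (hypothetical output of a table parser)."""
    row: int
    col: int
    text: str


def cells_to_records(cells):
    """Convert a flat list of cells into one dict per data row.

    Assumes row 0 is the header row; this is an illustrative convention.
    """
    header = {c.col: c.text for c in cells if c.row == 0}
    rows = {}
    for c in cells:
        if c.row == 0:
            continue  # header cells only supply the column names
        rows.setdefault(c.row, {})[header[c.col]] = c.text
    return [rows[r] for r in sorted(rows)]


def record_to_text(record):
    """Template stand-in for a trained data-to-text model (assumption)."""
    return ", ".join(f"{key} is {value}" for key, value in record.items()) + "."


# Example cells, as a table parser might emit them for a 3x2 table.
cells = [
    Cell(0, 0, "Product"), Cell(0, 1, "Revenue"),
    Cell(1, 0, "A"), Cell(1, 1, "10"),
    Cell(2, 0, "B"), Cell(2, 1, "12"),
]
records = cells_to_records(cells)
sentences = [record_to_text(r) for r in records]
```

In a real pipeline the template would give way to a model trained on pairs of structured records and reference text; the intermediate record format is what connects the table-analysis stage to the generation stage.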