Object-Oriented Programming OCR a Level

Researchers Present Latest Work in Programming, Systems

Georgia Tech researchers recently presented their work at leading programming and systems conferences, focusing on static ...

IEEE

Development of OCR Service for Page-Level Recognition for Camera-Captured Document Images

Abstract: The emergence of Large Language Models (LLMs) has driven significant advancements in Natural Language Processing (NLP) and introduced new text-related applications, such as Visual Question ...

IEEE

Structure-Guided Image Completion With Image-Level and Object-Level Semantic Discriminators

Abstract: Structure-guided image completion aims to inpaint a local region of an image according to an input guidance map from users. While such a task enables many practical applications for ...

GitHub

UniPixel: Unified Object Referring and Segmentation

UniPixel is a unified MLLM for pixel-level vision-language understanding. It flexibly supports a variety of fine-grained tasks, including image/video segmentation, regional understanding, and a novel ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results