Enhanced Document Intelligence: New Prebuilts & Image Support
Mar 15, 2024 12:06 PM

by HubSite 365 about Microsoft

Software Development Redmond, Washington

Unlock new Azure AI Document Intelligence features: Image & figure support, US Tax 1040 models & more!

Key insights

 

  • Azure AI Document Intelligence adds image and figure extraction, introduces new prebuilt models for common tax and mortgage forms, and improves custom models with confidence scores and support for overlapping fields.
  • The service simplifies extracting insights from documents with its Layout API and supports the processing of tax forms and custom documents efficiently, vital for the fast-paced digital world.
  • Gone are the days of tedious manual data entry, as Document Intelligence automates document processing, boosting productivity and revealing hidden insights.
  • New updates feature figure and image detection, hierarchical document structure support, enhancements in prebuilt models including the 1040 tax form, and new additions like models for marriage certificates and credit cards.
  • The release adds container support for the Read and Layout models, enabling local or fully disconnected operation. Custom models now produce confidence scores, which are vital for the human in the loop (HITL) pattern.
 

A Deeper Dive into Azure AI Document Intelligence

Azure AI Document Intelligence, formerly known as Form Recognizer, is geared towards modernizing how businesses handle documents. With a constant influx of documents in various formats, this AI service is becoming an indispensable tool for extracting valuable information without manual intervention. The latest update is particularly noteworthy because it adds image and figure extraction and refines the custom model capabilities.

The introduction of new prebuilt models and the improvement to the custom models with confidence scoring are particularly compelling. These features speak directly to the needs of businesses dealing with tax, mortgage, and various other document-heavy operations. Additionally, the advancements in dealing with hierarchical document structures and the added capacity for more nuanced data extraction like image detection indicate a significant leap towards more intelligent document processing automation.

Moreover, the update emphasizes ease of integration and flexibility through container support, aligning with the growing trend towards edge computing and data sovereignty. With these enhancements, Azure AI Document Intelligence becomes a cornerstone technology for businesses that want more efficient, insight-driven operations.

 

 


Document Intelligence preview adds more prebuilts, support for image and figures, and more!

Azure AI Document Intelligence, formerly known as Form Recognizer, is an AI service for all your document understanding needs. The latest update previews new features, including image and figure extraction and new prebuilt models for the US Tax 1040 form and other common tax and mortgage forms.

Custom models are also updated with confidence scores for tables, rows, and cells, support for overlapping fields, and a classification model that now supports incremental training and Office file types. In today's fast-paced digital world, businesses are drowning in a sea of documents that require manual review. Document Intelligence makes it easy to extract insights from documents. For example, you can use the Layout API to extract content and structure, then query documents for insights with the RAG (retrieval augmented generation) pattern.
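The confidence scores mentioned above are what make a human in the loop (HITL) workflow practical: values the model is sure about pass straight through, and the rest go to a reviewer. The following is a minimal sketch of that routing, not the service's own logic. The 0.80 threshold, the `split_for_review` helper, and the sample field names are all hypothetical; the only assumption about the service is that each extracted field carries a `content` value and a `confidence` between 0 and 1.

```python
from typing import Any, Optional

# Hypothetical cut-off; a real workload would tune this per field.
REVIEW_THRESHOLD = 0.80


def split_for_review(
    fields: dict[str, dict[str, Any]],
    threshold: float = REVIEW_THRESHOLD,
) -> tuple[dict[str, Optional[str]], dict[str, Optional[str]]]:
    """Separate extracted fields into auto-accepted and needs-review groups."""
    accepted: dict[str, Optional[str]] = {}
    review: dict[str, Optional[str]] = {}
    for name, field in fields.items():
        # A missing or empty confidence counts as 0.0, so the field is reviewed.
        confidence = field.get("confidence") or 0.0
        target = accepted if confidence >= threshold else review
        target[name] = field.get("content")
    return accepted, review


# Hypothetical sample shaped like the fields of one analyzed document.
sample_fields = {
    "InvoiceTotal": {"content": "1,250.00", "confidence": 0.97},
    "DueDate": {"content": "04/15/2024", "confidence": 0.62},
    "Signature": {"content": None},
}

accepted, review = split_for_review(sample_fields)
print("auto-accepted:", accepted)
print("needs review:", review)
```

Treating a missing confidence as zero is a deliberately cautious choice: anything the model did not score ends up in front of a person.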

As tax season approaches in the US, you may need to process tax forms like 1040 or 1099 with the prebuilt models or you could build custom models in minutes to classify and extract specific fields from any form or document. Gone are the days of tedious manual data entry. With Document Intelligence, your team can automate document processing, freeing up valuable time to focus on what really matters. Boost productivity, streamline operations, and uncover hidden insights—all with Azure AI Document Intelligence.

What is new in Preview?

Document Intelligence continues to evolve, adding new models and updating existing ones. The Layout API extracts content and structure from PDF, images, HTML, and Office file types like Word, PowerPoint, and Excel. The most recent updates cover:

  • Figure and Image Detection
  • Hierarchical Document Structure
  • Prebuilt Models

Documents like business plans, financial reports, and manuals usually contain graphs and figures as well. For more complete ingestion of these document types, Layout has added figure and image detection, which includes extracting the bounding region of the image, associated captions, and context. When you use the content of a document to extract insights with a large language model (LLM), Layout now enables the extraction and processing of information in embedded images and figures. Pair this feature with the formula add-on and you have a simple solution for extracting all the information from academic papers.
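To make the figure output concrete, here is a small sketch that summarizes detected figures from an analysis result that has already been parsed into a Python dict. It assumes the result exposes a `figures` list whose entries carry `boundingRegions` (each with a `pageNumber` and `polygon`) and an optional `caption` with `content`; check these key names against the response of the API version you use. The `summarize_figures` helper and the sample data are hypothetical.

```python
def summarize_figures(analyze_result: dict) -> list[dict]:
    """Return the caption text and page numbers for each detected figure."""
    summaries = []
    for index, figure in enumerate(analyze_result.get("figures", [])):
        # Figures without a caption get an empty string.
        caption = (figure.get("caption") or {}).get("content", "")
        # A figure may span several regions; collect the distinct pages.
        pages = sorted(
            {region["pageNumber"] for region in figure.get("boundingRegions", [])}
        )
        summaries.append({"index": index, "caption": caption, "pages": pages})
    return summaries


# Hypothetical sample; the key names are assumptions about the result shape.
sample_result = {
    "figures": [
        {
            "boundingRegions": [
                {"pageNumber": 2, "polygon": [1.0, 1.0, 4.0, 1.0, 4.0, 3.0, 1.0, 3.0]}
            ],
            "caption": {"content": "Figure 1: Quarterly revenue"},
        },
        {
            "boundingRegions": [
                {"pageNumber": 5, "polygon": [0.5, 2.0, 3.5, 2.0, 3.5, 4.0, 0.5, 4.0]}
            ]
        },
    ]
}

for summary in summarize_figures(sample_result):
    print(summary)
```

The polygon is left untouched here; a pipeline that crops figures out of page images would use it as the crop boundary.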

One of the challenges in document ingestion is not only extracting all the elements but also maintaining meaningful structure and semantic relationships. This understanding is vital for extracting meaningful insights, summarization, and contextual analysis. In the latest preview, Layout added support for section hierarchies, where the paragraphs, sections, tables, and figures are grouped according to the document structure. You can request output in markdown format to easily get the document structure and its associated content in markdown.
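Markdown output pairs naturally with the RAG pattern, because headings give ready-made chunk boundaries. The sketch below splits a markdown string into chunks and records the heading path above each one. It is plain string processing under the assumption that sections appear as `#`-style headings; the `chunk_by_heading` helper and the sample text are hypothetical, and a `#` at the start of a line inside a fenced code block would be misread as a heading.

```python
import re

# Matches "#" to "######" followed by the heading title.
HEADING = re.compile(r"^(#{1,6})\s+(.*\S)\s*$")


def chunk_by_heading(markdown: str) -> list[dict]:
    """Split markdown into chunks, each tagged with its heading path."""
    chunks: list[dict] = []
    stack: list[tuple[int, str]] = []  # (level, title) of the open headings
    body: list[str] = []

    def flush() -> None:
        # Emit the text collected so far under the current heading path.
        text = "\n".join(body).strip()
        if text:
            chunks.append({"path": [title for _, title in stack], "text": text})
        body.clear()

    for line in markdown.splitlines():
        match = HEADING.match(line)
        if match:
            flush()
            level, title = len(match.group(1)), match.group(2)
            # Close headings at the same or a deeper level.
            while stack and stack[-1][0] >= level:
                stack.pop()
            stack.append((level, title))
        else:
            body.append(line)
    flush()
    return chunks


# Hypothetical markdown, standing in for Layout's markdown output.
sample = (
    "# Report\nIntro text.\n## Methods\nWe measured.\n"
    "## Results\nIt worked.\n# Appendix\nExtra."
)

for chunk in chunk_by_heading(sample):
    print(" > ".join(chunk["path"]), "|", chunk["text"])
```

Keeping the heading path with each chunk lets a retriever show where a passage came from, and the path can be prepended to the chunk text before embedding.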

Prebuilt models offer an out-of-the-box solution that provides the fields for a known document type with a simple API call. Tax and mortgage processing in the US just got easier with the addition of prebuilt models for the 1040 and 1099 tax forms and for the 1003 URLA, 1008, and closing disclosure mortgage forms. Need to extend the schema of a prebuilt model to meet your specific needs? Just add the fields you need as query fields to extract the expanded schema.
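As a rough illustration of extending a prebuilt schema, the sketch below only builds the request URL for an analyze call with query fields; it sends nothing. The path layout, the `2024-02-29-preview` API version, the `features=queryFields` and `queryFields` parameters, and the `prebuilt-tax.us.1040` model ID are assumptions based on the preview REST API and should be checked against the current reference. The endpoint, the `build_analyze_url` helper, and the two field names are hypothetical.

```python
from urllib.parse import urlencode


def build_analyze_url(
    endpoint: str,
    model_id: str,
    query_fields: list[str],
    api_version: str = "2024-02-29-preview",  # assumed preview version
) -> str:
    """Build an analyze URL, adding query fields when any are requested."""
    params = {"api-version": api_version}
    if query_fields:
        # Assumed parameter names for the query fields add-on.
        params["features"] = "queryFields"
        params["queryFields"] = ",".join(query_fields)
    base = endpoint.rstrip("/")
    return (
        f"{base}/documentintelligence/documentModels/"
        f"{model_id}:analyze?{urlencode(params)}"
    )


# Hypothetical endpoint and extra field names, for illustration only.
url = build_analyze_url(
    "https://example.cognitiveservices.azure.com/",
    "prebuilt-tax.us.1040",
    ["PreparerFirmName", "FilingYear"],
)
print(url)
```

Note that `urlencode` percent-encodes the comma between field names as `%2C`, which a server decodes back to a comma.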

Exploring the Evolution of Document Processing with AI and Machine Learning

The advent of AI and Machine Learning technologies has dramatically altered the landscape of document processing. Azure AI Document Intelligence leverages these technologies to offer sophisticated solutions for understanding complex documents. With features like prebuilt models, figure and image detection, and hierarchical document structure, it massively simplifies the ingestion and analysis of diverse document types. This not only saves valuable time but also enhances the accuracy and efficiency of data extraction and interpretation tasks. As businesses continue to grapple with vast amounts of data, leveraging such AI and Machine Learning technologies becomes crucial for staying competitive and uncovering valuable insights from their documents. Further enhancements in these technologies promise an even more streamlined and intelligent document processing capability, making manual reviews a thing of the past. Azure AI Document Intelligence is at the forefront of this evolution, continuously adding capabilities and improvements to meet the dynamic needs of modern businesses.

 

 

People also ask

What formats are supported by Azure Document Intelligence?

Azure Document Intelligence accepts file formats such as JPEG, PNG, PDF, and TIFF. PDF and TIFF files can contain up to 2,000 pages; on the free tier, only the first two pages are processed. For custom classification model training, the training data can total up to 1GB, capped at 10,000 pages.

Which of the following is an Azure AI Document Intelligence prebuilt model?

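Prebuilt models are ready-made analysis models for known document types. Those named in this article include the US tax 1040 and 1099 models, the 1003 URLA, 1008, and closing disclosure mortgage models, and the marriage certificate and credit card models.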

In which two scenarios can you use Azure AI Document Intelligence?

Azure AI Document Intelligence is versatile, enabling users to apply its capabilities across multiple scenarios such as custom forms, prebuilt models, and layout APIs for meticulously extracting information from documents. This includes a detailed analysis of documents to identify and extract text alongside layout elements, such as tables, check boxes, and objects, providing a comprehensive suite of tools that range from prebuilt to customizable solutions for refining text extraction processes.

Is Document Intelligence better than Azure Computer Vision?

Compared with Azure AI Vision, Document Intelligence stands out for its high-resolution Optical Character Recognition (OCR) model, which is optimized to extract printed and handwritten text accurately from PDF documents and scanned images. If OCR accuracy is poor, consider increasing the resolution of the input images.

 
