Exam AI-102
Question 150

HOTSPOT

You have a collection of press releases stored as PDF files.

You need to extract text from the files and perform sentiment analysis.

Which service should you use for each task? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

    Correct Answer:

Discussion
zellck

1. Computer Vision
2. Language
https://learn.microsoft.com/en-us/azure/cognitive-services/computer-vision/overview-ocr
OCR or Optical Character Recognition is also referred to as text recognition or text extraction. Machine-learning based OCR techniques allow you to extract printed or handwritten text from images, such as posters, street signs and product labels, as well as from documents like articles, reports, forms, and invoices. The text is typically extracted as words, text lines, and paragraphs or text blocks, enabling access to digital version of the scanned text. This eliminates or significantly reduces the need for manual data entry.

zellck

https://learn.microsoft.com/en-us/azure/cognitive-services/language-service/sentiment-opinion-mining/overview
Sentiment analysis and opinion mining are features offered by Azure Cognitive Service for Language, a collection of machine learning and AI algorithms in the cloud for developing intelligent applications that involve written language. These features help you find out what people think of your brand or topic by mining text for clues about positive or negative sentiment, and can associate them with specific aspects of the text.

M25

https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/overview-ocr#next-steps At the very bottom of the same link, under “Next steps”: • OCR for PDF, Office and HTML documents and document images: start with Document Intelligence Read.

rdemontis

agree with you

Pffffff

The service you should use to extract text from the PDF files is B. Computer Vision. Computer Vision has the ability to extract text from images and PDF files, making it a suitable choice for this scenario. Once the text has been extracted, you can then use a text analytics service, such as the Azure Cognitive Services Text Analytics API, to perform sentiment analysis on the extracted text. Azure Cognitive Search is a search-as-a-service solution that allows you to index and search structured and unstructured data. It can also extract text from PDF files, but it may not provide the level of accuracy required for sentiment analysis. Form Recognizer is a service that is designed to extract structured data from forms, such as receipts, invoices, and business cards. It may not be the best choice for extracting text from press releases.

973b658

But, Computer Vision < Form Recognizer in this case.

AnonymousJhb

we are reading pdfs and not receipts so we use Computer Vision

Mehe323

Answers are correct. Form recognizer is the old name of Document Intelligence. About Document Intelligence on Microsoft Learn: "Document Intelligence Read Optical Character Recognition (OCR) model runs at a higher resolution than Azure AI Vision Read and extracts print and handwritten text from PDF documents and scanned images." https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept-read?view=doc-intel-4.0.0

mon2002

Azure AI Document Intelligence. Azure AI Language.

davidorti

The answer seems correct.
1. Form Recognizer (now Document Intelligence)
2. Language
The Computer Vision Read API documentation for OCR says it clearly:
OCR for images: "Optimized for general, *non-document images* with a performance-enhanced synchronous API"
Document Intelligence: "Optimized for text-heavy scanned and digital documents with an asynchronous API to help automate intelligent document processing at scale"
Here we're dealing with a collection of PDFs.

davidorti

Link to the documentation: https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/how-to/call-read-api

ziggy1117

The answer is correct:
1. Form Recognizer -> Read. Form Recognizer v3.0's Read Optical Character Recognition (OCR) model runs at a higher resolution than Computer Vision Read and extracts print and handwritten text from PDF documents and scanned images. It also includes preview support for extracting text from Microsoft Word, Excel, PowerPoint, and HTML documents. It detects paragraphs, text lines, words, locations, and languages. The Read model is the underlying OCR engine for other Form Recognizer prebuilt models like Layout, General Document, Invoice, Receipt, Identity (ID) document, in addition to custom models.
https://learn.microsoft.com/en-us/azure/applied-ai-services/form-recognizer/overview?view=form-recog-3.0.0
https://learn.microsoft.com/en-us/azure/applied-ai-services/form-recognizer/concept-read?view=form-recog-3.0.0
2. Language, obviously.
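For anyone who wants to try the Read model on a PDF, here is a minimal sketch in Python. It assumes the `azure-ai-formrecognizer` package (3.2 or later) and a Document Intelligence resource; `endpoint`, `key`, and the helper names `collect_lines` and `read_pdf_text` are placeholders made up for this example, not anything from the exam or the docs.

```python
from typing import Any, List


def collect_lines(result: Any) -> List[str]:
    """Flatten an analysis result into a list of text lines, page by page."""
    lines: List[str] = []
    for page in result.pages:
        # A page may carry no line information, so fall back to an empty list.
        for line in page.lines or []:
            lines.append(line.content)
    return lines


def read_pdf_text(path: str, endpoint: str, key: str) -> List[str]:
    """Run the prebuilt Read model on a local PDF (hypothetical helper)."""
    # Imported lazily so collect_lines can be used without the SDK installed.
    from azure.ai.formrecognizer import DocumentAnalysisClient
    from azure.core.credentials import AzureKeyCredential

    client = DocumentAnalysisClient(endpoint, AzureKeyCredential(key))
    with open(path, "rb") as f:
        # "prebuilt-read" is the model ID of the Read OCR model.
        poller = client.begin_analyze_document("prebuilt-read", document=f)
        result = poller.result()
    return collect_lines(result)
```

The call is asynchronous on the service side (hence the poller), which matches the point that this API is aimed at text-heavy documents rather than single images.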

HaraTadahisa

The service names were changed. The current answer must be 1. Azure AI Document Intelligence 2. Azure AI Language

omankoman

1. Computer Vision 2. Language Computer Vision provides advanced algorithms that process images and return information based on visual features of interest. It offers four services: OCR, Face Service, Image Analysis, and Spatial Analysis. Form Recognizer is an advanced version of OCR.

MaliSanFuu

I think the answer is correct, as Form Recognizer supports document analysis. There you can easily use the Read API to extract printed or handwritten text from images and documents.

krzkrzkra

1. Computer Vision 2. Language

eskimolight

I feel the given answer is correct. Form Recognizer can extract text, key-value pairs, and tables from forms and documents. It's particularly useful for processing structured documents like invoices, receipts, and business forms.

NagaoShingo

1. Computer Vision 2. Language

takaimomoGcup

Computer Vision Language

sismer

The answer is correct. For extracting text from PDF files, you can use Azure Cognitive Services, specifically the Azure Form Recognizer service. Azure Form Recognizer is designed to extract key-value pairs, tables, and text from documents, including PDFs. It supports various document types, making it suitable for extracting text from press releases in PDF format. For sentiment analysis, you can use the Azure Text Analytics service. Azure Text Analytics includes a sentiment analysis feature that can analyze the sentiment of text documents and provide a sentiment score. This service can help you determine whether the sentiment expressed in the press releases is positive, negative, or neutral.
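A rough sketch of the sentiment step in Python, assuming the `azure-ai-textanalytics` package and a Language resource. The chunk size and batch size below are conservative assumptions about the service's per-document and per-request limits, so check the current limits before relying on them. `chunk_text` and `press_release_sentiment` are hypothetical helper names.

```python
from typing import List

MAX_CHARS = 5000   # assumed safe per-document size; verify against service limits
BATCH_SIZE = 10    # assumed maximum number of documents per request


def chunk_text(text: str, max_chars: int = MAX_CHARS) -> List[str]:
    """Split text into pieces of at most max_chars, breaking on whitespace."""
    chunks: List[str] = []
    current = ""
    for word in text.split():
        # A single over-long word is cut into max_chars-sized slices.
        while len(word) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:max_chars])
            word = word[max_chars:]
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


def press_release_sentiment(text: str, endpoint: str, key: str) -> List[str]:
    """Return one sentiment label per chunk of the extracted text."""
    # Imported lazily so chunk_text can be used without the SDK installed.
    from azure.ai.textanalytics import TextAnalyticsClient
    from azure.core.credentials import AzureKeyCredential

    client = TextAnalyticsClient(endpoint, AzureKeyCredential(key))
    chunks = chunk_text(text)
    labels: List[str] = []
    for i in range(0, len(chunks), BATCH_SIZE):
        for doc in client.analyze_sentiment(documents=chunks[i:i + BATCH_SIZE]):
            if not doc.is_error:
                # Labels are "positive", "neutral", "negative" or "mixed".
                labels.append(doc.sentiment)
    return labels
```

Long press releases will come back as several labels, one per chunk. How to combine them into a single verdict is a design choice this sketch leaves open.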

shahnawazkhot

The answer is: 1. Azure Cognitive Search 2. Language. Yes, Azure Cognitive Search can be used to extract text from PDF files. The Azure Cognitive Search blob indexer can extract text from PDF and other document formats. However, extracting text from embedded images or tables is not yet integrated in Azure Search, but it is on the roadmap. To extract text from PDF files using Azure Cognitive Search, you can use the Document Extraction cognitive skill. This skill extracts content from a file within the enrichment pipeline and can extract text and images with high accuracy. You can use this skill to extract text from PDF files and perform sentiment analysis on the extracted text using the Sentiment Analysis feature provided by Azure Cognitive Services.

AnonymousJhb

No, it's not. The Azure Computer Vision API offers OCR scanning, which makes it much easier to extract text from PDF files. Sentiment analysis is done by Language Services.

kail85

Azure Cognitive Search can be used to extract text from PDF files. It can ingest and index the content of various file formats, including PDFs, by using built-in document cracking capabilities or custom skills. The indexing process extracts text and metadata from the files, making the content searchable.