laptop and a wrench

漏洞

The ArcGIS LocateXT tools fail to extract locations from PDFs created in ArcGIS Pro.

上次发布: March 6, 2023 ArcGIS Pro
漏洞 ID 编号 BUG-000155378
已提交January 26, 2023
上次修改时间June 20, 2025
适用范围ArcGIS Pro
找到的版本N/A
操作系统Windows OS
操作系统版本10.0 64 Bit
状态Known Limit

附加信息

Detailed information about how LocateXT works is provided in the Pro help system in the topic "Adjust how locations and attributes are extracted" in the section 'Scan files' (https://pro.arcgis.com/en/pro-app/latest/help/data/locatext/adjust-how-locations-and-attributes-are-extracted.htm#ESRI_SECTION1_5B0CE46F1BE4444FA60E37132F3D4BAC) under the subheading 'Some files are not processed' . LocateXT does not extract text from the PDF. Text is extracted from PDFs using IFilter plug-ins to the Windows operating system. IFilters are primarily created to facilitate Windows Search. LocateXT asks the IFilters available on the local computer to extract text from the PDF file. LocateXT then processes the text returned by the IFilter to find x, y coordinates, place names, etc., and generate spatial features and attributes for those features accordingly. If the IFilters on the local machine cannot extract text from the PDF, LocateXT has nothing to process. A PDF file is not a text file. A PDF file contains objects and data. In the PDF, text can be encoded as binary data and compressed inside the file. Every PDF file is not structured in the same manner. It is true that with the PDF files provided by the customer, both files display text that defines an x, y coordinate. However, you can open these PDFs in some text editors and see that the two files are structured in very different ways. It makes sense that the standard IFilter provided by Microsoft that comes with the Windows operating system would be able to read PDF files generated by Microsoft Office products. The different internal PDF structure used in the files generated by ArcGIS Pro is obviously not as readable to that specific IFilter for some reason. Other IFilters are available that can extract more text out of different PDF files. Adobe provides IFilters that can extract text from PDFs. One Adobe IFilter that was tested would not read text from the PDF file provided by the customer that was generated by ArcGIS Pro, but other newer IFilters may be available with more advanced Adobe products. We found that if you use the TET PDF IFilter, for example, the coordinates are successfully read out of the PDF file provided by the customer that was generated by ArcGIS Pro.

解决办法

 a) Convert the PDF file to another text-based document format that can be parsed more effectively by the IFilter provided with the operating system.

b) Open the PDF file in a PDF reader. Copy the text and paste it into the Text tab on the Extract Locations pane.

重现步骤

漏洞 ID: BUG-000155378

软件:

  • ArcGIS Pro

当漏洞状态发生变化时获得通知

下载 Esri 支持应用程序

发现关于本主题的更多内容

获取来自 ArcGIS 专家的帮助

联系技术支持部门

下载 Esri 支持应用程序

转至下载选项