laptop and a wrench

Error

The ArcGIS LocateXT tools fail to extract locations from PDFs created in ArcGIS Pro.

Última publicación: March 6, 2023 ArcGIS Pro
Número de ID del error BUG-000155378
EnviadoJanuary 26, 2023
Última modificaciónJune 20, 2025
Relacionado conArcGIS Pro
Versión encontradaN/A
Sistema operativoWindows OS
Versión de sistema operativo10.0 64 Bit
EstadoKnown Limit

Información adicional

Detailed information about how LocateXT works is provided in the Pro help system in the topic "Adjust how locations and attributes are extracted" in the section 'Scan files' (https://pro.arcgis.com/en/pro-app/latest/help/data/locatext/adjust-how-locations-and-attributes-are-extracted.htm#ESRI_SECTION1_5B0CE46F1BE4444FA60E37132F3D4BAC) under the subheading 'Some files are not processed' . LocateXT does not extract text from the PDF. Text is extracted from PDFs using IFilter plug-ins to the Windows operating system. IFilters are primarily created to facilitate Windows Search. LocateXT asks the IFilters available on the local computer to extract text from the PDF file. LocateXT then processes the text returned by the IFilter to find x, y coordinates, place names, etc., and generate spatial features and attributes for those features accordingly. If the IFilters on the local machine cannot extract text from the PDF, LocateXT has nothing to process. A PDF file is not a text file. A PDF file contains objects and data. In the PDF, text can be encoded as binary data and compressed inside the file. Every PDF file is not structured in the same manner. It is true that with the PDF files provided by the customer, both files display text that defines an x, y coordinate. However, you can open these PDFs in some text editors and see that the two files are structured in very different ways. It makes sense that the standard IFilter provided by Microsoft that comes with the Windows operating system would be able to read PDF files generated by Microsoft Office products. The different internal PDF structure used in the files generated by ArcGIS Pro is obviously not as readable to that specific IFilter for some reason. Other IFilters are available that can extract more text out of different PDF files. Adobe provides IFilters that can extract text from PDFs. One Adobe IFilter that was tested would not read text from the PDF file provided by the customer that was generated by ArcGIS Pro, but other newer IFilters may be available with more advanced Adobe products. We found that if you use the TET PDF IFilter, for example, the coordinates are successfully read out of the PDF file provided by the customer that was generated by ArcGIS Pro.

Solución alternativa

 a) Convert the PDF file to another text-based document format that can be parsed more effectively by the IFilter provided with the operating system.

b) Open the PDF file in a PDF reader. Copy the text and paste it into the Text tab on the Extract Locations pane.

Pasos para reproducir

ID del error: BUG-000155378

Software:

  • ArcGIS Pro

Recibir notificaciones cuando cambie el estado de un error

Descargar la aplicación de soporte de Esri

Descubrir más sobre este tema

Obtener ayuda de expertos en ArcGIS

Contactar con el soporte técnico

Descargar la aplicación de soporte de Esri

Ir a opciones de descarga