Hubdoc - Extract description
Once the file is uploaded to Hubdoc, it effectively becomes a JPEG once at the review screen, this means you can not copy paste or grab contents from within the document and you will have to either open up the PDF on a separate screen or edit it within Xero.
It would be great if the description could also be extracted - OR allow the document to be copied/stay a PDF to be able to copy. This will reduce human errors when entering data. Thank you.
-
Pete Hall
commented
This issue is critical because Hubdoc’s current handling of uploaded PDFs increases manual work and the risk of data entry errors. When documents are flattened into image based files at the review stage, text cannot be copied or extracted, forcing users to re type key details.
Other software already supports this functionality. For example, Re Leased commercial property management software allows invoice description extraction, demonstrating that accurate automated capture is achievable and effective.
Why this matters.Increased errors
Manual re typing of descriptions leads to higher risk of transcription mistakes and incorrect coding.Reduced efficiency
Users must open original PDFs separately or edit documents in Xero, slowing processing and reviews.Weaker data quality
Accurate descriptions are important for audit, reporting, and future reference. Manual entry reduces consistency and reliability.Allowing description extraction or keeping documents as copyable PDFs would reduce errors, improve efficiency, and better align Hubdoc with modern automation standards.
-
Jane Stergio
commented
A pdf file can be displayed in a pdf viewer window, which would enable the copy function. This should not be difficult to do.
A similar idea has been posted here: https://productideas.xero.com/forums/940636-for-accountants-bookkeepers/suggestions/46532647-hubdoc-ability-to-copy-paste-document-details -
Raymond Golugwa
commented
This is very critical, and would make shift for Hubdoc, as accuracy of invoice description are most important when posting an entry. Uploading hundreds of documents in Hubdoc and visit one by one uploaded entry in Xero to manually change the incorrect description provided by machine learning is hectic and cumbersome
-
Raymond Golugwa
commented
Extracting invoice description from PDF invoices when uploading multiple invoices in pdf format. I have more than 100 invoices in pdf format, If you have more than 100 invoices in pdf with different invoices descriptions and want the draft entries in xero to have the actual original description from the pdf invoice. Hubdoc would be required to be developed with this feature and go extra mile than producing description based on historical data extraction of similar documents. Sometime machine learning based on history would result top wrong output, so data extraction by machine reading on PDF is more correct.
-
Jean Sutherland
commented
This would be very useful. Currently have to download the file to be able to copy the information.