Issues with scanned literary text PDF file converting to machine readable literary text Word file
Thread poster: Wei Ralph
Wei Ralph
Wei Ralph  Identity Verified
United States
Local time: 03:07
Member (2013)
English to Chinese
+ ...
Feb 25, 2021

Issues with scanned literary text PDF file converting to machine readable literary text Word file:
1. How to obtain clean machine readable literary text Word format from scanned literary text PDF file?
2. How to obtain clean machine readable literary text Word layout from scanned literary text PDF file?
Any experience you have to share will be greatly appreciated.


 
Gerard de Noord
Gerard de Noord  Identity Verified
France
Local time: 09:07
Member (2003)
English to Dutch
+ ...
Have you tried opening the PDF in Word? Feb 25, 2021

Have you tried opening the PDF file with File/Open in a current version of Word?

Cheers,
Gerard


Jorge Payan
Mamadou djiby Wane
 
Jorge Payan
Jorge Payan  Identity Verified
Colombia
Local time: 03:07
Member (2002)
German to Spanish
+ ...
Next: get yourself an OCR software Feb 25, 2021

Gerard de Noord wrote:

Have you tried opening the PDF file with File/Open in a current version of Word?

Cheers,
Gerard


If what Gerard de Noord suggests fail, you should try ABBYY FineReader or similar.

Saludos


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 09:07
Member (2006)
English to Afrikaans
+ ...
@Wei Feb 26, 2021

Wei Ralph wrote:
How [can I] obtain clean machine readable literary text [in] Word [with correct] format [and/or layout] from [a] scanned literary text PDF file?


You need to either use a very good OCR program or you have to hire a typist. If you hire a good typist, you would not need to do anything further, but even the best OCR programs only get it 95% right, requiring you to fix layout and formatting manually afterwards.


Wei Ralph
 
Wei Ralph
Wei Ralph  Identity Verified
United States
Local time: 03:07
Member (2013)
English to Chinese
+ ...
TOPIC STARTER
Issues with scanned literary text PDF file converting to machine readable literary text Word file Feb 26, 2021

Mr. Murray,

Do you have an email address that can receive a page of this PDF file? or I can go ahead and upload a page here.

Wei Ralph


 
Wei Ralph
Wei Ralph  Identity Verified
United States
Local time: 03:07
Member (2013)
English to Chinese
+ ...
TOPIC STARTER
Issues with scanned literary text PDF file converting to machine readable literary text Word file Feb 26, 2021

Gerard,

Did try and not successful. A OCR specific software is probably the next best thing , other than time consuming typing.

Wei Ralph


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Issues with scanned literary text PDF file converting to machine readable literary text Word file







Trados Business Manager Lite
Create customer quotes and invoices from within Trados Studio

Trados Business Manager Lite helps to simplify and speed up some of the daily tasks, such as invoicing and reporting, associated with running your freelance translation business.

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »