How to get rid of junk characters left over from OCR in Word
Thread poster: Susan Welsh
Susan Welsh United States Local time: 22:08 Russian to English + ...
Apr 16, 2013
I have converted a PDF to Word using ABBYY FineReader, and wherever there was a hyphen at a line ending, the Word version has put in a junk character that I cannot search and replace to get rid of. It looks like a horizontal line with a short vertical line hanging down from the back of it -- like an L rotated 90 degrees clockwise. I have copied it into my Find field, but Word can't find it.
There are hundreds of these things in this rather long document, and I would really like to get a clean text to make translating easier.
LEXpert United States Local time: 21:08 Member (2008) Croatian to English + ...
Easy!
Apr 16, 2013
This is very common in multi-column articles. Open Word's Find and Replace dialog and click the "More >>" button. Place the cursor in the Find box and select "Optional Hyphen" from the Special drop-down menu. Leave the Replace box blank, then click Replace All.
That's it.
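For anyone who would rather clean the text outside Word, the same idea can be sketched in Python. This is only a minimal sketch under one assumption: the document has been saved as plain text and the optional hyphen shows up there as the Unicode SOFT HYPHEN character (U+00AD). The function name strip_soft_hyphens and the sample string are made up for illustration, and this does not touch a .doc/.docx file directly.

```python
SOFT_HYPHEN = "\u00ad"  # Unicode SOFT HYPHEN; assumed to be how the optional hyphen appears in plain text


def strip_soft_hyphens(text: str) -> str:
    """Return text with every soft hyphen removed; ordinary hyphens are left alone."""
    return text.replace(SOFT_HYPHEN, "")


# Hypothetical sample with soft hyphens where line breaks used to be
sample = "multi\u00adcolumn arti\u00adcles"
cleaned = strip_soft_hyphens(sample)
print(cleaned)
```

Like leaving the Replace box blank in Word, this deletes the character outright rather than swapping in a space, so the two halves of each word are joined back together.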
esperantisto Local time: 05:08 Member (2006) English to Russian + ...
SITE LOCALIZER
Better take care of it in FR
Apr 16, 2013
In FineReader, go to Tools → Options → 4. Save → Format Settings → RTF/DOC/Word XML, tick Remove Optional Hyphens, and re-export your document.
[Edited at 2013-04-16 07:57 GMT]
Susan Welsh United States Local time: 22:08 Russian to English + ...
TOPIC STARTER
Thanks!
Apr 16, 2013
I used Rudolf's solution, and it worked like a charm. (I didn't want to go back to FR, because I had already done some formatting work on the Word file, like moving footnotes around.)
Thanks to all.