Japanese OCR suddenly has issues

Vilis · November 17, 2018, 2:09pm

I’ve been using the OCR Space API through ShareX for several months and rarely encountered issues. But today, it keeps returning all Japanese text in the wrong order. Characters can appear in the wrong line, or even in the wrong order in the same line. I tried several different kinds of text and even typed some myself, and the result always seems to be the same. However, when I typed it horizontally, the OCR worked properly (but Japanese text is normally vertical so that isn’t particularly useful).

This is an example of some pretty clear text. As you can see, the characters are all in the wrong order. For the first line, for example, it has “入口はにり” when the correct order is “入り口には” (Japanese is read left to right, top to bottom, in case you’re not familiar with the language).

I tried using a different PC and even had friends who live in different countries try and got the same result. What seems to be the issue here?

admin · November 17, 2018, 7:45pm

We confirmed the issue. It is related to our new OCR update last week. We are working on a fix for this.

Vilis · November 17, 2018, 9:07pm

Thank you for the quick reply.

admin · November 21, 2018, 6:37pm

Thanks for reporting this issue. It was a side-effect of the new table ocr feature. The issue is fixed now on all OCR servers:

Vilis · November 22, 2018, 8:50am

Thank you for your hard work.