By Benjamin Deboute -
Hi all!
Some of you who have Xerox copiers and an administrative department
that archives paper via OCR might be interested to know that a
misconfiguration of the copiers' compression algorithm might have
corrupted your data :(
The others can laugh at how stupidly logical/logically stupid the bug is :)
TL;DR: OCR'd numbers might be switched with other numbers on some page layouts...
Quote: "Several mails I got suggest that the xerox machines use JBIG2
for compression. This algorithm creates a dictionary of image patches
it finds “similar”. Those patches then get reused instead of the
original image data, as long as the error generated by them is not
“too high”. Makes sense. "
http://www.dkriesel.com/en/blog/2013/0802_xerox-workcentres_are_switching_written_numbers_when_scanning?
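
For anyone wondering how "similar" patches can turn one digit into another, here is a toy Python sketch of the idea described in the quote. It is not the real JBIG2 codec or anything Xerox ships: the 3x5 glyphs, the names hamming/compress/decompress and the max_error threshold are all made up for illustration.

```python
from typing import List, Sequence, Tuple

Patch = Tuple[str, ...]  # rows of '#' (ink) and '.' (paper) pixels


def hamming(a: Patch, b: Patch) -> int:
    """Count the pixels that differ between two same-sized patches."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def compress(patches: Sequence[Patch], max_error: int) -> Tuple[List[Patch], List[int]]:
    """Toy pattern matching: reuse a dictionary patch when it is 'close enough'."""
    dictionary: List[Patch] = []
    refs: List[int] = []
    for patch in patches:
        for i, known in enumerate(dictionary):
            if hamming(patch, known) <= max_error:
                refs.append(i)  # substitute the stored patch for this one
                break
        else:
            dictionary.append(patch)  # nothing similar yet, store it as new
            refs.append(len(dictionary) - 1)
    return dictionary, refs


def decompress(dictionary: List[Patch], refs: List[int]) -> List[Patch]:
    """Rebuild the page from dictionary references only."""
    return [dictionary[i] for i in refs]


# Hypothetical 3x5 glyphs: a "6" and an "8" that differ by a single pixel.
SIX = ("###", "#..", "###", "#.#", "###")
EIGHT = ("###", "#.#", "###", "#.#", "###")

if __name__ == "__main__":
    for max_error in (0, 1):
        dictionary, refs = compress([SIX, EIGHT], max_error)
        restored = decompress(dictionary, refs)
        print(max_error, refs, restored[1] == EIGHT)
```

With max_error set to 0 both glyphs are stored and the page comes back intact. With max_error set to 1 the "8" is judged similar enough to the stored "6" and is silently replaced by it, which is the kind of substitution the blog post describes.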
Quite a nice bug <3
--
benjamin debouté