
From the original post: "numbers seem to be mangled independently from the compression mode, which makes the issue hard to avoid"


What the author refers to are the three available compression modes on Xerox scanners, which are "normal", "higher", and "highest". It is not clear what that means exactly, but it is possible that all three of those are lossy compression modes.

In my opinion, the only reasonable mode for JBIG2 is lossless, as the lossy mode of JBIG2 is, in general, prone to these character mangling issues. File sizes for lossless JBIG2 compression are already very low, so I would claim that using lossless is almost always worth it, unless character mangling is explicitly not a problem.
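
To make the mangling concrete, here is a toy sketch of pattern matching and substitution, the general technique lossy JBIG2 encoders are usually described as using. It is not the actual JBIG2 algorithm and says nothing about what the Xerox firmware does; the 5x3 glyph bitmaps and the names `hamming`, `encode`, and `max_diff` are all made up for illustration.

```python
# Toy illustration of lossy symbol matching (NOT real JBIG2 code).
# Glyphs are hypothetical 5x3 bitmaps written as strings; '#' is ink.
GLYPHS = {
    "6": ("###", "#..", "###", "#.#", "###"),
    "8": ("###", "#.#", "###", "#.#", "###"),
    "1": (".#.", "##.", ".#.", ".#.", "###"),
}


def hamming(a, b):
    """Count the pixels that differ between two same-sized bitmaps."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def encode(text, max_diff):
    """Store each distinct glyph once and reuse 'close enough' ones.

    A glyph is replaced by the first stored symbol that differs from it
    by at most max_diff pixels. max_diff=0 only reuses exact matches.
    """
    dictionary = []  # (label, bitmap) pairs, in order of first appearance
    out = []
    for ch in text:
        bitmap = GLYPHS[ch]
        for label, stored in dictionary:
            if hamming(bitmap, stored) <= max_diff:
                out.append(label)  # reuse the stored symbol
                break
        else:
            dictionary.append((ch, bitmap))  # no match: store a new symbol
            out.append(ch)
    return "".join(out)


print(encode("1688", max_diff=0))  # exact matches only
print(encode("1688", max_diff=1))  # tolerant matching
```

In this toy font the "6" and "8" differ by a single pixel, so a tolerance of one pixel is enough for the 8s to be drawn with the stored 6. The output still looks like a clean, sharp digit, which is why this kind of error is so hard to spot.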


"Reasonable settings" meaning the no-compression setting, which was mentioned as not having the issue.


Thanks, I would love a 45 MB one-page PDF.


I would love not to send a scanned document with different numbers to my accountant, or to the IRS for that matter.



