Error reading input file input.pdf: 'utf-8' codec can't decode byte 0x80 in position 10: invalid start byte