ѰÇóרҵ֧³Ö
Èç¹ûÄú³¢ÊÔÁËÒÔÉÏËùÓз½·¨£¬ÈÔÈ»ÎÞ·¨½â¾öÂÒÂëÎÊÌ⣬¿ÉÒÔѰÇóרҵ¼¼ÊõÖ§³Ö¡£¿ÉÒÔÁªÏµÈí¼þ¿ª·¢ÉÌ»ò¼¼ÊõÂÛ̳£¬Ñ°Çó¸üרҵµÄ°ïÖú¡£
ͨ¹ýÒÔÉÏÏêϸµÄ?½â¾ö·½·¨ºÍ·ÖÇøÂÒÂë¹ÊÕÏÅŲ鲽Ö裬ÏàÐÅÄú¿ÉÒÔÇáËɸ㶨¹ú²úӰƬÂÒÂëÎÊÌâ£¬ÖØÊ°¹ÛÓ°µÄÓäÔã¡Ï£ÍûÕâÆªÈíÎÄÄÜΪÄú´øÀ´Êµ¼ÊµÄ°ïÖú£¬×£Äú¹ÛÓ°Óä¿ì£¡
½â¾ö·½°¸£º
ͳһ±àÂë±ê×¼£ºÑ¡ÔñUTF-8×÷ΪӦÓóÌÐòµÄͳһ±àÂë±ê×¼¡£×ÖÌ弿ÈÝÐÔ£ºÈ·±£ËùÓÐÆ½Ì¨É϶¼ÄÜʹÓÃͨÓÃ×ÖÌ壬ÈçArial¡¢HelveticaµÈ£¬ÒÔ±ÜÃâ×ÖÌ弿ÈÝÐÔÎÊÌâ¡£Êý¾ÝУÑ飺ÔÚÊý¾Ý´«ÊäºÍ½ÓÊÕʱ£¬Ìí¼ÓУÑéºÍ»úÖÆ£¬ÒÔÈ·±£Êý¾ÝÍêÕûÐÔ¡£
ͨ¹ýÕâЩ´ëÊ©£¬ÎÒÃÇÄܹ»ÔÚ¿çÆ½Ì¨Ó¦ÓóÌÐòÖУ¬±£Ö¤Îı¾Êý¾ÝµÄ׼ȷ´«ÊäºÍÏÔʾ¡£
×Ô¶¯»¯´¦Àí
importchardetimportcodecsdefdetect_and_convert_encoding(file_path):#¼ì²âÎļþ±àÂëwithopen(file_path,'rb')asfile:raw_data=file.read()result=chardet.detect(raw_data)encoding=result'encoding'#´ò?¿ªÎļþ²¢¶ÁÈ¡ÄÚÈÝwithcodecs.open(file_path,'r',encoding=encoding,errors='replace')asfile:content=file.read()#ͳһ±àÂë¸ñʽΪUTF-8utf8_content=content.encode('utf-8',errors='replace')#±£´æÐÞ¸´ºóµÄÎļþwithcodecs.open('repaired_'+file_path,'w',encoding='utf-8')asfile:file.write(utf8_content.decode('utf-8'))#ʹÓÃʾÀýdetect_and_convert_encoding('example.txt')
ʹÓ÷½·¨
ÏÂÔØºÍ°²×°£º·ÃÎÊMultiDecodePro¹Ù·½ÍøÕ¾£¬ÏÂÔØ×îаæ?±¾µÄÈí¼þ²¢½øÐа²×°¡£´ò¿ªÎļþ£ºÆô¶¯Èí¼þºó£¬Ñ¡Ôñ¡°´ò?¿ªÎļþ¡±¹¦ÄÜ£¬¼ÓÔØÐèÒª½âÂëµÄÎļþ¡£Ñ¡Ôñ±àÂ룺ÔÚÈí¼þ½çÃæÖУ¬Ñ¡ÔñÎļþµÄÔʼ±àÂ룬Ȼºóµã»÷¡°½âÂ롱°´Å¥¡£Ô¤ÀÀºÍ±£´æ£º½âÂëÍê³Éºó£¬¿ÉÒÔÔÚÔ¤ÀÀ´°¿ÚÖв鿴½á¹û£¬È·±£Ã»ÓÐÂÒÂëºó£¬Ñ¡Ôñ¡°±£´æ¡±¹¦Äܱ£´æ?Îļþ¡£
¶àÓïÑÔ¿ª·¢µÄ»ù±¾ÔÔò
³é?Ï󻯺ÍÄ£¿é»¯½«ÓïÑÔÏà¹ØµÄ´úÂë³éÏ󻯣¬½«²»Í¬ÓïÑÔµÄÎı¾´æ´¢ÔÚ¶ÀÁ¢µÄÎļþ»òÊý¾Ý¿âÖС£ÕâÑù¿ÉÒÔ·½±ãµØ½øÐÐÓïÑÔµÄÇл»ºÍ¸üС£¹ú¼Ê»¯£¨i18n£©ºÍ±¾µØ»¯£¨l10n£©¹ú¼Ê»¯ÊÇÖ¸¿ª·¢Ó¦ÓÃʱ£¬Ê¹Æä¾ßÓпÉÀ©Õ¹ÐÔ£¬ÒÔ±ãÔÚ²»¸Ä±ä´úÂëµÄÇé¿öÏ£¬Ö§³Ö¶àÖÖÓïÑÔºÍÇøÓò¡£
±¾µØ»¯ÔòÊÇÖ¸Õë¶ÔÌØ¶¨ÓïÑÔºÍÎÄ»¯£¬¶Ô¹ú¼Ê»¯Ó¦ÓýøÐб¾µØ»¯¸ÄÔì¡£
У¶Ô£º³Â¼ÎÓ³(p6mu9CWFoIx7YFddy4eQTuEboRc9VR7b9b)


