This article is a mirror article of machine translation, please click here to jump to the original article.

View: 9156|Reply: 0

node.js crawling pages of GBK websites

[Copy link]
Posted on 9/26/2016 10:18:29 AM | | | |
Our code farmer network is the web page encoding of GBK, if you directly grab it, there will be a garbled phenomenon.

Because node.js default is UTF-8 format, that is, after capture, it will be treated as UTF-8, resulting in garbled characters.

Need to transcode!!



Code:



Note: Packages that reference iconv-lite and bufferhelper are required

npm install iconv-lite -g
npm install bufferhelper -g




Previous:C# develops the POP3 client of the email to receive the email
Next:discuz keeps the logged in cookie (session)
Disclaimer:
All software, programming materials or articles published by Code Farmer Network are only for learning and research purposes; The above content shall not be used for commercial or illegal purposes, otherwise, users shall bear all consequences. The information on this site comes from the Internet, and copyright disputes have nothing to do with this site. You must completely delete the above content from your computer within 24 hours of downloading. If you like the program, please support genuine software, purchase registration, and get better genuine services. If there is any infringement, please contact us by email.

Mail To:help@itsvse.com