This article is a mirror article of machine translation, please click here to jump to the original article.

View: 106044|Reply: 24

[WinForm] ASP.NET web crawler

[Copy link]
Posted on 11/6/2017 6:36:20 PM | | | |
Many crawlers on the Internet are written in python, and some time ago, a aps.net simple crawler was also written, which can crawl the data you want to crawl. Nowadays, many websites have made a backcrawling mechanism, which makes it very difficult for crawlers to scrape data. There are probably several ways to reverse crawl most websites: there are verification codes, IP addresses, blacklists, etc., and some more advanced reverse crawling methods.
This crawler has also taken some measures to deal with anti-crawling, bypassing verification codes, using proxies, etc., paste some of the code below, discuss and learn with you, please correct what is wrong!
This crawler is mainly aimed at a certain website.

After entering the URL, you can crawl back the data according to the URL, and then filter and clean the data through XPath to obtain the data you want
To bypass backcrawling, you can use a proxy IP to access, you can download or grab a high-hiding IP on the Internet, and then randomly switch the proxy IP to grab
The above code is to first determine whether the switched IP is accessible
Look at the source code for the specific code, and provide the source code!

Source code download
Tourists, if you want to see the hidden content of this post, pleaseReply

Score

Number of participants3MB+3 contribute+3 Collapse reason
A little novice who loves to learn + 1 + 1 Very powerful!
moxuan + 1 + 1 Support the landlord to post a good post
Little scum + 1 + 1 Very powerful!

See all ratings





Previous:{:1_7:} {:1_9:}
Next:Reset the vs2017 development environment
 Landlord| Posted on 11/7/2017 9:30:14 AM |
Published on 2017-11-6 18:44
I have sorted out the content of the post for you

Thanks, I just wanted to delete a duplicate! Thank you for your hard work!
Posted on 12/13/2019 10:32:09 AM |
I want to know what that stored procedure you wrote is like, man.
Posted on 11/6/2017 6:44:57 PM |
I have sorted out the content of the post for you   
Posted on 11/7/2017 3:00:04 PM |
Thank you for sharing, let's take a look
Posted on 11/8/2017 3:46:42 PM |
Look at the source code first
Posted on 11/10/2017 5:14:31 PM |
ASP.NET web crawler
Posted on 12/8/2017 10:15:43 PM |
Learn to learn
Posted on 12/10/2017 8:25:22 AM |
ASP.NET web crawler good idea!
Posted on 12/23/2017 8:54:35 PM |
ASP.NET web crawler
Posted on 4/16/2019 11:46:03 AM |
Thank you for sharing, learn from it.
Disclaimer:
All software, programming materials or articles published by Code Farmer Network are only for learning and research purposes; The above content shall not be used for commercial or illegal purposes, otherwise, users shall bear all consequences. The information on this site comes from the Internet, and copyright disputes have nothing to do with this site. You must completely delete the above content from your computer within 24 hours of downloading. If you like the program, please support genuine software, purchase registration, and get better genuine services. If there is any infringement, please contact us by email.

Mail To:help@itsvse.com