ASP.NET web crawler

Little gray hat · Posted on 11/6/2017 6:36:20 PM

Many crawlers on the Internet are written in python, and some time ago, a aps.net simple crawler was also written, which can crawl the data you want to crawl. Nowadays, many websites have made a backcrawling mechanism, which makes it very difficult for crawlers to scrape data. There are probably several ways to reverse crawl most websites: there are verification codes, IP addresses, blacklists, etc., and some more advanced reverse crawling methods.
This crawler has also taken some measures to deal with anti-crawling, bypassing verification codes, using proxies, etc., paste some of the code below, discuss and learn with you, please correct what is wrong!
This crawler is mainly aimed at a certain website.

After entering the URL, you can crawl back the data according to the URL, and then filter and clean the data through XPath to obtain the data you want

Login is visible.

To bypass backcrawling, you can use a proxy IP to access, you can download or grab a high-hiding IP on the Internet, and then randomly switch the proxy IP to grab

Login is visible.

The above code is to first determine whether the switched IP is accessible
Look at the source code for the specific code, and provide the source code!

Source code download

Tourists, if you want to see the hidden content of this post, pleaseReply

Little gray hat · Posted on 11/7/2017 9:30:14 AM

Published on 2017-11-6 18:44
I have sorted out the content of the post for you

Thanks, I just wanted to delete a duplicate! Thank you for your hard work!

18479403 · Posted on 12/13/2019 10:32:09 AM

I want to know what that stored procedure you wrote is like, man.

Little scum · Posted on 11/6/2017 6:44:57 PM

I have sorted out the content of the post for you

lightweight · Posted on 11/7/2017 3:00:04 PM

Thank you for sharing, let's take a look

dotnet_charlay · Posted on 11/8/2017 3:46:42 PM

Look at the source code first

do827261756 · Posted on 11/10/2017 5:14:31 PM

ASP.NET web crawler

Little monkey · Posted on 12/8/2017 10:15:43 PM

Learn to learn

zherp · Posted on 12/10/2017 8:25:22 AM

ASP.NET web crawler good idea!

cd37ycs · Posted on 12/23/2017 8:54:35 PM

ASP.NET web crawler

Naughty Rooster · Posted on 4/16/2019 11:46:03 AM

Thank you for sharing, learn from it.

[WinForm] ASP.NET web crawler

Score

Related Posts

Sections viewed