Baidu can not crawl CloudFlare CDN #81

Open
huan opened this issue Nov 5, 2019 · 3 comments
Assignees
Labels
bug Something isn't working

Comments

@huan
Member

huan commented Nov 5, 2019

This problem prevents Baidu from indexing the pre-angel.com website at all (zero pages indexed as of 5 Nov 2019).

From: https://ziyuan.baidu.com/dashboard/index?site=http://www.pre-angel.com/

Crawled Failure

image

HTTP Error 040 (???)

image

HTTP Error 403

image

@huan huan added the bug Something isn't working label Nov 5, 2019
@huan huan self-assigned this Nov 5, 2019
@Yang2001-created

Hi Huan, I've run into exactly the same issue. Have you solved the crawler problem yet? Thanks!

@huan
Member Author

huan commented Mar 18, 2021

Hi, Yang,

Unfortunately, I did not solve this problem.

I no longer use the Cloudflare CDN; I switched to a server in Singapore running nginx.

Please let me know if you find a solution for this issue in the future, thanks!

@Yang2001-created

Hi Huan,

We've added a user-agent rule to allow the Baidu spider, and the Cloudflare firewall shows that requests from the Baidu crawler are being allowed.

But it still doesn't work: HTTP/1.1 403 Forbidden, Crawled Failure.

I asked Cloudflare customer service about this, but they couldn't find the cause either.

Thanks anyway.
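
For anyone trying to reproduce the 403 reported above, here is a minimal sketch that requests the same URL twice, once with a Baiduspider-style User-Agent and once with a browser-style one, and compares the status codes. The User-Agent strings and the helper names (`build_request`, `fetch_status`, `compare`) are assumptions for illustration, not something taken from this thread. Note also that a request sent from a non-Baidu IP address with a spoofed User-Agent may be handled differently from a genuine crawler request, so the result is only indicative.

```python
import urllib.error
import urllib.request

# Assumed User-Agent strings for illustration; the real crawler may send others.
BAIDUSPIDER_UA = (
    "Mozilla/5.0 (compatible; Baiduspider/2.0; "
    "+http://www.baidu.com/search/spider.html)"
)
BROWSER_UA = (
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0 Safari/537.36"
)


def build_request(url, user_agent):
    """Build a GET request that carries the given User-Agent header."""
    return urllib.request.Request(
        url, headers={"User-Agent": user_agent}, method="GET"
    )


def fetch_status(url, user_agent, timeout=10):
    """Return the HTTP status code, including error codes such as 403."""
    request = build_request(url, user_agent)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx; the code is what we want to compare.
        return err.code


def compare(url):
    """Fetch the URL with both User-Agents and return the status codes."""
    return {
        "baiduspider": fetch_status(url, BAIDUSPIDER_UA),
        "browser": fetch_status(url, BROWSER_UA),
    }
```

Calling `compare("http://www.pre-angel.com/")` returns a dictionary with one status code per User-Agent. If the two codes differ, a User-Agent-based rule is probably involved. If both succeed, the block is more likely tied to the crawler's IP range or to another check that this sketch cannot imitate.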
