Description
Before creating an issue, first upgrade cfscrape with pip install -U cfscrape and see if you're still experiencing the problem. Please also confirm your Node version (node --version or nodejs --version) is version 10 or higher.
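The Node version check above can also be scripted. The following is a minimal sketch, assuming Python 3.7 or newer; the helper names parse_node_major and installed_node_major are hypothetical and are not part of cfscrape.

```python
import re
import shutil
import subprocess


def parse_node_major(version_output):
    """Return the major version from output such as 'v10.24.1', or None."""
    match = re.match(r"v?(\d+)\.", version_output.strip())
    return int(match.group(1)) if match else None


def installed_node_major():
    """Try `node`, then `nodejs`; return the major version, or None if neither is found."""
    for binary in ("node", "nodejs"):
        path = shutil.which(binary)
        if path is None:
            continue  # this binary name is not on PATH, try the next one
        result = subprocess.run([path, "--version"], capture_output=True, text=True)
        major = parse_node_major(result.stdout)
        if major is not None:
            return major
    return None
```

Calling installed_node_major() and comparing the result against 10 gives the same answer as reading the output of node --version by hand.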
Make sure the website you're having issues with is actually using anti-bot protection by Cloudflare and not a competitor like Imperva Incapsula or Sucuri. And if you're using an anonymizing proxy, a VPN, or Tor, Cloudflare often flags those IPs and may block you or present you with a captcha as a result.
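One rough way to check whether a site is served through Cloudflare is to look at its response headers. The sketch below is only a heuristic: it assumes Cloudflare's usual markers, a Server header containing "cloudflare" or a CF-RAY header, and the function name looks_like_cloudflare is hypothetical.

```python
def looks_like_cloudflare(headers):
    """Heuristic: True if the response headers carry Cloudflare's usual markers."""
    # Header names are case-insensitive, so normalize before comparing.
    lowered = {str(k).lower(): str(v).lower() for k, v in headers.items()}
    return "cf-ray" in lowered or "cloudflare" in lowered.get("server", "")
```

For example, looks_like_cloudflare(requests.get(url).headers) could be run against the page in question. A False result suggests the protection comes from another vendor, though it is not proof either way.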
Please confirm the following statements and check the boxes before creating an issue:
- I've upgraded cfscrape with pip install -U cfscrape
- I'm using Node version 10 or higher
- The site protection I'm having issues with is from Cloudflare
- I'm not using Tor, a VPN, or an anonymizing proxy
Python version number
Run python --version and paste the output below:
Python 2.7.18
cfscrape version number
Run pip show cfscrape and paste the output below:
Name: cfscrape
Version: 2.1.1
Summary: A simple Python module to bypass Cloudflare's anti-bot page. See https://github.com/Anorov/cloudflare-scrape for more information.
Home-page: https://github.com/Anorov/cloudflare-scrape
Author: Anorov
Author-email: anorov.vorona@gmail.com
License: UNKNOWN
Location: /home/sshuser/.local/lib/python3.9/site-packages
Requires: requests
Required-by:
Code snippet involved with the issue
from random import randint

import cfscrape
import requests

url = "https://www.investing.com/commodities/us-cotton-no.2"
session = requests.Session()

# Form fields for the historical-data request.
params = {
    "curr_id": 8851,
    "smlID": str(randint(1000000, 99999999)),
    "header": "US Cotton #2 Futures Historical Data",
    "interval_sec": "Daily",
    "sort_col": "date",
    "sort_ord": "DESC",
    "action": "historical_data",
}
head = {
    "User-Agent": "Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.1b3) Gecko/20090305"
                  " Firefox/3.1b3 GTB5",
    "X-Requested-With": "XMLHttpRequest",
    "Accept": "text/html",
    "Accept-Encoding": "gzip, deflate",
    "Connection": "keep-alive",
}

# Wrap the existing session; delay=10 overrides the wait before the challenge answer is submitted.
scrapers = cfscrape.create_scraper(
    sess=session,
    delay=10
)
# The form fields are sent as the request body (data=) of a GET request.
print(scrapers.get(url, headers=head, data=params).content)
Complete exception and traceback
(If the problem doesn't involve an exception being raised, leave this blank)
URL of the Cloudflare-protected page
https://www.investing.com/commodities/us-cotton-no.2