Scrapy 1.2 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. 3.11. Settings 93 Scrapy Documentation, Release 1.2.3 CONCURRENT_REQUESTS_PER_IP Default: CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_SECRET_ACCESS_KEY • BOT_NAME • CLOSESPIDER_ERRORCOUNT • CLOSESPIDER_ITEMCOUNT0 码力 | 266 页 | 1.10 MB | 1 年前3Scrapy 1.1 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. CONCURRENT_REQUESTS_PER_IP Default: 0 The maximum number of concurrent (ie. simultaneous) CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_SECRET_ACCESS_KEY • BOT_NAME • CLOSESPIDER_ERRORCOUNT • CLOSESPIDER_ITEMCOUNT0 码力 | 260 页 | 1.12 MB | 1 年前3Scrapy 1.3 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. CONCURRENT_REQUESTS_PER_IP Default: 0 The maximum number of concurrent (ie. simultaneous) CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_SECRET_ACCESS_KEY • BOT_NAME • CLOSESPIDER_ERRORCOUNT • CLOSESPIDER_ITEMCOUNT0 码力 | 272 页 | 1.11 MB | 1 年前3Scrapy 1.5 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. CONCURRENT_REQUESTS_PER_IP Default: 0 The maximum number of concurrent (ie. simultaneous) CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_SECRET_ACCESS_KEY • BOT_NAME • CLOSESPIDER_ERRORCOUNT • CLOSESPIDER_ITEMCOUNT0 码力 | 285 页 | 1.17 MB | 1 年前3Scrapy 1.6 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. CONCURRENT_REQUESTS_PER_IP Default: 0 The maximum number of concurrent (ie. simultaneous) CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_ENDPOINT_URL • AWS_REGION_NAME • AWS_SECRET_ACCESS_KEY • AWS_USE_SSL0 码力 | 295 页 | 1.18 MB | 1 年前3Scrapy 1.7 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. CONCURRENT_REQUESTS_PER_IP Default: 0 The maximum number of concurrent (ie. simultaneous) CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_ENDPOINT_URL • AWS_REGION_NAME 120 Chapter 3. Basic concepts Scrapy0 码力 | 306 页 | 1.23 MB | 1 年前3Scrapy 1.8 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. 3.11. Settings 113 Scrapy Documentation, Release 1.8.4 CONCURRENT_REQUESTS_PER_IP Default: CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_ENDPOINT_URL • AWS_REGION_NAME • AWS_SECRET_ACCESS_KEY • AWS_USE_SSL0 码力 | 335 页 | 1.44 MB | 1 年前3Scrapy 0.16 Documentation
crawl. 5.5.1 Increase concurrency Concurrency is the number of requests that are processed in parallel. There is a global limit and a per-domain limit. The default global concurrency limit in Scrapy is identifying at what concurrency your Scrapy process gets CPU bounded. For optimum performance, You should pick a concurrency where CPU usage is at 80-90%. To increase the global concurrency use: CONCURRENT_REQUESTS extension builds on that premise. 5.12.3 Throttling algorithm This adjusts download delays and concurrency based on the following rules: 1. spiders always start with one concurrent request and a download0 码力 | 203 页 | 931.99 KB | 1 年前3Scrapy 1.4 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. CONCURRENT_REQUESTS_PER_IP Default: 0 The maximum number of concurrent (ie. simultaneous) CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle AUTOTHROTTLE_ENABLED • AUTOTHROTTLE_MAX_DELAY • AUTOTHROTTLE_START_DELAY • AUTOTHROTTLE_TARGET_CONCURRENCY • AWS_ACCESS_KEY_ID • AWS_SECRET_ACCESS_KEY • BOT_NAME • CLOSESPIDER_ERRORCOUNT • CLOSESPIDER_ITEMCOUNT0 码力 | 281 页 | 1.15 MB | 1 年前3Scrapy 2.10 Documentation
performed to any single domain. See also: AutoThrottle extension and its AUTOTHROTTLE_TARGET_CONCURRENCY option. CONCURRENT_REQUESTS_PER_IP Default: 0 The maximum number of concurrent (i.e. simultaneous) CONCURRENT_REQUESTS_PER_DOMAIN setting is ignored, and this one is used instead. In other words, concurrency limits will be applied per IP, not per domain. This setting also affects DOWNLOAD_DELAY and AutoThrottle lower the effective per-domain concurrency below CONCURRENT_REQUESTS_PER_DOMAIN. If the response time of a domain is lower than DOWNLOAD_DELAY, the effective concurrency for that domain is 1. When testing0 码力 | 419 页 | 1.73 MB | 1 年前3
共 62 条
- 1
- 2
- 3
- 4
- 5
- 6
- 7