Scrapy 1.4 Documentation
rules to crawl your websites. Selectors Extract the data from web pages using XPath. Scrapy shell Test your extraction code in an interactive environment. Items Define the data you want to scrape. Item reports. bench New in version 0.17. Syntax: scrapy bench Requires project: no Run a quick benchmark test. Benchmarking. Custom project commands You can also add your custom project commands by using the org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts- with() or contains() are not0 码力 | 394 页 | 589.10 KB | 1 年前3Scrapy 2.10 Documentation
useful for bug reports. bench • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 34 Chapter 3. Basic concepts Scrapy Documentation, Release 2.10.1 3.1.6 Custom project manipulation 60 Chapter 3. Basic concepts Scrapy Documentation, Release 2.10.1 Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient getall() ['link1.html', 'link2.html', 'link3.html', 'link4.html', 'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').getall() ['link1.html', 'link2.html', 'link4.html', 'link5.html'] Warning:0 码力 | 419 页 | 1.73 MB | 1 年前3Scrapy 2.11.1 Documentation
useful for bug reports. bench • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 34 Chapter 3. Basic concepts Scrapy Documentation, Release 2.11.1 3.1.6 Custom project org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient getall() ['link1.html', 'link2.html', 'link3.html', 'link4.html', 'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').getall() ['link1.html', 'link2.html', 'link4.html', 'link5.html'] Warning:0 码力 | 425 页 | 1.76 MB | 1 年前3Scrapy 2.11 Documentation
useful for bug reports. bench • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 34 Chapter 3. Basic concepts Scrapy Documentation, Release 2.11.1 3.1.6 Custom project org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient getall() ['link1.html', 'link2.html', 'link3.html', 'link4.html', 'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').getall() ['link1.html', 'link2.html', 'link4.html', 'link5.html'] Warning:0 码力 | 425 页 | 1.76 MB | 1 年前3Scrapy 2.11.1 Documentation
useful for bug reports. bench • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 34 Chapter 3. Basic concepts Scrapy Documentation, Release 2.11.1 3.1.6 Custom project org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient getall() ['link1.html', 'link2.html', 'link3.html', 'link4.html', 'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').getall() ['link1.html', 'link2.html', 'link4.html', 'link5.html'] Warning:0 码力 | 425 页 | 1.79 MB | 1 年前3Scrapy 2.9 Documentation
useful for bug reports. bench • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 34 Chapter 3. Basic concepts Scrapy Documentation, Release 2.9.0 3.1.6 Custom project org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient getall() ['link1.html', 'link2.html', 'link3.html', 'link4.html', 'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').getall() ['link1.html', 'link2.html', 'link4.html', 'link5.html'] 600 码力 | 409 页 | 1.70 MB | 1 年前3Scrapy 1.5 Documentation
bench New in version 0.17. • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 3.1.5 Custom project commands You can also add your custom project commands by using org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient [u'link1.html', u'link2.html', u'link3.html', u'link4.html', u'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').extract() [u'link1.html', u'link2.html', u'link4.html', u'link5.html']0 码力 | 285 页 | 1.17 MB | 1 年前3Scrapy 2.4 Documentation
useful for bug reports. bench • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 3.1. Command line tool 31 Scrapy Documentation, Release 2.4.1 3.1.6 Custom project org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient getall() ['link1.html', 'link2.html', 'link3.html', 'link4.html', 'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').getall() ['link1.html', 'link2.html', 'link4.html', 'link5.html'] 30 码力 | 354 页 | 1.39 MB | 1 年前3Scrapy 1.6 Documentation
bench New in version 0.17. • Syntax: scrapy bench • Requires project: no Run a quick benchmark test. Benchmarking. 3.1.6 Custom project commands You can also add your custom project commands by using org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts-with() or contains() are not sufficient getall() ['link1.html', 'link2.html', 'link3.html', 'link4.html', 'link5.html'] >>> sel.xpath('//li[re:test(@class, "item-\d$")]//@href').getall() ['link1.html', 'link2.html', 'link4.html', 'link5.html'] >>>0 码力 | 295 页 | 1.18 MB | 1 年前3Scrapy 2.11 Documentation
rules to crawl your websites. Selectors Extract the data from web pages using XPath. Scrapy shell Test your extraction code in an interactive environment. Items Define the data you want to scrape. Item useful for bug reports. bench Syntax: scrapy bench Requires project: no Run a quick benchmark test. Benchmarking. Custom project commands You can also add your custom project commands by using the org/regular-expressions regular expressions set http://exslt.org/sets set manipulation Regular expressions The test() function, for example, can prove quite useful when XPath’s starts- with() or contains() are not0 码力 | 528 页 | 706.01 KB | 1 年前3
共 62 条
- 1
- 2
- 3
- 4
- 5
- 6
- 7