Scrapy 1.4 Documentation
sources using extended CSS selectors and XPath expressions, with helper methods to extract using regular expressions. An interactive shell console (IPython aware) for trying out the CSS and XPath expressions efficient XML and HTML parser parsel [https://pypi.python.org/pypi/parsel], an HTML/XML data extraction library written on top of lxml, w3lib [https://pypi.python.org/pypi/w3lib], a multi-purpose helper for dealing Besides the extract() and extract_first() methods, you can also use the re() method to extract using regular expressions: >>> response.css('title::text').re(r'Quotes.*') ['Quotes to Scrape'] >>> response.css('title::text')0 码力 | 394 页 | 589.10 KB | 1 年前3Scrapy 0.24 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath [http://www.w3.org/TR/xpath] for selecting name is contained inside atag:
Darwin - The Evolution Of An Exhibition
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag exhibit about Charles Darwin in conjunction with the 200th anniversary of his birth. ... An XPath expression to select the description could be: //div[@id='description'] Finally, the file size is contained0 码力 | 298 页 | 544.11 KB | 1 年前3Scrapy 0.14 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath [http://www.w3.org/TR/xpath] for selecting that the file name is contained inside atag:
Home[2009][Eng]XviD-ovd
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag depletion of natural resources and the catastrophic evolution of the Earth's climate. ... An XPath expression to select the description could be: //div[@id='description'] Finally, the file size is contained0 码力 | 235 页 | 490.23 KB | 1 年前3Scrapy 0.22 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath [http://www.w3.org/TR/xpath] for selecting name is contained inside atag:
Darwin - The Evolution Of An Exhibition
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag exhibit about Charles Darwin in conjunction with the 200th anniversary of his birth. ... An XPath expression to select the description could be: //div[@id='description'] Finally, the file size is contained0 码力 | 303 页 | 566.66 KB | 1 年前3Scrapy 0.20 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath for selecting the data to extract from that the file name is contained inside atag:
Home[2009][Eng]XviD-ovd
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag depletion of natural resources and the catastrophic evolution of the Earth’s climate. ... An XPath expression to select the description could be: //div[@id=’description’] Finally, the file size is contained0 码力 | 197 页 | 917.28 KB | 1 年前3Scrapy 0.24 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath for selecting the data to extract from name is contained inside atag:
Darwin - The Evolution Of An Exhibition
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag exhibit about Charles Darwin in conjunction with the 200th anniversary of his birth. ... An XPath expression to select the description could be: //div[@id='description'] Finally, the file size is contained0 码力 | 222 页 | 988.92 KB | 1 年前3Scrapy 0.9 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. For extracting data we’ll use XPath [http://www.w3.org/TR/xpath] that the file name is contained inside atag:
Home[2009][Eng]XviD-ovd
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag depletion of natural resources and the catastrophic evolution of the Earth's climate. ... An XPath expression to select the description could be: //div[@id='description'] Finally, the file size is contained0 码力 | 204 页 | 447.68 KB | 1 年前3Scrapy 0.22 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath for selecting the data to extract from name is contained inside atag:
Darwin - The Evolution Of An Exhibition
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag exhibit about Charles Darwin in conjunction with the 200th anniversary of his birth. ... An XPath expression to select the description could be: //div[@id=’description’] Finally, the file size is contained0 码力 | 199 页 | 926.97 KB | 1 年前3Scrapy 0.20 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath [http://www.w3.org/TR/xpath] for selecting that the file name is contained inside atag:
Home[2009][Eng]XviD-ovd
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag depletion of natural resources and the catastrophic evolution of the Earth's climate. ... An XPath expression to select the description could be: //div[@id='description'] Finally, the file size is contained0 码力 | 276 页 | 564.53 KB | 1 年前3Scrapy 0.16 Documentation
http://www.mininova.org/tor/NUMBER where NUMBER is an integer. We’ll use that to construct the regular expression for the links to follow: /tor/\d+. We’ll use XPath [http://www.w3.org/TR/xpath] for selecting that the file name is contained inside atag:
Home[2009][Eng]XviD-ovd
An XPath expression to extract the name could be: //h1/text() And the description is contained inside atag depletion of natural resources and the catastrophic evolution of the Earth's climate. ... An XPath expression to select the description could be: //div[@id='description'] Finally, the file size is contained0 码力 | 272 页 | 522.10 KB | 1 年前3共 62 条- 1
- 2
- 3
- 4
- 5
- 6
- 7