Skip to content

Commit 599edbf

Browse files
authored
Update README.md
1 parent 92b326d commit 599edbf

File tree

1 file changed

+46
-3
lines changed

1 file changed

+46
-3
lines changed

Scrapy/README.md

Lines changed: 46 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -112,7 +112,7 @@ for href in response.xpath("//h2[contains(@class, 'title headline-font')]/a[cont
112112
url = "https:" + href.extract()
113113
```
114114

115-
2. Exit Scrapy Shell using <b>exit()</b>. We do this while we should now understand the structure of where individual campaigns links are, we havent looked at where things are on individual campaigns.
115+
2. Exit Scrapy Shell by typing <b>exit()</b>. We do this while we should now understand the structure of where individual campaigns links are, we havent looked at where things are on individual campaigns.
116116

117117
![](https://github.com/mGalarnyk/Python_Tutorials/blob/master/Scrapy/Tutorial_Images/exitScrapyShell.png)
118118
<br>
@@ -142,7 +142,7 @@ In command line type (windows):
142142
scrapy shell "https://fundrazr.com/savemyarm"
143143
```
144144

145-
The code to get the title is
145+
The code to get the campaign title is
146146

147147
```
148148
response.xpath("//div[contains(@id, 'campaign-title')]/descendant::text()").extract()[0]
@@ -151,7 +151,50 @@ response.xpath("//div[contains(@id, 'campaign-title')]/descendant::text()").extr
151151
![](https://github.com/mGalarnyk/Python_Tutorials/blob/master/Scrapy/Tutorial_Images/GettingTitleIndividualCampaignShell.png)
152152
<br>
153153

154-
4. We can do the same for the other parts of the page.
154+
4. We can do the same for the other parts of the page.
155+
156+
amount Raised:
157+
158+
```
159+
160+
```
161+
162+
goal:
163+
```
164+
165+
```
166+
167+
currency type:
168+
169+
```
170+
171+
```
172+
173+
campaign end date:
174+
```
175+
176+
```
177+
178+
number of contributors:
179+
```
180+
181+
```
182+
183+
story:
184+
```
185+
186+
```
187+
188+
url:
189+
```
190+
191+
```
192+
193+
5. Exit scrapy shell by typing:
194+
195+
```
196+
exit()
197+
```
155198

156199
## Items
157200
The main goal in scraping is to extract structured data from unstructured sources, typically, web pages. Scrapy spiders can return the extracted data as Python dicts. While convenient and familiar, Python dicts lack structure: it is easy to make a typo in a field name or return inconsistent data, especially in a larger project with many spiders.

0 commit comments

Comments
 (0)