Newbie
Join Date: Oct 2006
Posts: 23
Rep Power: 0
    import re
    import urllib

    def find_all_items(site):
        site = urllib.urlopen(site).read()
        all_items = re.findall(r'watch\.asp\?\w+\=\w*\&\w*\=\w+', site)
        next_page = re.findall(r'Watches\.asp\?\w+\=[0-9]+\&pg\=\w+', site)
        try:
            # follow the link to the next page, if there is one
            find_all_items(urllib.basejoin("<some url>", next_page[0]))
        except IndexError:
            pass
        return remove_dups(all_items)  # remove_dups removes all the duplicate items

How do I keep the value of all_items that I have from the first function call, and return one list of all the items at once that also includes the items from the second (recursive) call? Right now the list is being overwritten.
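One possible way to approach this: the recursive call does return the next page's list, but that return value is never used, so only the current page's items come back. A minimal sketch follows, assuming Python 3 (`urllib.request` and `urllib.parse.urljoin` in place of `urllib.urlopen` and `urllib.basejoin`). Several things here are assumptions rather than part of the original post: the `fetch` parameter, the `fetch_url` helper, the `seen_pages` guard against looping between pages, the stand-in body for `remove_dups`, and resolving the next-page link against the current page's URL instead of the elided base URL.

```python
import re
from urllib.parse import urljoin
from urllib.request import urlopen

# Same patterns as in the post, minus the unnecessary escapes on '=' and '&'.
ITEM_RE = re.compile(r'watch\.asp\?\w+=\w*&\w*=\w+')
NEXT_RE = re.compile(r'Watches\.asp\?\w+=[0-9]+&pg=\w+')


def remove_dups(items):
    """Stand-in for the poster's helper: drop duplicates, keep first-seen order."""
    return list(dict.fromkeys(items))


def fetch_url(url):
    """Hypothetical default fetcher: download a page and decode it as text."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def find_all_items(url, fetch=fetch_url, seen_pages=None):
    """Collect item links from `url` and from every following page."""
    if seen_pages is None:
        seen_pages = set()
    if url in seen_pages:
        # Already visited: stop, so a link back to an earlier page cannot loop.
        return []
    seen_pages.add(url)

    html = fetch(url)
    all_items = ITEM_RE.findall(html)
    next_page = NEXT_RE.findall(html)
    if next_page:
        # The key change: keep what the recursive call returns
        # and append it to this page's items.
        next_url = urljoin(url, next_page[0])
        all_items += find_all_items(next_url, fetch, seen_pages)
    return remove_dups(all_items)
```

Calling `find_all_items(start_url)` would then return one combined, de-duplicated list for the whole chain of pages. Passing `fetch` as a parameter is only a convenience so the function can be tried against canned page text without touching the network.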