parsing, regexp HELP!
I have hit a massive wall with my patch. Basically, what I would like to do is have Max download pictures from a website and display them. I have got to the point where I have the website code downloaded, and I can find the lines of code that have .jpg in them (this is using regexp). I now need to extract the URL from these lines so I can send the message on to the downloader.
The kind of messages I am working with are:
<p><a href="http://www.flickr.com/photos/library_of_congress/6056606287/" title="Photographer Carol M. Highsmith joins Sharon Tyson and visiting children as they interact with a colorful artistic illusion screen at the ArtWorks exhibit in the Museum of Fine Art Montgomery Alabama (LOC)"><img src="http://farm7.staticflickr.com/6183/6056606287_c3d3741afc_m.jpg" width="240" height="162" alt="Photographer Carol M. Highsmith joins Sharon Tyson and visiting children as they interact with a colorful artistic illusion screen at the ArtWorks exhibit in the Museum of Fine Art Montgomery Alabama (LOC)" /></a></p>
Something like this. However, they are not all the same length, so simple list slicing doesn't work. I tried to understand how regexp works, but it's so different from what I'm used to. There must be some way of telling it to look for an http and a jpg and output everything in between as well.
As soon as this is worked out I can carry on and finish my patch!
If you reply with the URL that you are downloading the HTML from I will take a look and see if I can put together an expression that will work for you.
OK, so this post was definitely written in frustration. A cup of tea cleared my head, I read the helpfile and finally understood.
The answer to my question, answered by me, is
From what I understand, this looks for an http, then checks to see if white space appears anywhere, and then looks for a jpg and chucks everything out.
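For anyone reading along, here is a minimal sketch of the same idea in Python rather than in Max. The pattern is an assumption made for illustration and is not necessarily the exact expression used in the patch; the function name `extract_jpg_urls` and the shortened sample line are also made up for the example. It looks for "http", then the shortest run of non-whitespace characters, then ".jpg".

```python
import re

# Assumed pattern (not taken from the original patch): "http", then the
# shortest possible run of non-whitespace characters, then ".jpg".
JPG_URL = re.compile(r"http\S+?\.jpg")


def extract_jpg_urls(line):
    """Return every http...jpg URL found in one line of HTML."""
    return JPG_URL.findall(line)


# Shortened version of the Flickr line from the first post.
sample = (
    '<p><a href="http://www.flickr.com/photos/library_of_congress/6056606287/" '
    'title="Photographer Carol M. Highsmith (LOC)">'
    '<img src="http://farm7.staticflickr.com/6183/6056606287_c3d3741afc_m.jpg" '
    'width="240" height="162" /></a></p>'
)

for url in extract_jpg_urls(sample):
    print(url)
# prints http://farm7.staticflickr.com/6183/6056606287_c3d3741afc_m.jpg
```

Because `\S` refuses white space, a match cannot start at the href link and run on into the img src, so only the picture URL comes out. The pattern has not been tested in the Max regexp object, where the escaping rules may differ.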
:D happy days
Luke – the URL I am using to test is
When the patch is finished I will upload it so you can have a look. Thank you for your offer though, it is much appreciated.