I want to do a project for school that uses jit.uldl and jit.str.regexp to scrape images from google images. I've been using the parser/downloader patch found in the jit.str.regexp help file as an example, but have had little luck. From my understanding, this patch (unedited) is suppose to download the html source from the c74 homepage, send the info as a matrix to jit.str.regexp which parses the html and extracts the gif and jpg files, reconfigures them back into a web address and then sends that back to jit.uldl to be downloaded. Is that right? Anyway, it doesn't appear to be doing that, or I don't know where the images are being downloaded to. As a note, I sometimes get the error jit.str.regexp: PCRE error -10 in the max window.
could anybody help me understand jit.str.exp better, and what exactly is going on in the jit.str.regexp parser/downloader example? does anybody have any good examples of web scraping that could help?
thank you so much!