WebPage Content Analysing


    Jan 27 2009 | 10:29 am
    HI,
    With the following message
    message to jit.uldl
    I get the the html - code text if i connect the jit.textfile object from the first output of jit.uldl.
    If i connect now the outlet of jit.uldl to a
    jit.str.regexp @re Aarau
    I get a "jit.str.regexp PCRE error -10"
    in the Max Window and the jit.str.regexp Object doesn't give out any matches although as you can See at the very beginning there should be a first match with "Aarau".
    I hope anybody is understanding what I am trying to explain.
    sorry

    • Jan 27 2009 | 4:05 pm
      dont know what the error means but is there a reason you need to
      download this into a matrix?
      this example will d/l the html file to the same directory as yr patch. u
      load it into "text" and search for whatever you looking for...
      p
      FHAStudent1 wrote:
      > HI,
      > With the following message
      > http://de.weather.yahoo.com/SZXX/SZXX0001/index_c.html matrix
      >
      > message to jit.uldl
      >
      > I get the the html - code text if i connect the jit.textfile object from the first output of jit.uldl.
      >
      > If i connect now the outlet of jit.uldl to a
      > jit.str.regexp @re Aarau
      > I get a "jit.str.regexp PCRE error -10"
      > in the Max Window and the jit.str.regexp Object doesn't give out any matches although as you can See at the very beginning there should be a first match with "Aarau".
      >
      > I hope anybody is understanding what I am trying to explain.
      > sorry
      >
      >
      >
      > ------------------------------------------------------------------------
      >
      >
      --
      http://pure.test.at
      http://www.myspace.com/pvre
      http://www.heartchamberorchestra.org
    • Jan 27 2009 | 9:02 pm
      I've come across this problem before when using [jit.str.regexp]. I think the error is something to do with not understanding character encodings but I might be wrong. One way that I've managed to get around it is to go directly from your [jit.uldl] to a [jit.textfile] and bang it out of there. The annoying thing is that the first time after you download something you need to open and close the [jit.textfile] window. Here's an example.
      lh