
unexpected output from text/html input

Jun 9, 2011 at 3:17 PM

Hi there,

First of all, thanks for all the hard work. What a great resource. I'm so glad I stumbled across this before (badly) duplicating the project.

I'm comparing the output from the Calais REST service, called from my own basic utility classes making an HTTP POST. The results I get differ wildly from the output I get from . Looking at the results, it appears that the call from is not cleaning the HTML, or is cleaning the HTML differently to the RESTful service on . I've checked that the input parameter is set correctly to text/html, but I can't replicate the same results in my call to the REST service no matter what input type I select.

If I take the trouble to pass in some specific chunks of content rather than the screen-scraped response HTML, then I get equivalent results.
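To illustrate what I mean by passing in chunks of content instead of the raw page, here is a minimal sketch in Python (not the project's own language) that reduces scraped HTML to its visible text before submission. The names `TextExtractor` and `html_to_text` are hypothetical helpers made up for this example, and skipping `<script>` and `<style>` is only my assumption about what a reasonable cleaning step might do. I don't know how the service actually cleans HTML.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects visible text, skipping <script>/<style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0   # > 0 while inside a skipped element
        self._chunks = []      # visible text fragments, in document order

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-blank text that is outside skipped elements.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return " ".join(self._chunks)


def html_to_text(html):
    """Return the visible text of an HTML string as one space-joined line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


sample = (
    "<html><head><style>p{color:red}</style></head>"
    "<body><h1>Title</h1><p>Some <b>body</b> text.</p>"
    "<script>var x = 1;</script></body></html>"
)
print(html_to_text(sample))  # prints "Title Some body text."
```

Submitting the extracted text and the raw HTML separately, each with the matching input type, should at least show whether the difference in results comes from the HTML cleaning step.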

Why is processing the HTML differently to an equivalent call to the RESTful service? Am I doing something wrong, or just fundamentally missing the obvious?