19 Jul '14, 11pm

@fhwang I did some related experiments but it was very "web developer tries naive nlp"

@fhwang I did some related experiments but it was very "web developer tries naive nlp"

A little while back, a Disqus plugin annoyed me in some way. I don't remember how, but I do remember that I tweeted Disqus about it, and they fixed it, mostly. I've decided to return the favor. Yesterday, the MetaOptimize blog post NLP Challenge: Find semantically related terms over a large vocabulary (>1M)? attracted a ton of retweets on Twitter. A Disqus plugin on the blog post adds those RTs to the post as "comments," using a system called BackType . For instance, in this screenshot, the top "comment" comes from a guy called turian, and the bottom comment retweets him. It would be easy to eliminate the pure, classic, literal RTs with a regular expression: next if alleged_comment =~ /$RT @/ But if you look in the middle, there's a nearly identical tweet with no actual "RT" string. That's because it comes from @hntweets , which apparently tweets links found on Hacker News...

Full article: http://gilesbowkett.blogspot.com/2010/11/blog-comment-sim...

Tweets

Ruby Developer (Rails Web Apps) #London #UK #News

jobview.monster.co.uk 20 Jul '14, 2pm

Thank you for your interest in the Ruby Developer (Rails Web Apps) role. Whilst optional, we recommend entering your infor...