Productification

Intersection between Technology, User Experience and Product Innovation


30 Nov

UIMA and GATE Applications over Hadoop


There is a new project on google code called Behemoth that utilizes Hadoop to achieve scale for GATE and UIMA applications. Here is a more detailed post from Julien:

"Behemoth allows to deploy GATE or UIMA applications over a Hadoop cluster in
order to do very large scale document analysis. It uses a very simple
representation format which can be used as a common ground between UIMA and
GATE-generated annotations, hence achieving compatibility between both
systems. Since it is Hadoop-based it benefits from all its features
(scalability, fault-tolerance, etc…) and most notably the back up of a
thriving open source community. Quite a few Apache resources already do or
will fit into it: Nutch, Tika, Mahout, Hbase etc…"

Posted via email from Sameer’s posterous


No Response Filed under: General
29 Nov

Continuous Development - That’s the way it’s done


A presentation by Timothy Fitz on Continuous Development

Posted via web from Sameer’s posterous


No Response Filed under: General
29 Nov

Foursquare, Facebook and Geo


Great post on Techcrunch on Facebook and Foursquare including what could Foursquare do if Facebook introduces and nails Geo.

http://www.techcrunch.com/2009/11/28/facebook-foursquare/

Posted via email from Sameer’s posterous


No Response Filed under: General
20 Nov

Google finally getting that Faceted Navigation is a powerful paradigm


A few days ago Daniel announced that he is joining Google where he would be focusing on changing the simplistic view of the google search (layout and all that), and today Techcrunch has a post that shows Google is flirting with the Search Options more (see screenshot below). It definitely shows that Google is finally understanding that Faceted Navigation (filtering) is a very powerful paradigm that helps users slice and dice results and help them narrow down to the result they are most interested in.

I think we will see a lot of activity from Google in this area as they enrich this functionality more and more. I can see them exposing dynamic faceting capability where based on the higher level type of information / type chosen, the facets that show up would change.

Posted via email from Sameer’s posterous


No Response Filed under: General
18 Nov

How Solr powers Local / Geospatial Applications


Checkout the Webinar on how Solr and Lucene are used to power Local Search and Geo Spatial applications. This includes the work I was involved with at AT&T Interactive building YP.com

http://www.bitpipe.com/detail/RES/1257457967_42.html

Posted via email from Sameer’s posterous


No Response Filed under: Products, Search
15 Nov

Webpage load time as a Ranking Factor


Google has been talking about speed of their search engine as well as published how users react when search takes a longer time. Now, with a short interview with Mike from WebProNews, Matt Cutts says the speed of a site (load time) might become a ranking factor in search.

What does that mean?

- Web Site creators will be forced to ensure that their sites are faster
- Greater demand for performance and scalability gurus
- With speed being a factor in everyone's mind - Google would treat their quick response times as a competitive edge. Other search engines will have to raise the bar and focus a lot on ensuring that the search experience is super fast.

Posted via email from Sameer’s posterous


No Response Filed under: General
15 Nov

Spot Signature Extraction - Near Duplicate Detection


Download now or preview on posterous

SpotSigs.ppt (2964 KB)

Work done by Stanford on Near Duplicate Detection using a technique called "Spot Signature Extraction". Seems like a decent approach for news articles. Planning to try it out on a small data set.

Posted via email from Sameer’s posterous


No Response Filed under: General
14 Nov

User to Developer Ratio at your company?


Interesting to find that Facebook has a user to developer ratio of 1.1 million (detailed post here)!! That's definitely some motivation for the developer as how much impact does his work has on the end users. What's the ratio at your company?

Posted via email from Sameer’s posterous


No Response Filed under: General
13 Oct

Reviews and Ratings on Web Sites


Seems like when it comes to ratings it’s pretty much all or nothing. Great videos prompt action; anything less prompts indifference. Thus, the ratings system is primarily being used as a seal of approval, not as an editorial indicator of what the community thinks about a video. Rating a video joins favoriting and sharing as a way to tell the world that this is something you love.

http://youtube-global.blogspot.com/2009/09/five-stars-dominate-ratings.html

, the company says its ranking system already factors in the number of reviews, whether they come from experienced Yelpers or first-time reviewers, and whether those reviews were voted helpful.

http://venturebeat.com/2009/10/12/how-yelp-deals-with-everybody-getting-four-stars-on-average/


No Response Filed under: General
15 Sep

Frequent Releases are the way to go


Something worth quoting - from Greg Liden.

Frequent releases are desirable because of the changes it forces in software engineering.  It discourages risky, expensive, large projects.  It encourages experimentation, innovation, and rapid iteration.  It reduces the cost of failure while also minimizing the risk of failure.  It is a better way to build software.

For full post visit the CACM post.