Reverberations

News, Views, Rants and Raves About Technology and More

Archive for the ‘Search’ Category

Internet Explorer’s Trailing Comma Woes

with 8 comments

Internet Explorer is notorious for breaking on trailing commas in JavaScript array declaration. e.g.

var obj = {
     a: 1,
      b: 2,
};

fails on IE, while all other browsers just ignore the innocuous trailing comma after second element.

Weeding out these commas from JavaScript code is absolute PITA. However, here is a regular expression search string I wrote to search such instances in the code.

,\s*\n+\s*[\}\)\]]

Even better,
,\s*\n+(\s*\/\/.*\n)*\s*[\}\)\]]
matches multiple new lines and comments.

Advertisements

Written by Brajesh

March 18, 2008 at 10:31 pm

Robots-Nocontent for Page Sections

leave a comment »

From my relatively little but significant “web-crawling” experience, one of the major problems is to scavenge meaningful content from the page- which requires that no navigation crap,  menus,  javascript and adverts should be indexed. Since there is no standard way web-devs design navigation, menus etc. it is impossible to code a parser that works 100% and is a big PITA.
However this piece on Yahoo! Search Blog is welcome news

webmasters can now mark parts of a page with a ‘robots-nocontent’ tag which will indicate to our crawler what parts of a page are unrelated to the main content and are only useful for visitors.

If the trend catches on, and becomes a standard (has to get Google’s support), it would be greatly helpful.

Written by Brajesh

May 3, 2007 at 9:47 am

Posted in Coding, Search, Trends, Yahoo!

Entity Resolution

with one comment

$50,000 for million dollar problem 🙂

Written by Brajesh

April 22, 2007 at 4:29 pm

Posted in Coding, Money, Search

Inline Seach: Cool IE Add-On

leave a comment »

I hate modal “Find” of IE, which MS chose not to change even in IE7. So here is one cool add-on by IEForge for Firefox-like inline search functionality in IE.

Written by Brajesh

February 23, 2007 at 4:19 pm

Posted in Firefox, Microsoft, Search

Google ditches its SOAP API

leave a comment »

I noticed this on the very next day, when I was looking for some documentation on proxy auto configuration scripts support in the SOAP API. The replacement AJAX api not only has limited application (website only), but also it promises to show google ads beside the results.

Not that Google Search API has ever been very stable – it works almost only 80% of the time, so one has to pray and hope that it works with every call. Now even the support has been dropped and usage samples along with FAQ have been removed.

Damn! not good at all.

Written by Brajesh

December 19, 2006 at 1:37 pm

Posted in AJAX, Coding, Google, Search, WTF

I’m out of Google Sandbox

with 2 comments

Okay, I’m out of Google Sandbox :-), I’ve a PageRank now. It took almost two months.

Written by Brajesh

April 5, 2006 at 9:20 am

Posted in Google, Personal, Search

My Little Experiments with Google Search

leave a comment »

I’ve been doing little experiments with Google lately. This blog is up for little more than two weeks now, since I imported all posts from my old blog. That old blog used be #1 result for ‘Reverberations‘ ( #2 as of now) and ‘brajesh‘ ( eh! I do my share of egosurfing) on Google, probably because of my slashdot backlinks, though this blog has none yet. yahoo! has been less benevolent to me, but MSN been pretty favorable.

I acted link-conscious by linking this blog with ‘reverberations’ from the blogger and elsewhere. For the first week this blog was on the first page* in the results for ‘reverberations’ as well as ‘brajesh’ (*my default preference is 100 search results per page). But, by the second week, this blog surprisingly disappeared from first page. What’s happening here? I guess, it’s to do with some google algorithm for unnatural linking and/or Google Sandbox.

There are some other experiments I did with search engines, e.g. deliberate spelling mistakes, cross linking. I’ll write about them sometime later perhaps.

Links are the new (or not so new) currency of the Web. That is why Mike Arrington at TechCrunch can ask for contributions in exchange for linking back.

By the way, it looks like that there has been some major updates in Google PageRanks recently.

Written by Brajesh

February 19, 2006 at 5:16 pm

Posted in Google, Rant, Search